Heterogeneous Management
Manage and schedule GPU, NPU, MLU, and other accelerators in one workflow.
HAMi is an open-source, cloud-native GPU virtualization middleware that brings sharing, isolation and scheduling of heterogeneous accelerators to AI workloads on Kubernetes.


Virtualization • Sharing • Isolation • Scheduling
HAMi is a Sandbox project of the Cloud Native Computing Foundation (CNCF), listed in both the CNCF Landscape and the CNAI Landscape.
Manage and schedule GPU, NPU, MLU, and other accelerators in one workflow.
Slice memory and compute precisely with hard isolation at runtime.
Use binpack, spread, and topology-aware policies for better placement.
Work with Kubernetes APIs, DRA, and CDI for easier adoption.
Control memory and core quotas for fairer and more stable sharing.
Provide consistent metrics and visibility across device vendors.
From request to isolation, HAMi turns GPU slicing and heterogeneous scheduling into usable Kubernetes runtime paths.
Compare traditional whole-GPU allocation (nvidia.com/gpu) with HAMi GPU sharing (nvidia.com/gpu plus gpumem/gpucores) under the same workloads.
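As a concrete sketch of the comparison above, a pod can request a slice of a GPU instead of a whole device by adding HAMi's extended resources alongside nvidia.com/gpu. The exact resource key names and units (MiB for memory, percent for cores) depend on the HAMi version and device-plugin configuration deployed in your cluster; the pod and image names below are illustrative only — see the docs for the authoritative reference.

```yaml
# Hypothetical example: request 1 GPU limited to 4 GiB of device
# memory and roughly 30% of compute, rather than the whole card.
apiVersion: v1
kind: Pod
metadata:
  name: gpu-shared-demo          # example name, not from the docs
spec:
  containers:
    - name: cuda-workload
      image: nvidia/cuda:12.4.0-base-ubuntu22.04
      resources:
        limits:
          nvidia.com/gpu: 1      # number of (virtual) GPUs
          nvidia.com/gpumem: 4096   # device-memory slice, MiB
          nvidia.com/gpucores: 30   # share of GPU compute, percent
```

With whole-GPU allocation, only the nvidia.com/gpu line would appear and the pod would monopolize the device; the two extra limits are what let several pods share one card with hard runtime isolation.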
Broad accelerator ecosystem across vendors. See docs for full support matrix.
View full supported devices list →

The organizations below are evaluating or using HAMi in production environments.
Submit your organization through the contributor guide process.
See submission instructions →

HAMi is advanced by contributors from the community and industry. These organizations actively participate in project development and ecosystem collaboration.
A live snapshot of HAMi community growth and open-source momentum.