v2.9.0

anandj91
archlitchi
ashergaga
Atroxgod
CFH2436
charford
CoderTH
dongjiang1989
DSFans2014
FouoF

Major features

  • Add HAMi-core mode for Ascend devices, enabling user-space virtualization for fine-grained memory and compute sharing.
  • Optimize HAMi-core performance and add the latest benchmark data for HAMi-core.
  • HAMi-DRA for NVIDIA is ready for use.
  • Sync Volcano vGPU Device Plugin with version 0.19 and add CDI support.
  • Add HAMi skills for debugging and development workflows.
  • Support module-pair allocation for Ascend 910C devices in SuperPod environments by (@ashergaga) in #1610
  • Add support for Vast.ai devices by (@DSFans2014) in #1645
  • Add Ascend ResourceCoreName and Ascendxxx-core resources to support hami-vnpu-core virtualization by (@ashergaga) and (@DSFans2014) in #1771 and #1804
  • Support node filtering based on hami-vnpu-core annotations and multi-device requests with hami-vnpu-core enabled by (@ashergaga) in #1812 and #1837

Major bug fixes

  • Fix initialization errors when using tensor parallelism on vLLM versions greater than 0.18.
  • Fix schedulerName precedence checks by (@hoteye) in #1627
  • Add nil checks to prevent leader election panics by (@haitwang-cloud) in #1603
  • Fix panic on nil resource requests in scheduler scoring by (@yxxhero) in #1626
  • Fix reversed binpack and spread scheduling policies for Iluvatar devices by (@qiangwei1983) in #1631
  • Resolve label cardinality explosion in the Device_memory_desc_of_container metric by (@maishivamhoo123) in #1628
  • Handle GetMemoryInfo ERROR_NOT_SUPPORTED for unified memory GPUs by (@jsl9208) in #1637
  • Optimize nodelock scalability with exponential backoff and listers by (@maishivamhoo123) in #1663
  • Fix readiness probes when replicas are greater than one by (@Shouren) in #1677
  • Fix scheduler slot usage prediction and device type filtering by (@maishivamhoo123) in #1700
  • Retain terminating pods in cache to prevent premature eviction by (@maishivamhoo123) in #1719
  • Fix multi-container allocation when init containers are present by (@haitwang-cloud) in #1650
  • Align kubelet allocation with scheduler annotations by (@xrwang8) in #1743
  • Handle Linux kernel 6.17 handshake edge cases in NVIDIA health checks by (@maishivamhoo123) in #1810
  • Fix MIG allocation failures in CDI mode by (@DSFans2014) in #1826

What's changed

Full Changelog: https://github.com/Project-HAMi/HAMi/compare/v2.8.0...v2.9.0

HAMi is a CNCF Sandbox project