Lab 6: Deploy vLLM Inference Service with HAMiInstall HAMi on a GPU cluster and schedule vLLM inference services with GPU partitioning.