Enable multi-GPU inference in vLLM with tensor parallelism #105

Merged
ApostaC merged 2 commits into vllm-project:main from YuhanLiu11:tp-shm on Feb 11, 2025

Commits on Feb 11, 2025