vllm-stack-0.1.0

Latest
@github-actions released this 03 Mar 17:31
fecae77

The stack deployment of vLLM

What's Changed

  • [Feat] add imagePullSecrets option to Helm chart #179 by @kalantar
  • [Benchmark] Adding multi-round QA benchmark script #180 by @YuhanLiu11
  • [Feat]: add support for embeddings, rerank and score endpoints #181 by @bufferoverflow
  • [CI/Build]: bump Python to 3.12 to be in line with vLLM #182 by @bufferoverflow
  • Manually Enable LoRA Adapters using existing Router and vLLM deployment #206 by @wangchen615
  • [Feat] dynamic configuration support for router #207 by @ApostaC
  • [Feat] create Kubernetes operator to manage dynamic config file #208 by @rootfs
  • [Document, Feat] basic HPA support and tutorials #209 by @ApostaC
  • [Feat] enable experimental semantic cache in router #210 by @rootfs

New Contributors