The stack deployment of vLLM
What's Changed
- [Feat] add imagePullSecrets option to helm chart #179 by @kalantar
- [Benchmark] Adding multi-round QA benchmark script #180 @YuhanLiu11
- [Feat]: add support for embeddings, rerank and score endpoints #181 @bufferoverflow
- [CI/Build]: bump python to 3.12 to be inline with vllm #182 @bufferoverflow
- Manually Enable LoRA Adapters using existing Router and vLLM deployment #206 @wangchen615
- [Feat] dynamic configuration support for router #207 @ApostaC
- [Feat] create kubernetes operator to manage dynamic config file #208 @rootfs
- [Document, Feat] basic HPA support and tutorials #209 @ApostaC
- [Feat] enable experimental semantic cache in router #210 @rootfs