Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEAT] enable experimental semantic cache in router #210

Merged
merged 4 commits into from
Mar 2, 2025

Conversation

rootfs
Copy link
Contributor

@rootfs rootfs commented Mar 2, 2025

This is a follow up of #202 by enabling the semantic cache in router. Please refer to this demo for usage and cache behavior.

@ApostaC @YuhanLiu11 PTAL, thanks

@rootfs rootfs force-pushed the semantic-cache-int branch from 6318962 to 30ba368 Compare March 2, 2025 14:12
@YuhanLiu11 YuhanLiu11 self-requested a review March 2, 2025 17:03
@rootfs rootfs force-pushed the semantic-cache-int branch from bfc66f9 to 15b48a5 Compare March 2, 2025 18:51
@rootfs rootfs force-pushed the semantic-cache-int branch from 15b48a5 to 05137cb Compare March 2, 2025 19:03
Copy link
Collaborator

@YuhanLiu11 YuhanLiu11 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thanks a lot for your contribution 🎉

@YuhanLiu11 YuhanLiu11 merged commit 94b7d44 into vllm-project:main Mar 2, 2025
9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants