Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[RFC] A kv-cache aware & responsive pod autoscaler for GenAI Workloads #24

Closed
Jeffwan opened this issue Jul 13, 2024 · 0 comments · Fixed by #55
Closed

[RFC] A kv-cache aware & responsive pod autoscaler for GenAI Workloads #24

Jeffwan opened this issue Jul 13, 2024 · 0 comments · Fixed by #55
Assignees
Labels
area/autoscaling kind/enhancement New feature or request priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now.
Milestone

Comments

@Jeffwan
Copy link
Collaborator

Jeffwan commented Jul 13, 2024

place holder to customized autoscaler

@Jeffwan Jeffwan added kind/enhancement New feature or request area/autoscaling labels Jul 29, 2024
@Jeffwan Jeffwan added this to the v0.1.0 milestone Jul 29, 2024
@Jeffwan Jeffwan added the priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now. label Jul 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/autoscaling kind/enhancement New feature or request priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now.
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants