[RFC] A kv-cache aware & responsive pod autoscaler for GenAI Workloads #24
Labels
area/autoscaling
kind/enhancement
New feature or request
priority/critical-urgent
Highest priority. Must be actively worked on as someone's top priority right now.
Milestone
place holder to customized autoscaler