Use vllm metrics for routing #274
Conversation
```go
	return metricValue, nil
}

func parseMetricFromBody(body []byte, metricName string) (float64, error) {
```
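The diff only shows the signature of `parseMetricFromBody`. For context, here is a minimal sketch of what a function with this signature might do: scan a Prometheus text-format metrics body (as served by vLLM's `/metrics` endpoint) for the first sample matching `metricName` and return its value. This is an assumption for illustration; the PR's actual implementation may handle labels and errors differently.

```go
package main

import (
	"bufio"
	"fmt"
	"strconv"
	"strings"
)

// parseMetricFromBody scans a Prometheus text-format body for the first
// sample whose name equals metricName and returns its float value.
// Sketch only: label filtering and timestamp handling are simplified.
func parseMetricFromBody(body []byte, metricName string) (float64, error) {
	sc := bufio.NewScanner(strings.NewReader(string(body)))
	for sc.Scan() {
		line := strings.TrimSpace(sc.Text())
		if line == "" || strings.HasPrefix(line, "#") {
			continue // skip blank lines and HELP/TYPE comments
		}
		// A sample line looks like: name{labels} value [timestamp]
		name := line
		if i := strings.IndexAny(line, "{ "); i >= 0 {
			name = line[:i]
		}
		if name != metricName {
			continue
		}
		// The value is the first field after the name (and labels, if any).
		rest := line
		if i := strings.Index(line, "}"); i >= 0 {
			rest = line[i+1:]
		} else {
			rest = strings.TrimPrefix(line, name)
		}
		fields := strings.Fields(rest)
		if len(fields) == 0 {
			return 0, fmt.Errorf("malformed sample line: %q", line)
		}
		return strconv.ParseFloat(fields[0], 64)
	}
	return 0, fmt.Errorf("metric %q not found", metricName)
}

func main() {
	body := []byte("# HELP vllm:num_requests_running running requests\n" +
		"vllm:num_requests_running{model=\"m\"} 3\n")
	v, err := parseMetricFromBody(body, "vllm:num_requests_running")
	fmt.Println(v, err)
}
```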
The autoscaler has similar functionality. Let's refactor this part later and make sure the cache and the autoscaler fetcher can use the same library.
Agreed. Let me discuss this with him.
PR looks good to me.
Updated the PR with a cache that pulls metrics once for each pod.
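The "pull metrics once for each pod" idea can be sketched as a small per-pod cache: one refresh scrapes a pod's metrics and stores them, and all subsequent routing lookups read the cached values instead of re-fetching. The type and method names below (`podMetricsCache`, `refreshPod`, `get`) are hypothetical and chosen for illustration, not taken from the PR.

```go
package main

import (
	"fmt"
	"sync"
)

// podMetricsCache stores the most recently scraped metric values per pod,
// so routing decisions read from memory rather than hitting /metrics
// on every request. Sketch only; the PR's cache may be structured differently.
type podMetricsCache struct {
	mu      sync.RWMutex
	metrics map[string]map[string]float64 // pod name -> metric name -> value
}

func newPodMetricsCache() *podMetricsCache {
	return &podMetricsCache{metrics: make(map[string]map[string]float64)}
}

// refreshPod replaces the cached values for one pod with freshly
// scraped ones; it is called once per pod per refresh cycle.
func (c *podMetricsCache) refreshPod(pod string, values map[string]float64) {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.metrics[pod] = values
}

// get returns a cached metric value for a pod without re-fetching.
func (c *podMetricsCache) get(pod, metric string) (float64, bool) {
	c.mu.RLock()
	defer c.mu.RUnlock()
	v, ok := c.metrics[pod][metric]
	return v, ok
}

func main() {
	c := newPodMetricsCache()
	c.refreshPod("pod-a", map[string]float64{"vllm:num_requests_running": 2})
	v, ok := c.get("pod-a", "vllm:num_requests_running")
	fmt.Println(v, ok)
}
```

The `RWMutex` lets many concurrent routing lookups proceed in parallel while a refresh takes the write lock only briefly to swap in a pod's new value map.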
* Cache bug fix in update pod and model mapping (#259)
* test
* Use vllm metrics for routing
* nit reverts
* update log level
* refactor cache to fetch metrics once
* remove port from random routing
No description provided.