[Doc]Add benchmark scripts #74

Potabk · 2025-02-17T09:01:42Z

What this PR does / why we need it?

The purpose of this PR is to add benchmark scripts for npu, developers can easily run performance tests on their own machines with one line of code .

Does this PR introduce any user-facing change?

How was this patch tested?

Signed-off-by: wangli <[email protected]>

Yikun

emm, only review on tutorials.md and bechmark_latency.py.

The problem is that should we copy vllm benchmark here or just use it?

Yikun · 2025-02-27T15:57:47Z

docs/source/tutorials.md

@@ -308,4 +308,54 @@ Logs of the vllm server:
 ```
 INFO:     127.0.0.1:59384 - "POST /v1/completions HTTP/1.1" 200 OK
 INFO 02-19 17:37:35 metrics.py:453] Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 1.9 tokens/s, Running: 0 reqs, Swapped: 0 reqs, Pending: 0 reqs, GPU KV cache usage: 0.0%, CPU KV cache usage: 0.0%.
+```
+
+## Performance Benchmark


This should be developer guide

Yikun · 2025-02-27T16:05:29Z

benchmarks/backend_request_func.py

@@ -0,0 +1,193 @@
+# SPDX-License-Identifier: Apache-2.0


# # Copyright (c) 2025 Huawei Technologies Co., Ltd. All Rights Reserved. # This file is a part of the vllm-ascend project. # Adapted from vllm-project/vllm/benchmarks/backend_request_func.py # Copyright 2023 The vLLM team. # # Licensed under the Apache License, Version 2.0 (the "License"); # you may not use this file except in compliance with the License. # You may obtain a copy of the License at # # http://www.apache.org/licenses/LICENSE-2.0 # # Unless required by applicable law or agreed to in writing, software # distributed under the License is distributed on an "AS IS" BASIS, # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. # See the License for the specific language governing permissions and # limitations under the License. #

Yikun · 2025-02-27T16:06:36Z

benchmarks/backend_request_func.py

+            **kwargs,
+        )
+
+ASYNC_REQUEST_FUNCS = {


Pls add note for the different with vLLM.

Yikun · 2025-02-27T16:06:49Z

benchmarks/benchmark_latency.py

@@ -0,0 +1,152 @@
+# SPDX-License-Identifier: Apache-2.0


Potabk changed the title ~~[Misc]dd benchmark scripts~~ [Misc]Add benchmark scripts Feb 17, 2025

Potabk changed the title ~~[Misc]Add benchmark scripts~~ [Misc][WIP]Add benchmark scripts Feb 17, 2025

Potabk force-pushed the benchmarks branch from da35d59 to 759243e Compare February 26, 2025 07:39

add benchmark doc and scripts

1493751

Signed-off-by: wangli <[email protected]>

Potabk force-pushed the benchmarks branch from 759243e to 1493751 Compare February 26, 2025 07:40

Potabk changed the title ~~[Misc][WIP]Add benchmark scripts~~ [Doc]Add benchmark scripts Feb 26, 2025

Potabk added 2 commits February 26, 2025 15:54

fix isort

fcbcf88

Signed-off-by: wangli <[email protected]>

fix ci

7ca9d2f

Signed-off-by: wangli <[email protected]>

Potabk force-pushed the benchmarks branch from 76e7ee3 to 7ca9d2f Compare February 26, 2025 08:32

Potabk added 3 commits February 26, 2025 16:36

fix shellcheck

5b1d939

Signed-off-by: wangli <[email protected]>

fix doc

eac3514

Signed-off-by: wangli <[email protected]>

fix doc

4d13e93

Signed-off-by: wangli <[email protected]>

Yikun requested changes Feb 27, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Doc]Add benchmark scripts #74

[Doc]Add benchmark scripts #74

Potabk commented Feb 17, 2025 •

edited

Loading

Yikun left a comment

Yikun Feb 27, 2025

Yikun Feb 27, 2025

Yikun Feb 27, 2025

Yikun Feb 27, 2025

[Doc]Add benchmark scripts #74

Are you sure you want to change the base?

[Doc]Add benchmark scripts #74

Conversation

Potabk commented Feb 17, 2025 • edited Loading

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Yikun left a comment

Choose a reason for hiding this comment

Yikun Feb 27, 2025

Choose a reason for hiding this comment

Yikun Feb 27, 2025

Choose a reason for hiding this comment

Yikun Feb 27, 2025

Choose a reason for hiding this comment

Yikun Feb 27, 2025

Choose a reason for hiding this comment

Potabk commented Feb 17, 2025 •

edited

Loading