
[Doc] Add vllm-ascend usage doc & fix doc format #53

Merged
2 commits merged on Feb 17, 2025

Conversation

shen-shanshan
Collaborator

@shen-shanshan shen-shanshan commented Feb 12, 2025

What this PR does / why we need it?

  1. Add a vllm-ascend tutorial doc for serving the Qwen/Qwen2.5-7B-Instruct model.
  2. Fix the format of files in the docs dir, e.g. format tables, add underlines for links, add line feeds, etc.

Does this PR introduce any user-facing change?

no.

How was this patch tested?

no.

@shen-shanshan shen-shanshan marked this pull request as draft February 12, 2025 09:04
@shen-shanshan
Collaborator Author

cc:

@Yikun @wangxiyuan @MengqingCao

@wangxiyuan
Collaborator

No need to update the installation and quick start docs. They will be updated in a new PR.

@shen-shanshan
Collaborator Author

> No need to update installation and quick start doc. They will be updated in new PR.

ok.

Signed-off-by: Shanshan Shen <[email protected]>
Collaborator
@Yikun left a comment

Overall, it has been greatly improved compared to the previous version, thank you!

```bash
# Use Modelscope mirror to speed up model download
export VLLM_USE_MODELSCOPE=True
export MODELSCOPE_CACHE=/root/models/
```

Collaborator

Suggested change: remove `export MODELSCOPE_CACHE=/root/models/`

You can use the default cache: `-v /root/.cache:/root/.cache`

```bash
-v /usr/local/Ascend/driver/lib64/:/usr/local/Ascend/driver/lib64/ \
-v /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \
-v /etc/ascend_install.info:/etc/ascend_install.info \
-v /root/models:/root/models \
```

Collaborator

Suggested change: replace `-v /root/models:/root/models \` with `-v /root/.cache:/root/.cache \`

```bash
-v /usr/local/Ascend/driver/lib64/:/usr/local/Ascend/driver/lib64/ \
-v /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \
-v /etc/ascend_install.info:/etc/ascend_install.info \
-v /root/models:/root/models \
```

Collaborator

Suggested change: replace `-v /root/models:/root/models \` with `-v /root/.cache:/root/.cache \`

```bash
-v /root/models:/root/models \
-p 8000:8000 \
-e VLLM_USE_MODELSCOPE=True \
-e MODELSCOPE_CACHE=/root/models/ \
```

Collaborator

Suggested change: remove `-e MODELSCOPE_CACHE=/root/models/ \`

```bash
-v /usr/local/Ascend/driver/lib64/:/usr/local/Ascend/driver/lib64/ \
-v /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \
-v /etc/ascend_install.info:/etc/ascend_install.info \
-v /root/models:/root/models \
```

Collaborator

Suggested change: replace `-v /root/models:/root/models \` with `-v /root/.cache:/root/.cache \`

```bash
# Use Modelscope mirror to speed up model download
export VLLM_USE_MODELSCOPE=True
export MODELSCOPE_CACHE=/root/models/
```

Collaborator

Suggested change: remove `export MODELSCOPE_CACHE=/root/models/`
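The intent of this variable can be illustrated with a small, hypothetical helper (`use_modelscope` is my name, not vLLM's API): it mirrors the documented behavior that a truthy `VLLM_USE_MODELSCOPE` switches model downloads from the Hugging Face Hub to the ModelScope mirror.

```python
import os

def use_modelscope(environ=None):
    # Hypothetical helper, not vLLM's actual code: treat the env var as a
    # boolean flag, defaulting to False when unset.
    environ = os.environ if environ is None else environ
    return environ.get("VLLM_USE_MODELSCOPE", "False").lower() == "true"

# Unset -> downloads come from the Hugging Face Hub.
print(use_modelscope({}))                               # False
# Set as in the doc -> downloads go through the ModelScope mirror.
print(use_modelscope({"VLLM_USE_MODELSCOPE": "True"}))  # True
```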

Comment on lines 160 to 164

```python
def clean_up():
    destroy_model_parallel()
    destroy_distributed_environment()
    gc.collect()
    torch.npu.empty_cache()
```

Collaborator

This looks a little bit weird, would you mind taking a look? @wangxiyuan

Collaborator

Since this is only a simple example, there is no need to do:

```python
del llm
clean_up()
```

Contributor

When we use `mp` as the `distributed_executor_backend`, the cleanup must be done by hand; otherwise an error is raised when the process exits. This is a bug in vLLM.
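The manual teardown discussed above can be sketched as follows. The `vllm.distributed.parallel_state` import path and the `torch.npu.empty_cache()` call come from the snippet under review; the broad exception guards are added only so this hedged sketch also runs where vLLM, or an Ascend build of torch, is not installed.

```python
import gc

def clean_up():
    """Tear down engine state by hand; with the "mp"
    distributed_executor_backend, vLLM does not do this on exit."""
    try:
        # Import path taken from the tutorial snippet; guarded so the
        # sketch runs even without vLLM or an initialized engine.
        from vllm.distributed.parallel_state import (
            destroy_model_parallel,
            destroy_distributed_environment,
        )
        destroy_model_parallel()
        destroy_distributed_environment()
    except Exception:
        pass
    gc.collect()  # drop lingering Python references to engine objects
    try:
        import torch
        torch.npu.empty_cache()  # release cached NPU memory on Ascend
    except (ImportError, AttributeError):
        pass

# Usage after inference: drop the engine object first, then clean up.
# del llm
# clean_up()
```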

Signed-off-by: Shanshan Shen <[email protected]>
@Yikun
Collaborator

Yikun commented Feb 17, 2025

After this PR is merged, please also backport it to the v0.7.1 branch.

@wangxiyuan wangxiyuan merged commit 2a67814 into vllm-project:main Feb 17, 2025
3 checks passed

4 participants