You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
使用quay.io/ascend/vllm-ascend:v0.7.1rc1镜像,ray版本2.43.0,环境搭建应该没问题,已经可以四机跑通如Qwen2.5-72B-Instruct等权重,已解决跑Deepseek_v3时报错Torch not compiled with CUDA enabled,出现新报错,报错信息如下:
ERROR 02-28 09:16:29 worker_base.py:572] Error executing method 'determine_num_available_blocks'. This might cause deadlock in distributed execution.
ERROR 02-28 09:16:29 worker_base.py:572] Traceback (most recent call last):
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/vllm/worker/worker_base.py", line 564, in execute_method
ERROR 02-28 09:16:29 worker_base.py:572] return run_method(target, method, args, kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/vllm/utils.py", line 2208, in run_method
ERROR 02-28 09:16:29 worker_base.py:572] return func(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 116, in decorate_context
ERROR 02-28 09:16:29 worker_base.py:572] return func(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/vllm_ascend/worker.py", line 226, in determine_num_available_blocks
ERROR 02-28 09:16:29 worker_base.py:572] self.model_runner.profile_run()
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 116, in decorate_context
ERROR 02-28 09:16:29 worker_base.py:572] return func(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/vllm_ascend/model_runner.py", line 1357, in profile_run
ERROR 02-28 09:16:29 worker_base.py:572] self.execute_model(model_input, kv_caches, intermediate_tensors)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 116, in decorate_context
ERROR 02-28 09:16:29 worker_base.py:572] return func(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/vllm_ascend/model_runner.py", line 1139, in execute_model
ERROR 02-28 09:16:29 worker_base.py:572] hidden_or_intermediate_states = model_executable(
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1736, in _wrapped_call_impl
ERROR 02-28 09:16:29 worker_base.py:572] return self._call_impl(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1747, in _call_impl
ERROR 02-28 09:16:29 worker_base.py:572] return forward_call(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/vllm/model_executor/models/deepseek_v3.py", line 682, in forward
ERROR 02-28 09:16:29 worker_base.py:572] hidden_states = self.model(input_ids, positions, kv_caches,
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1736, in _wrapped_call_impl
ERROR 02-28 09:16:29 worker_base.py:572] return self._call_impl(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1747, in _call_impl
ERROR 02-28 09:16:29 worker_base.py:572] return forward_call(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/vllm/model_executor/models/deepseek_v3.py", line 638, in forward
ERROR 02-28 09:16:29 worker_base.py:572] hidden_states, residual = layer(positions, hidden_states,
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1736, in _wrapped_call_impl
ERROR 02-28 09:16:29 worker_base.py:572] return self._call_impl(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1747, in _call_impl
ERROR 02-28 09:16:29 worker_base.py:572] return forward_call(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/vllm/model_executor/models/deepseek_v3.py", line 565, in forward
ERROR 02-28 09:16:29 worker_base.py:572] hidden_states = self.mlp(hidden_states)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1736, in _wrapped_call_impl
ERROR 02-28 09:16:29 worker_base.py:572] return self._call_impl(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1747, in _call_impl
ERROR 02-28 09:16:29 worker_base.py:572] return forward_call(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/vllm/model_executor/models/deepseek_v3.py", line 158, in forward
ERROR 02-28 09:16:29 worker_base.py:572] final_hidden_states = self.experts(
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1736, in _wrapped_call_impl
ERROR 02-28 09:16:29 worker_base.py:572] return self._call_impl(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1747, in _call_impl
ERROR 02-28 09:16:29 worker_base.py:572] return forward_call(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/vllm/model_executor/layers/fused_moe/layer.py", line 584, in forward
ERROR 02-28 09:16:29 worker_base.py:572] final_hidden_states = self.quant_method.apply(
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/vllm/model_executor/layers/fused_moe/layer.py", line 118, in apply
ERROR 02-28 09:16:29 worker_base.py:572] return self.forward(x=x,
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/vllm/model_executor/custom_op.py", line 23, in forward
ERROR 02-28 09:16:29 worker_base.py:572] return self._forward_method(*args, **kwargs)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/vllm_ascend/ops/fused_moe.py", line 152, in forward_oot
ERROR 02-28 09:16:29 worker_base.py:572] topk_weights, topk_ids = group_topk(
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/vllm_ascend/ops/fused_moe.py", line 49, in group_topk
ERROR 02-28 09:16:29 worker_base.py:572] torch_npu.npu_group_topk(input=scores, out=scores, group_num=num_expert_group, k=topk_group)
ERROR 02-28 09:16:29 worker_base.py:572] File "/usr/local/python3.10/lib/python3.10/site-packages/torch/_ops.py", line 1116, in __call__
ERROR 02-28 09:16:29 worker_base.py:572] return self._op(*args, **(kwargs or {}))
ERROR 02-28 09:16:29 worker_base.py:572] RuntimeError: GroupTopkOperation CreateOperation failed!
I want to run inference of a [specific model](put link here). I don't know how to integrate it with vllm.
The text was updated successfully, but these errors were encountered:
myliangchengyu
changed the title
[Usage]: 使用0.7.1rc1四机推理Deepseek-V3报错,Torch not compiled with CUDA enabled
[Usage]: 使用0.7.1rc1四机推理Deepseek-V3报错,RuntimeError: GroupTopkOperation CreateOperation failed!
Feb 28, 2025
Your current environment
使用quay.io/ascend/vllm-ascend:v0.7.1rc1镜像,ray版本2.43.0,环境搭建应该没问题,已经可以四机跑通如Qwen2.5-72B-Instruct等权重,已解决跑Deepseek_v3时报错Torch not compiled with CUDA enabled,出现新报错,报错信息如下:
config文件为:
在ray head节点四机的启动任务的命令为:
How would you like to use vllm on ascend
I want to run inference of a [specific model](put link here). I don't know how to integrate it with vllm.
The text was updated successfully, but these errors were encountered: