Pull requests: vllm-project/vllm-ascend
Pull requests list
Register qwen2_vl to rewrite qwen2_vl forward
module:core
#241
opened Mar 4, 2025 by
zouyida2002
[core] Support custom ascendc kernels in vllm-ascend [draft]
module:core
#233
opened Mar 4, 2025 by
ganyi1996ppo
[CI]Make UT cases in test_comm_ops.py compatible on Ascend NPU
module:core
#220
opened Mar 3, 2025 by
wwfu109
[BugFix]add int8 cache dtype when using attention quantization
module:core
module:ops
#128
opened Feb 21, 2025 by
Angazenn
[WIP][Feature] Implement native fused MoE layer
module:ops
module:tests
#121
opened Feb 20, 2025 by
yiz-liu