-
-
Notifications
You must be signed in to change notification settings - Fork 6.1k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Bugfix] Fix validation of
logprobs
in ChatCompletionRequest
frontend
#14352
opened Mar 6, 2025 by
schoennenbeck
Loading…
[Misc][Docs] fix the comments of KV_T and CACHE_T in CALL_RESHAPE_AND_CACHE_XX macros
#14347
opened Mar 6, 2025 by
yangsijia-serena
Loading…
[Misc] Add Phi4-MM example
documentation
Improvements or additions to documentation
#14343
opened Mar 6, 2025 by
jeejeelee
Loading…
Use the optimized block sizes after tuning the kernel.
v1
#14329
opened Mar 6, 2025 by
vanbasten23
•
Draft
[Misc] Ensure out-of-tree quantization method recognize by cli args
#14328
opened Mar 6, 2025 by
liuyanyi
Loading…
[Bugfix][Core] fix abort_seq_group and memory leak when n>1
#14326
opened Mar 6, 2025 by
courage17340
Loading…
Revert "[torch.compile] Fix RMSNorm + quant fusion in the non-cutlass…
#14317
opened Mar 5, 2025 by
tlrmchlsmth
Loading…
[ROCm] Enable chunked prefill/paged attention in MLA on ROCm
#14316
opened Mar 5, 2025 by
SageMoore
Loading…
[Hardware][TPU]Enable ragged paged attention kernel and resolve recompilation issue
v1
#14310
opened Mar 5, 2025 by
yaochengji
•
Draft
[V1] TPU - Remove self.kv_caches
documentation
Improvements or additions to documentation
v1
#14309
opened Mar 5, 2025 by
alexm-redhat
Loading…
[Kernel] Add needs_fixed_stride_order tag to most GEMMs
#14306
opened Mar 5, 2025 by
tlrmchlsmth
Loading…
[Distributed] Add enable_expert_parallel arg
documentation
Improvements or additions to documentation
#14305
opened Mar 5, 2025 by
tlrmchlsmth
Loading…
[Docs] Add nsight guide to profiling docs
documentation
Improvements or additions to documentation
#14298
opened Mar 5, 2025 by
mgoin
Loading…
[Model] add colqwen2_vl code & inference
documentation
Improvements or additions to documentation
#14291
opened Mar 5, 2025 by
BloomBerry
Loading…
[MISC] rename interval to max_recent_requests
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#14285
opened Mar 5, 2025 by
andyxning
Loading…
[Misc] Set default value of seed to None
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
#14274
opened Mar 5, 2025 by
SmartManoj
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-03-03.