-
-
Notifications
You must be signed in to change notification settings - Fork 5.1k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[V1] [2/n] Logging and Metrics -
OutputProcessor
Abstraction
#11973
opened Jan 12, 2025 by
robertgshaw2-neuralmagic
Loading…
[CI][Spec Decode] fix: broken test for EAGLE model
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#11972
opened Jan 12, 2025 by
llsj14
Loading…
[Misc] Support register quantization method out-of-tree
#11969
opened Jan 12, 2025 by
ice-tong
Loading…
[Kernel] Attention.forward with unified_attention when use_direct_call=True
#11967
opened Jan 12, 2025 by
heheda12345
Loading…
[Spec Decode] feat: support LoRA with speculative decoding
#11966
opened Jan 12, 2025 by
llsj14
Loading…
[V1][Core][1/n] Logging and Metrics
ready
ONLY add when PR is ready to merge/full CI is needed
#11962
opened Jan 11, 2025 by
robertgshaw2-neuralmagic
Loading…
[V1] Move more control of kv cache initialization from model_executor to EngineCore
#11960
opened Jan 11, 2025 by
heheda12345
Loading…
[Misc] Fix Deepseek V2 fp8 kv-scale remapping
ready
ONLY add when PR is ready to merge/full CI is needed
#11947
opened Jan 11, 2025 by
Concurrensee
Loading…
[Misc] Add helpers to get pipeline rank & world size
#11946
opened Jan 10, 2025 by
ethnzhng
Loading…
[Ignore] Test
documentation
Improvements or additions to documentation
#11944
opened Jan 10, 2025 by
mgoin
Loading…
Organise installation documentation into categories and tabs
documentation
Improvements or additions to documentation
#11935
opened Jan 10, 2025 by
hmellor
Loading…
[Doc] links Tensorizer example
documentation
Improvements or additions to documentation
#11918
opened Jan 10, 2025 by
guspan-tanadi
Loading…
[Doc] Correct the spelling of GitHub
documentation
Improvements or additions to documentation
#11915
opened Jan 10, 2025 by
Yaminyam
Loading…
[V1] APC + prompt logprobs unsupported (PR 2/N for v1 sample and prompt logprobs support)
#11910
opened Jan 10, 2025 by
afeldman-nm
Loading…
[FP8][Kernel] Dynamic kv cache scaling factors computation
documentation
Improvements or additions to documentation
#11906
opened Jan 9, 2025 by
gshtras
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.