Pull requests: vllm-project/vllm
[V1] [Spec Decode] Support random sampling in Rejection Sampler
v1
#13933
opened Feb 26, 2025 by
LiuXiaoxuanPKU
Draft
Update LMFE version to v0.10.11 to support new versions of transforme…
ci/build
#13930
opened Feb 26, 2025 by
noamgat
[misc] Rename Ray ADAG to Compiled Graph
ready
ONLY add when PR is ready to merge/full CI is needed
#13928
opened Feb 26, 2025 by
ruisearch42
Add RELEASE.md
documentation
Improvements or additions to documentation
#13926
opened Feb 26, 2025 by
atalman
Add workaround for shared field_names in pydantic model class
frontend
#13925
opened Feb 26, 2025 by
maxdebayser
[ROCm][V1] Update reshape_and_cache to properly work with CUDA graph padding
ready
#13922
opened Feb 26, 2025 by
SageMoore
[Build] Make sure local main branch is synced when VLLM_USE_PRECOMPILED=1
ci/build
#13921
opened Feb 26, 2025 by
comaniac
[Misc] Add JSON format logging support with loguru
ci/build
#13920
opened Feb 26, 2025 by
b8zhong
[Kernel] Add more tuned configs for L20, MI325, H20, MI300X, H200, etc
#13919
opened Feb 26, 2025 by
simon-mo
Add benchmark for DeepGEMM and vLLM Block FP8 Dense GEMM
#13917
opened Feb 26, 2025 by
mgoin
[Bugfix] Check that number of images matches number of <|image|> tokens with mllama
#13911
opened Feb 26, 2025 by
tjohnson31415
[Feat][whisper] add more sampling parameters to whisper endpoint
frontend
#13910
opened Feb 26, 2025 by
joennlae
Upgrade transformers to v4.49.0
ready
ci/build
documentation
#13905
opened Feb 26, 2025 by
hmellor
[BugFix] Fix an Overflow Problem for Some Triton Fused MoE Configurations with large BLOCK_SIZE
#13901
opened Feb 26, 2025 by
Concurrensee
Use smaller embedding model when not testing model specifically
ready
#13891
opened Feb 26, 2025 by
hmellor
[Misc][V1] Enhance performance of KVCacheManager._get_cached_block
v1
#13878
opened Feb 26, 2025 by
imkero