vllm-project / vllm Public

Notifications You must be signed in to change notification settings
Fork 5.9k
Star 39.5k

Code
Issues 1.3k
Pull requests 441
Discussions
Actions
Projects 3
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: vllm-project/vllm

Labels 59 Milestones 0

New pull request New

441 Open 6,194 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[WIP][Core] Support tensor parallelism with uneven heads needs-rebase

#13934 opened Feb 26, 2025 by yixqiao • Draft

[V1] [Spec Decode] Support random sampling in Rejection Sampler v1

#13933 opened Feb 26, 2025 by LiuXiaoxuanPKU • Draft

Add test for deep gemm matmul

#13932 opened Feb 26, 2025 by bnellnm • Draft

[V1] EP + DP Attention WIP

#13931 opened Feb 26, 2025 by tlrmchlsmth • Draft

Update LMFE version to v0.10.11 to support new versions of transforme… ci/build

#13930 opened Feb 26, 2025 by noamgat

Loading…

[misc] Rename Ray ADAG to Compiled Graph ready

ONLY add when PR is ready to merge/full CI is needed

#13928 opened Feb 26, 2025 by ruisearch42

Loading…

Add RELEASE.md documentation

Improvements or additions to documentation

#13926 opened Feb 26, 2025 by atalman

Loading…

Add workaround for shared field_names in pydantic model class frontend

#13925 opened Feb 26, 2025 by maxdebayser

Loading…

[V1] AsyncLLM data parallel WIP v1

#13923 opened Feb 26, 2025 by njhill • Draft

[ROCm][V1] Update reshape_and_cache to properly work with CUDA graph padding ready

ONLY add when PR is ready to merge/full CI is needed

#13922 opened Feb 26, 2025 by SageMoore

Loading…

[Build] Make sure local main branch is synced when VLLM_USE_PRECOMPILED=1 ci/build

#13921 opened Feb 26, 2025 by comaniac

Loading…

[Misc] Add JSON format logging support with loguru ci/build

#13920 opened Feb 26, 2025 by b8zhong

Loading…

[Kernel] Add more tuned configs for L20, MI325, H20, MI300X, H200, etc

#13919 opened Feb 26, 2025 by simon-mo

Loading…

Add benchmark for DeepGEMM and vLLM Block FP8 Dense GEMM

#13917 opened Feb 26, 2025 by mgoin

Loading…

Fix test_block_fp8.py test for MoE

#13915 opened Feb 26, 2025 by mgoin

Loading…

[Bugfix] Check that number of images matches number of <|image|> tokens with mllama

#13911 opened Feb 26, 2025 by tjohnson31415

Loading…

[Feat][whisper] add more sampling parameters to whisper endpoint frontend

#13910 opened Feb 26, 2025 by joennlae

Loading…

Upgrade transformers to v4.49.0 ci/build documentation

Improvements or additions to documentation

ready

ONLY add when PR is ready to merge/full CI is needed

#13905 opened Feb 26, 2025 by hmellor

Loading…

[BugFix] Fix an Overflow Problem for Some Triton Fused MoE Configurations with large BLOCK_SIZE

#13901 opened Feb 26, 2025 by Concurrensee

Loading…

Fix TPU CI ci/build

#13898 opened Feb 26, 2025 by mgoin

Loading…

Fix mla prefill context performance

#13897 opened Feb 26, 2025 by ZhongYingMatrix

Loading…

XGRAMMAR now support aarch64 structured-output

#13894 opened Feb 26, 2025 by johnnynunez

Loading…

Use smaller embedding model when not testing model specifically ready

ONLY add when PR is ready to merge/full CI is needed

#13891 opened Feb 26, 2025 by hmellor

Loading…

[Misc][V1] Enhance performance of KVCacheManager._get_cached_block v1

#13878 opened Feb 26, 2025 by imkero

Loading…

[PP] Correct cache size check

#13873 opened Feb 26, 2025 by zhengy001

Loading…

Previous 1 2 3 4 5 … 17 18 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly