Pull requests: vllm-project/vllm
#10711 [Model] Support Mantis(LLaVA) model (labels: ci/build, documentation) - opened Nov 27, 2024 by DarkLight1337 - Draft
#10705 [Bugfix][Mamba] Fix Multistep on Mamba-like models (labels: ready) - opened Nov 27, 2024 by mzusman
#10704 [WIP][CI] Add genai-perf benchmark in nightly benchmark (labels: ci/build) - opened Nov 27, 2024 by jikunshang
#10701 [Doc] Update model in arch_overview.rst to match comment (labels: documentation) - opened Nov 27, 2024 by spacewander
#10682 [Model] Support bitsandbytes quantization with minicpm3 model - opened Nov 27, 2024 by zixuanzhang226
#10676 [3/N] Support and implement merged input processor for LLaVA model - opened Nov 26, 2024 by DarkLight1337
#10675 [Bugfix] Fix GGUF inference with FP16 unquantized checkpoint (labels: ready) - opened Nov 26, 2024 by Isotr0py
#10672 [Doc] Add GitHub links for source code references (labels: documentation) - opened Nov 26, 2024 by russellb
#10647 [Core] Integrate Fastsafetensor loader for loading model weights (labels: ci/build, documentation) - opened Nov 26, 2024 by manish-sethi - Draft
#10640 [V1] VLM - Support running the mm_mapper preprocessor in the frontend process (labels: frontend, needs-rebase) - opened Nov 25, 2024 by alexm-neuralmagic
#10635 [Frontend] Don't block event loop in tokenization (preprocess) in OpenAI-compatible server (labels: frontend) - opened Nov 25, 2024 by tomeras91
#10623 [Misc] Allow LoRA to adaptively increase rank and remove possible_max_ranks - opened Nov 25, 2024 by JinhyunBang
#10608 [Core][Bugfix] Use correct device to initialize GPU data during CUDA-graph capture - opened Nov 24, 2024 by IdoAsraff
#10604 [Fix] Correct num_accepted_tokens counting (labels: ready) - opened Nov 24, 2024 by KexinFeng
#10574 [Kernels][AMD] Add Fused MoE Configs - opened Nov 22, 2024 by robertgshaw2-neuralmagic - Draft
#10565 [Hardware][Intel-Gaudi] Enable LoRA support for Intel Gaudi (HPU) - opened Nov 22, 2024 by SanjuCSudhakaran