Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Model] Support Mantis(LLaVA) model ci/build documentation Improvements or additions to documentation
#10711 opened Nov 27, 2024 by DarkLight1337 Draft
[Bugfix][Mamba] Fix Multistep on Mamba-like models ready ONLY add when PR is ready to merge/full CI is needed
#10705 opened Nov 27, 2024 by mzusman Loading…
[Doc] Update model in arch_overview.rst to match comment documentation Improvements or additions to documentation
#10701 opened Nov 27, 2024 by spacewander Loading…
[ci] fix slow tests
#10698 opened Nov 27, 2024 by youkaichao Loading…
[V1] get_computed_blocks avoids recompute
#10695 opened Nov 27, 2024 by Abatom Loading…
[Bugfix] Fix GGUF inference with FP16 unquantized checkpoint ready ONLY add when PR is ready to merge/full CI is needed
#10675 opened Nov 26, 2024 by Isotr0py Loading…
[Doc] Add github links for source code references documentation Improvements or additions to documentation
#10672 opened Nov 26, 2024 by russellb Loading…
[v1][WIP] Metrics & Stats prototype
#10651 opened Nov 26, 2024 by rickyyx Draft
[Core] Integrate Fastsafetensor loader for loading model weights ci/build documentation Improvements or additions to documentation
#10647 opened Nov 26, 2024 by manish-sethi Draft
[fix] Correct num_accepted_tokens counting ready ONLY add when PR is ready to merge/full CI is needed
#10604 opened Nov 24, 2024 by KexinFeng Loading…
[Interleaved ATTN] Support for Mistral-8B
#10591 opened Nov 23, 2024 by patrickvonplaten Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.