Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Length penalty draft #10

Draft
wants to merge 64 commits into
base: release
Choose a base branch
from
Draft

Length penalty draft #10

wants to merge 64 commits into from

Conversation

maxdebayser
Copy link
Contributor

This is my first go at implementing the length penalty in vLLM, just to get some feedback. But from our meeting earlier Nick hinted that there might be another way to implement this in our TGIS wrapper, without going deep in the vLLM implementation, so I'll try that approach too.

The relevant change in this draft are in sampler.py

I'm basing this PR on Joe's request-validation branch because it builds on some of the new features there, but I'm also pulling in some recent changes from the release branch here.

njhill and others added 30 commits March 20, 2024 14:51
This adds CI to build and push our UBI-based docker image to quay.
It also removes all of the CI for building and packaging the vllm library.

---------

Signed-off-by: Joe Runde <[email protected]>
Co-authored-by: Travis Johnson <[email protected]>
Co-authored-by: rsnm2 <[email protected]>
Co-authored-by: Luka <luka@paperspace>
And update UBI base image, grpcio-tools and accelerate dep versions.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.