InternLM / lmdeploy Public

Notifications You must be signed in to change notification settings
Fork 432
Star 4.7k

Code
Issues 302
Pull requests 30
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: InternLM/lmdeploy

Labels 34 Milestones 0

New pull request New

30 Open 1,198 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Add version restrictions in runtime_ascend.txt to ensure functionality

#2836 opened Nov 30, 2024 by zhabuye

Loading…

[maca] add env to support different mm layout on maca.

#2835 opened Nov 29, 2024 by Reinerzhou

Loading…

[maca] add cudagraph support on maca backend.

#2834 opened Nov 29, 2024 by Reinerzhou

Loading…

Update internvl chat template

#2832 opened Nov 29, 2024 by AllentDan

Loading…

add openssh-server installation in dockerfile

#2830 opened Nov 28, 2024 by lvhan028

Loading…

profile throughput without new threads improvement

#2826 opened Nov 28, 2024 by grimoire

Loading…

better kv allocate improvement

#2814 opened Nov 26, 2024 by grimoire

Loading…

Refactor VLM modules improvement

#2810 opened Nov 25, 2024 by lvhan028

Loading…

Refactor turbomind attention by precomputing cos/sin improvement

#2801 opened Nov 25, 2024 by irexyc

Loading…

[Feature] Support llava onevision enhancement

New feature or request

#2783 opened Nov 21, 2024 by deepindeed2022

Loading…

[ascend]feat: support kv int8 enhancement

New feature or request

#2736 opened Nov 11, 2024 by yao-fengchen

Loading…

support qwen2-vl with turbomind backend enhancement

New feature or request

#2720 opened Nov 6, 2024 by irexyc

Loading…

Run loop.run_until_complete in another thread improvement

#2701 opened Nov 4, 2024 by AllentDan

Loading…

update pre-commit config

#2683 opened Oct 30, 2024 by lvhan028

Loading…

support release pipeline improvement

#2581 opened Oct 11, 2024 by irexyc

Loading…

Torchrun launching multiple api_server

#2402 opened Aug 30, 2024 by AllentDan

Loading…

More w8a8 models

#2373 opened Aug 26, 2024 by AllentDan • Draft

[Feature] Support vision module w8a8 inference improvement

#2308 opened Aug 14, 2024 by AllentDan

Loading…

better formatted table of 'lmdeploy list' improvement WIP

#2289 opened Aug 12, 2024 by lvhan028

Loading…

[Feature] support qqq(w4a8) for lmdeploy

#2274 opened Aug 9, 2024 by HandH1998

Loading…

6 tasks done

[Feature] Support XTuner Lite Llava enhancement

New feature or request

#2191 opened Jul 31, 2024 by pppppM

Loading…

Add prefix cache stats to usage

#2018 opened Jul 13, 2024 by ispobock

Loading…

Add Jetson platform support (by docker)

#1820 opened Jun 21, 2024 by BestAnHongjun

Loading…

support vl benchmark

#1662 opened May 27, 2024 by AllentDan

Loading…

[benchmark] optimize benchmark: counting tokenlizer tokens and error requests

#1607 opened May 17, 2024 by NiuBlibing

Loading…

Previous 1 2 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly