Skip to content

Releases: InternLM/lmdeploy

LMDeploy Release V0.2.1

19 Jan 10:38
e96e2b4
Compare
Choose a tag to compare

What's Changed

💥 Improvements

🐞 Bug fixes

📚 Documentations

  • add guide about installation on cuda 12+ platform by @lvhan028 in #988

🌐 Other

Full Changelog: v0.2.0...v0.2.1

LMDeploy Release V0.2.0

17 Jan 02:00
b319dce
Compare
Choose a tag to compare

What's Changed

🚀 Features

💥 Improvements

🐞 Bug fixes

📚 Documentations

🌐 Other

New Contributors

Full Changelog: v0.1.0...v0.2.0

LMDeploy Release V0.1.0

18 Dec 12:10
477f2db
Compare
Choose a tag to compare

What's Changed

🚀 Features

💥 Improvements

🐞 Bug fixes

📚 Documentations

🌐 Other

New Contributors

Full Changelog: v0.0.14...v0.1.0

LMDeploy Release V0.1.0a2

06 Dec 06:50
fddad30
Compare
Choose a tag to compare

What's Changed

💥 Improvements

  • Unify prefill & decode passes by @lzhangzz in #775
  • add cuda12.1 build check ci by @irexyc in #782
  • auto upload cuda12.1 python pkg to release when create new tag by @irexyc in #784
  • Report the inference benchmark of models with different size by @lvhan028 in #794
  • Add chat template for Yi by @AllentDan in #779

🐞 Bug fixes

  • Fix early-exit condition in attention kernel by @lzhangzz in #788
  • Fix missed arguments when benchmark static inference performance by @lvhan028 in #787
  • fix extra colon in InternLMChat7B template by @C1rN09 in #796
  • Fix local kv head num by @lvhan028 in #806

📚 Documentations

🌐 Other

New Contributors

Full Changelog: v0.1.0a1...v0.1.0a2

LMDeploy Release V0.1.0a1

29 Nov 13:51
9c46b27
Compare
Choose a tag to compare

What's Changed

💥 Improvements

🐞 Bug fixes

  • [Fix] build docker image failed since packaging is missing by @lvhan028 in #753
  • [Fix] Rollback the data type of input_ids to TYPE_UINT32 in preprocessor's proto by @lvhan028 in #758
  • fix turbomind build on sm<80 by @grimoire in #754
  • fix typo by @grimoire in #769

🌐 Other

Full Changelog: v0.1.0a0...v0.1.0a1

LMDeploy Release V0.1.0a0

23 Nov 13:05
a7c5007
Compare
Choose a tag to compare

What's Changed

🚀 Features

💥 Improvements

🐞 Bug fixes

📚 Documentations

🌐 Other

New Contributors

Full Changelog: v0.0.14...v0.1.0a0

LMDeploy Release V0.0.14

09 Nov 12:13
7b20cfd
Compare
Choose a tag to compare

What's Changed

💥 Improvements

🐞 Bug fixes

  • [Fix] Qwen's quantization results are abnormal & Baichuan cannot be quantized by @pppppM in #605
  • FIX: fix stop_session func bug by @yunzhongyan0 in #578
  • fix benchmark serving computation mistake by @AllentDan in #630
  • fix Tokenizer load error when the path of the being-converted model is not writable by @irexyc in #669
  • fix tokenizer_info when convert the model by @irexyc in #661

🌐 Other

New Contributors

Full Changelog: v0.0.13...v0.0.14

LMDeploy Release V0.0.13

30 Oct 06:35
56942c4
Compare
Choose a tag to compare

What's Changed

🚀 Features

💥 Improvements

📚 Documentations

🌐 Other

Full Changelog: v0.0.12...v0.0.13

LMDeploy Release V0.0.12

24 Oct 04:23
96f1b8e
Compare
Choose a tag to compare

What's Changed

🚀 Features

💥 Improvements

  • change model_format to qwen when model_name starts with qwen by @lvhan028 in #575
  • robust incremental decode for leading space by @AllentDan in #581

🐞 Bug fixes

  • avoid splitting chinese characters during decoding by @AllentDan in #566
  • Revert "[Docs] Simplify build.md" by @pppppM in #586
  • Fix crash and remove sys_instruct from chat.py and client.py by @irexyc in #591

🌐 Other

Full Changelog: v0.0.11...v0.0.12

LMDeploy Release V0.0.11

17 Oct 06:19
bb3cce9
Compare
Choose a tag to compare

What's Changed

🚀 Features

💥 Improvements

🐞 Bug fixes

  • Change shared_instance type from weakptr to shared_ptr by @lvhan028 in #507
  • [Fix] Set the default value of step being 0 by @lvhan028 in #532
  • [bug] fix mismatched shape for decoder output tensor by @akhoroshev in #517
  • Fix typing of openai protocol. by @mokeyish in #554

📚 Documentations

🌐 Other

New Contributors

Full Changelog: v0.0.10...v0.0.11