LMDeploy Release V0.1.0a1
What's Changed
💥 Improvements
- Set the default value of
max_context_token_num
1 by @lvhan028 in #761 - add triton server test and workflow yml by @RunningLeon in #760
- improvement(build): enable ninja and gold linker by @tpoisonooo in #767
- Report first-token-latency and token-latency percentiles by @lvhan028 in #736
- convert model with hf repo_id by @irexyc in #774
🐞 Bug fixes
- [Fix] build docker image failed since
packaging
is missing by @lvhan028 in #753 - [Fix] Rollback the data type of
input_ids
toTYPE_UINT32
in preprocessor's proto by @lvhan028 in #758 - fix turbomind build on sm<80 by @grimoire in #754
- fix typo by @grimoire in #769
🌐 Other
Full Changelog: v0.1.0a0...v0.1.0a1