
[Feature] qwen2 vl support the turbomind engine #2774

Open
DexterGuo opened this issue Nov 20, 2024 · 5 comments

DexterGuo commented Nov 20, 2024

Motivation

1. Qwen2-VL delivers SOTA-level results among open-source vision-language models.
2. lmdeploy is an excellent inference framework.
3. It is therefore important for the turbomind engine to support Qwen2-VL.

Related resources

No response

Additional context

No response

lvhan028 (Collaborator) commented:

This is being worked on in PR #2720.


ciwang commented Nov 26, 2024

Hi @lvhan028, do you have an estimate of when that PR will be merged? Thank you in advance!

lvhan028 (Collaborator) commented:

Not yet. We are refactoring the VLM inference in PR #2810. The TM engine will not support Qwen2-VL until that PR is merged.

songlinlibit commented:

@lvhan028 Hello, I noticed that this PR has been merged. Can you confirm that versions after #2810 already support using the TM engine for Qwen2-VL inference?


comeby commented Dec 23, 2024

Is there any plan to merge this feature?
