FEAT: support sparse vector for bge-m3 #2540

Open · pengjunfeng11 wants to merge 8 commits into main
Conversation

@pengjunfeng11 commented Nov 11, 2024

Adds support for sparse vector generation with the bge-m3 model.

Usage: model.create_embedding(text, return_sparse=True)
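
For illustration, a minimal sketch of the call; the shape of the sparse payload is an assumption modeled on FlagEmbedding's lexical_weights output for bge-m3, not confirmed by this PR:

from xinference.client import Client

client = Client("http://ip:port")
model = client.get_model("bge-m3")

# One call returns the dense embedding together with sparse lexical weights.
result = model.create_embedding("What is BGE M3?", return_sparse=True)
# Assumed sparse shape: a token_id -> weight mapping such as
# {"6": 0.27, "1239": 0.18, ...}; the actual field names may differ.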

Adds a convert_ids_to_tokens method.

This method converts token ids into human-readable text. Usage:

from xinference.client import Client

client = Client("http://ip:port")
model = client.get_model(model_name)
seq = model.convert_ids_to_tokens(key_list)

The method's return type is List[str]; when a list is passed in, the values are returned in the same order.
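
Continuing from the snippet above, a hedged sketch of the intended round trip (the "sparse" result key below is hypothetical):

result = model.create_embedding("What is BGE M3?", return_sparse=True)

# Hypothetical key name: pull the token ids out of the sparse mapping...
key_list = [int(token_id) for token_id in result["sparse"]]
# ...and map them back to human-readable tokens, returned in the same order.
tokens = model.convert_ids_to_tokens(key_list)
print(dict(zip(tokens, result["sparse"].values())))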

Fixes #2527.

@XprobeBot XprobeBot added this to the v0.16 milestone Nov 11, 2024
@qinxuye qinxuye changed the title sparse vector support FEAT: support sparse vector for bge-m3 Nov 11, 2024
@qinxuye (Contributor) commented Nov 15, 2024

For convert_ids_to_tokens, can you add a test verifying it in

https://github.com/xorbitsai/inference/blob/main/xinference/model/embedding/tests/test_embedding_models.py

bge-m3 is too large for CI, so we can test it manually.
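
A rough sketch of such a test; the setup fixture and the model chosen for CI are assumptions modeled on the existing tests in that file:

def test_convert_ids_to_tokens(setup):
    endpoint, _ = setup
    from xinference.client import Client

    client = Client(endpoint)
    # Assumption: a small embedding model keeps the test CI-friendly.
    model_uid = client.launch_model(
        model_name="bge-small-en-v1.5", model_type="embedding"
    )
    model = client.get_model(model_uid)

    tokens = model.convert_ids_to_tokens([100, 200, 300])
    assert isinstance(tokens, list)
    assert all(isinstance(t, str) for t in tokens)
    assert len(tokens) == 3  # length and order are preserved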

from sentence_transformers import SentenceTransformer

kwargs.setdefault("normalize_embeddings", True)

if kwargs.get("return_sparse") and "m3" in self._model_spec.model_name.lower():
    self._kwargs["hybrid_mode"] = True
@qinxuye (Contributor):

This looks a bit disruptive to the design; I don't know if there is a more elegant way.

@pengjunfeng11 (Author):

Are you referring only to the if check, or to the subsequent reload part?

@qinxuye (Contributor):

I mean the reload part.

@qinxuye (Contributor):

How about loading bge-m3 when hybrid_mode=True is specified? This can be done in load().
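
One possible shape for that suggestion, as a sketch only; self._model_path and self._kwargs are assumed attributes, and the sparse path uses FlagEmbedding's BGEM3FlagModel, which returns dense vectors together with sparse lexical_weights:

from sentence_transformers import SentenceTransformer


def load(self):
    # Choose the backend once at load time, instead of reloading after
    # create_embedding sees return_sparse=True.
    if self._kwargs.get("hybrid_mode"):
        from FlagEmbedding import BGEM3FlagModel

        self._model = BGEM3FlagModel(self._model_path, use_fp16=True)
    else:
        self._model = SentenceTransformer(self._model_path)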

@pengjunfeng11 (Author):

> For convert_ids_to_tokens, can you add a test verifying it in
> main/xinference/model/embedding/tests/test_embedding_models.py
> bge-m3 is too large for CI, so we can test it manually.

OK


Successfully merging this pull request may close the following issue:

xf不支持生成稀疏向量 (xf does not support generating sparse vectors) #2527