
HuggingFaceTEIDocumentEmbedder do not truncate rather they throw an exception. #7413

Closed
PAHXO opened this issue Mar 23, 2024 · 4 comments · Fixed by #7460
PAHXO commented Mar 23, 2024

HuggingFaceTEIDocumentEmbedder does not auto-truncate; instead it throws an exception.
I'd like the option to pass an argument that truncates my text so processing can continue.

@PAHXO changed the title from "HuggingFaceTEITextEmbedder do not truncate rather they throw an exception." to "HuggingFaceTEIDocumentEmbedder do not truncate rather they throw an exception." on Mar 23, 2024
anakin87 (Member) commented
@awinml I remember that you developed this Embedder.
Do you have any ideas/suggestions?

awinml (Contributor) commented Mar 25, 2024

@anakin87 This is only an issue when using a self-deployed Text Embeddings Inference (TEI) endpoint; the HF Inference endpoints automatically truncate but don't normalize. You can view a simple example showcasing this with both endpoints in this Colab notebook.

The default behaviour of TEI endpoints is to automatically normalize and to raise an error if the input exceeds 512 tokens. This can be changed via the truncate and normalize parameters when computing the embeddings. Please see the embed API reference for more information.
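For illustration, here is a minimal sketch of sending those parameters directly to a TEI server's /embed route (the local URL, port, and model are assumptions, not from this thread):

import requests

# Assumes a TEI container serving locally, e.g. started with:
#   docker run -p 8080:80 ghcr.io/huggingface/text-embeddings-inference:latest \
#       --model-id BAAI/bge-small-en-v1.5
response = requests.post(
    "http://localhost:8080/embed",
    json={"inputs": ["Very long text"], "truncate": True, "normalize": True},
)
response.raise_for_status()
embeddings = response.json()  # one embedding (list of floats) per input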

We use the InferenceClient.feature_extraction method to generate the embeddings, and this method does not support passing the truncate and normalize parameters. I opened a PR (huggingface/huggingface_hub#1940) to fix this, but it was deferred. Instead, their suggestion was to use the InferenceClient.post method and pass the parameters in the JSON payload. Something like this:

import json

import numpy as np
from huggingface_hub import InferenceClient

client = InferenceClient(...)
text = "Very long text"

# NOTE: the `truncate` and `normalize` parameters only work for TEI-powered APIs
response = client.post(
    json={"inputs": [text], "truncate": True, "normalize": True},
    task="feature-extraction",
)

# `post` returns the raw response bytes; decode the JSON body,
# then convert to float32 and back to a plain Python list
response_dict = json.loads(response.decode())
embedding = np.array(response_dict, dtype="float32").tolist()

I can open a PR to refactor the embedders if this approach is okay. The other option would be to wait until they standardize their API and document this limitation in the docs.

anakin87 (Member) commented
Let me try to recap... Please correct me if I am wrong.

Currently, HFTEIEmbedders in Haystack support these different backends:

  • TEI deployed locally with Docker
  • TEI on paid HF Inference Endpoints
  • HF Free Inference API

When using the InferenceClient, the truncate and normalize parameters are only taken into account in the first two cases.
When using the HF Free Inference API, they are ignored and the defaults are used (truncate=True; normalize=False).

If my analysis is correct, I would do the following:

  • introduce these two parameters in our Embedders with these defaults: truncate=True; normalize=False
  • explain clearly in the docstring that these two parameters are only considered when using a TEI service (locally or deployed on HF Inference Endpoints) and are ignored when using the HF Free Inference API (see the sketch after this list)
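
A rough sketch of what this could look like on the embedder (only the two new parameters and their defaults come from this thread; the class body and the other parameters are assumptions for illustration):

from typing import Optional


class HuggingFaceTEIDocumentEmbedder:
    # Sketch only: just the truncate/normalize additions reflect this proposal
    def __init__(
        self,
        model: str = "BAAI/bge-small-en-v1.5",  # assumed default model
        url: Optional[str] = None,
        truncate: bool = True,    # proposed default
        normalize: bool = False,  # proposed default
    ):
        # Honored only by TEI (local Docker or paid HF Inference Endpoints);
        # silently ignored by the HF Free Inference API
        self.model = model
        self.url = url
        self.truncate = truncate
        self.normalize = normalize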

@awinml WDYT?

awinml (Contributor) commented Mar 27, 2024

@anakin87 Sounds good! I'll add the truncate and normalize parameters with the defaults you mentioned. The docstrings will clearly explain that these only take effect when using a local or paid TEI service, not the free Hugging Face API. Thanks for the detailed recap!
