Update config.py #150
base: main
Conversation
fix(embeddings): increase chunk_size for OpenAIEmbeddings to avoid API limits

Adjusted the `chunk_size` parameter in the `OpenAIEmbeddings` class to 200, based on the calculation:

- Maximum payload size: 300,000 tokens
- Upper-bound chunk size: 1,500 tokens
- ⇒ 300,000 / 1,500 = 200

This ensures embedding requests stay within OpenAI's maximum input size and prevents failures due to oversized requests.
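As a sanity check, the arithmetic from the description can be reproduced directly (the 300,000-token request limit and 1,500-token upper bound per chunk are the figures stated above):

```python
# Token budget per embeddings API call (from the PR description).
MAX_TOKENS_PER_REQUEST = 300_000
# Assumed upper bound on a single chunk's token count (from the PR description).
MAX_TOKENS_PER_CHUNK = 1_500

# Largest batch of chunks guaranteed to fit in one request.
safe_batch_size = MAX_TOKENS_PER_REQUEST // MAX_TOKENS_PER_CHUNK
print(safe_batch_size)  # → 200
```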
Were you not able to customize the chunk size via the .env variable? See the Readme.md
The two settings are not the same. For example, if you set chunk_size to 1000 in the .env file and load a document that produces more than 300 chunks, the OpenAI API will return an error because it supports a maximum of 300,000 tokens per call.
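The failure mode described in this comment can be sketched with its own numbers (variable names here are illustrative, not from the repository):

```python
# From the comment: 1,000-token document chunks, and more than 300 of them
# submitted in a single embeddings call.
tokens_per_chunk = 1_000
num_chunks_in_call = 301
api_token_limit = 300_000  # OpenAI's per-request input limit

tokens_in_one_call = tokens_per_chunk * num_chunks_in_call
print(tokens_in_one_call > api_token_limit)  # → True, so the API rejects the call
```

With the batch capped at 200 chunks, even 1,500-token chunks stay within the limit (200 × 1,500 = 300,000).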
@@ -190,6 +190,7 @@ def init_embeddings(provider, model):
            api_key=RAG_OPENAI_API_KEY,
            openai_api_base=RAG_OPENAI_BASEURL,
            openai_proxy=RAG_OPENAI_PROXY,
+           chunk_size=200
Rather than hardcoding a value, it would be better to make this configurable.
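One way to make it configurable, sketched here with a hypothetical `EMBEDDINGS_CHUNK_SIZE` environment variable (the name is an assumption, not from the repository):

```python
import os

# Hypothetical setting name; falls back to the 200 default proposed in this PR.
EMBEDDINGS_CHUNK_SIZE = int(os.getenv("EMBEDDINGS_CHUNK_SIZE", "200"))

# ...which would then replace the hardcoded value inside init_embeddings():
#     OpenAIEmbeddings(
#         api_key=RAG_OPENAI_API_KEY,
#         openai_api_base=RAG_OPENAI_BASEURL,
#         openai_proxy=RAG_OPENAI_PROXY,
#         chunk_size=EMBEDDINGS_CHUNK_SIZE,
#     )
print(EMBEDDINGS_CHUNK_SIZE)
```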
I have the same problem, but with Azure OpenAI. I have opened a separate pull request that implements the requested change and applies it to both OpenAI and Azure OpenAI: