Retrieval speed with a large BM25 index #1360

Answered by lintool
edoost asked this question in Q&A

Yes, retrieval over 2.7B docs is going to be slow. The standard solution is to break the collection down into separate indexes (say, 100M docs each), search them in parallel, and then merge the results.
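
A minimal sketch of that shard-and-merge pattern, in case it helps. Everything named here is hypothetical and for illustration only: `search_sharded`, `make_toy_shard`, and the `(docid, score)` hit format are not Pyserini APIs. Each shard is modeled as a plain function `(query, k) -> hits`. In practice each one would wrap a searcher opened on one of the sub-indexes, and the toy term-count scorer below only stands in for BM25 so the example runs on its own.

```python
import heapq
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List, Tuple

Hit = Tuple[str, float]                          # (docid, score); assumed hit format
ShardSearchFn = Callable[[str, int], List[Hit]]  # one search function per sub-index


def search_sharded(query: str, shards: List[ShardSearchFn], k: int = 10) -> List[Hit]:
    """Send the query to every shard in parallel and keep the global top-k by score."""
    with ThreadPoolExecutor(max_workers=len(shards)) as pool:
        # Each shard returns its own top-k; that is enough to recover the global top-k.
        per_shard = list(pool.map(lambda search: search(query, k), shards))
    all_hits = (hit for hits in per_shard for hit in hits)
    return heapq.nlargest(k, all_hits, key=lambda hit: hit[1])


def make_toy_shard(docs: Dict[str, str]) -> ShardSearchFn:
    """Hypothetical stand-in for a real per-shard searcher: scores by raw term counts."""
    def search(query: str, k: int) -> List[Hit]:
        terms = query.lower().split()
        scored = [
            (docid, float(sum(text.lower().split().count(t) for t in terms)))
            for docid, text in docs.items()
        ]
        matches = [hit for hit in scored if hit[1] > 0]
        return heapq.nlargest(k, matches, key=lambda hit: hit[1])
    return search
```

Usage would look like `search_sharded("bm25", [shard_a, shard_b], k=10)`, with one search function per sub-index. One caveat with this kind of merge: each shard computes BM25 with its own collection statistics (document frequencies, average document length), so scores from different shards are only approximately comparable. With large, randomly partitioned shards the difference is usually small.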

Answer selected by MXueguang

This discussion was converted from issue #1359 on November 28, 2022 11:29.