feat: Cost estimation #5
We show the cost estimation and actual time cost by
As we can see, at size The most acceptable way is to treat both size 1000 and size 10 as an index scan, which might sacrifice some QPS for sizes in the range from 1 to 1000. But to be honest, any formula that meets this condition is sure to be equivalent to the current implementation. As a conclusion, as the
What's the first column "size"?
It is the number of vector rows inserted into the database. We take the first 10, 1000, 50000 and 100000 rows of the full 500000-row OpenAI dataset as subsets to validate the cost estimator.
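For anyone who wants to reproduce the subsets, here is a minimal Python sketch of the prefix selection described above. The helper name `prefix_subsets` and the in-memory `rows` sequence are assumptions made for illustration only; the actual benchmark loads the data into the database, and that loading step is not shown here.

```python
from typing import Dict, Sequence

# Subset sizes quoted in the comment above.
SUBSET_SIZES = (10, 1000, 50000, 100000)


def prefix_subsets(rows: Sequence, sizes: Sequence[int] = SUBSET_SIZES) -> Dict[int, Sequence]:
    """Return {size: the first `size` rows} for each requested size.

    Hypothetical helper: `rows` stands in for the full dataset held in memory.
    """
    subsets = {}
    for size in sizes:
        if size > len(rows):
            raise ValueError(f"requested {size} rows but only {len(rows)} are available")
        # A prefix slice keeps the original row order, so each smaller
        # subset is contained in every larger one.
        subsets[size] = rows[:size]
    return subsets
```

Because every subset is a prefix of the same dataset, the results at different sizes are measured on nested data, which makes the size column comparable from row to row.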
Ref: https://github.com/pgvector/pgvector/blob/158d9340bca5796d8f98182e5a65356bac676b74/src/ivfflat.c#L64