Fast retrieval of 8k vectors of dim 1024 #2005
Unanswered
ExtReMLapin
-
Maybe the serializer of arrays is not efficient and the cost is just in serializing the result back. Also, if you store large records, it could help to change the page size from 65k to 2x or 4x that. Could you please provide a test case, or even a database with similar data, so we can run some tests locally?
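To illustrate the serialization hypothesis above, here is a minimal Python sketch. It says nothing about how ArcadeDB actually serializes arrays; it only shows how much the encoded size of one 1024-dimensional vector can differ between a text encoding and a packed binary one. The random vector and the choice of JSON as the text format are assumptions made for illustration.

```python
import json
import random
import struct

DIM = 1024  # embedding dimension from the question

# Hypothetical embedding filled with random values; real data will differ.
random.seed(0)
vector = [random.uniform(-1.0, 1.0) for _ in range(DIM)]

# Text encoding: every float is written out as a decimal string.
as_json = json.dumps(vector).encode("utf-8")

# Binary encoding: every float is packed as a 4-byte little-endian float32.
as_binary = struct.pack(f"<{DIM}f", *vector)

print(len(as_json), "bytes as JSON text")
print(len(as_binary), "bytes as packed float32")
```

The packed form is always 4096 bytes for this dimension, while the text form is several times larger and also has to be parsed number by number on the client. If the slow path turns out to be a text-based response format, that per-element cost is one plausible place for the time to go.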
-
Hello,
We recently moved away from an H5 file to ArcadeDB to leverage all the DB safety features (ACID etc.).
We now store our embeddings in Arcade, each one linked by an edge to the node it embeds.
The issue is that retrieval at boot of our software is quite slow.
Cypher query
Takes 21s for 7692 entries
SQL query
Takes 21s for 7692 entries
Profiling the Cypher query returns:
Returning the vector RID instead of the vector itself obviously speeds the query up a lot (it takes less than 1 s), which leads me to believe that returning the 8k vectors of dimension 1024 is what slows the whole query down.
The vector property was created as an ARRAY_OF_FLOATS.
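For scale, a quick back-of-the-envelope calculation of the payload described above, as a Python sketch. The entry count, dimension, and 21 s timing come from this post; the 4 bytes per element is an assumption (32-bit floats for ARRAY_OF_FLOATS).

```python
# Rough size of the result set, assuming 4-byte (float32) elements.
NUM_VECTORS = 7692       # entries reported above
DIM = 1024               # dimension of each embedding
BYTES_PER_FLOAT = 4      # assumption: ARRAY_OF_FLOATS holds 32-bit floats
QUERY_SECONDS = 21       # query time reported above

total_floats = NUM_VECTORS * DIM
raw_bytes = total_floats * BYTES_PER_FLOAT
raw_mib = raw_bytes / (1024 * 1024)
throughput_mib_s = raw_mib / QUERY_SECONDS

print(f"{total_floats:,} floats, about {raw_mib:.1f} MiB of raw float data")
print(f"effective throughput: about {throughput_mib_s:.2f} MiB/s")
```

Under that assumption the raw data is only about 30 MiB, so 21 s works out to roughly 1.4 MiB/s. That suggests the time is going into per-element handling somewhere along the path rather than into the sheer volume of data.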