Skip to content

Commit

Permalink
DR-587 add graphic to compression page (#2549)
Browse files Browse the repository at this point in the history
* DR-587 add graphic to compression page

* fix wording
  • Loading branch information
daveatweaviate committed Sep 5, 2024
1 parent e56b0d1 commit 20b6cd3
Show file tree
Hide file tree
Showing 2 changed files with 7 additions and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -71,7 +71,13 @@ Recall measures how well an algorithm finds true positive matches in a data set.

A compressed vector has less information than the full, uncompressed vector. A vector that would match a search might be missed if key information is missing from the compressed vector. That lowers recall.

To improve recall with quantized vectors, Weaviate over-fetches a list of candidate vectors during a search. Then, for each item on the candidate list, Weaviate fetches the corresponding uncompressed vector before calculating their distances to the query vector to determine the final ranking. This followup rescoring process is slower than an in-memory search, but since Weaviate only has to search a limited number of uncompressed vectors, it is still very fast. Most importantly, rescoring using the uncompressed vectors greatly improves recall.
To improve recall with quantized vectors, Weaviate over-fetches a list of candidate vectors during a search. For each item on the candidate list, Weaviate fetches the corresponding uncompressed vector. To determine the final ranking, Weaviate calculates the distances from the uncompressed vectors to the query vector.

import RescoringIllustration from '/developers/weaviate/starter-guides/managing-resources/img/rescore-uncompressed-vectors.png';

<img src={RescoringIllustration} width="100%" alt="Rescoring illustration"/>

The followup rescoring process is slower than an in-memory search, but since Weaviate only has to search a limited number of uncompressed vectors, it is still very fast. Most importantly, rescoring using the uncompressed vectors greatly improves recall.

The search algorithm's use of over-fetching and rescoring means you get the benefits of compression without losing the precision of an uncompressed vector search.

Expand Down
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit 20b6cd3

Please sign in to comment.