Non-uniform vector quantization #374
base: main
Conversation
# Conflicts:
#	jvector-examples/src/main/java/io/github/jbellis/jvector/example/distancesNVQ.java
# Conflicts:
#	jvector-base/src/main/java/io/github/jbellis/jvector/vector/VectorUtil.java
#	jvector-base/src/main/java/io/github/jbellis/jvector/vector/VectorUtilSupport.java
#	jvector-native/src/main/java/io/github/jbellis/jvector/vector/NativeVectorUtilSupport.java
#	jvector-twenty/src/main/java/io/github/jbellis/jvector/vector/PanamaVectorUtilSupport.java
#	jvector-twenty/src/main/java/io/github/jbellis/jvector/vector/SimdOps.java
…or debugging but is not truly exercised now. Replace a couple of FMA patterns in the tail computations.
Some minor nits/questions. Overall, looks quite good.
        var function = scorer.scoreFunctionFor(queryVector, vsf);

        return new ScoreFunction.ExactScoreFunction() {
            private final QuantizedVector scratch = NVQuantization.QuantizedVector.createEmpty(nvq.subvectorSizesAndOffsets, nvq.bitsPerDimension);
In most other places like this, we use a reusable thread-local scratch. I'm not sure if it's worth it, so consider this a possible deferred optimization.
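For reference, a minimal sketch of what I mean, using a plain `java.lang.ThreadLocal` (JVector's own thread-local helper may be a better fit; the `createEmpty` call and field names are taken from the diff above):

```java
// Sketch only: reusable per-thread scratch instead of allocating one per score function.
// Assumes nvq is an (effectively) final field visible here.
private final ThreadLocal<QuantizedVector> scratch = ThreadLocal.withInitial(
        () -> NVQuantization.QuantizedVector.createEmpty(nvq.subvectorSizesAndOffsets, nvq.bitsPerDimension));

// inside the score function, use scratch.get() wherever the current scratch field is used
```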
@@ -324,6 +324,8 @@ public OnDiskGraphIndexWriter build() throws IOException {
             int dimension;
             if (features.containsKey(FeatureId.INLINE_VECTORS)) {
                 dimension = ((InlineVectors) features.get(FeatureId.INLINE_VECTORS)).dimension();
+            } else if (features.containsKey(FeatureId.NVQ_VECTORS)) {
+                dimension = ((NVQ) features.get(FeatureId.NVQ_VECTORS)).dimension();
             } else {
                 throw new IllegalArgumentException("Inline vectors must be provided.");
We should update this error message to indicate that either inline vectors or NVQ vectors must be provided.
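Something along these lines, say (wording is just a suggestion):

```java
throw new IllegalArgumentException("Either inline vectors or NVQ vectors must be provided.");
```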
     */
    public LossFunction(int nDims) {
        if (nDims <= 0) {
            throw new IllegalArgumentException("The standard deviation initSigma must be positive");
Bad error message copy/paste: the check is on nDims, but the message refers to initSigma.
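Presumably it should talk about nDims instead, e.g. (suggested wording only):

```java
if (nDims <= 0) {
    throw new IllegalArgumentException("The number of dimensions nDims must be positive");
}
```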
        minBounds = new float[nDims];
        maxBounds = new float[nDims];
        for (int d = 0; d < nDims; d++) {
            minBounds[d] = Float.NEGATIVE_INFINITY;
Could use Arrays.fill here
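For example (the maxBounds line assumes it is meant to start at +Infinity, mirroring minBounds; java.util.Arrays needs to be imported):

```java
Arrays.fill(minBounds, Float.NEGATIVE_INFINITY);
Arrays.fill(maxBounds, Float.POSITIVE_INFINITY); // assumption: maxBounds starts unbounded above
```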
 */
public class NESOptimizer {
    public enum Distribution {
        MULTINORMAL, // This is for adding future support for the multinormal case (Algorithm 5 in [1])
Do we think we'll get back to MULTINORMAL relatively soon? If not, is it worth cleaning this up for now?
    float[] nvqCosine8bit(VectorFloat<?> vector, ByteSequence<?> bytes, float growthRate, float midpoint, float minValue, float maxValue, VectorFloat<?> centroid);

    /**
     * When using 4-bit NVQ quantization and vector instructions, it is easier to unpack all even entries, and then all
Why the reference to 4-bit quantization in the 8-bit javadoc?
        }
    }

    {
Introducing the new variable scopes here is perfectly fine, but IMO it would also be fine to put this all in one scope and re-use the variables. There are definitely codebases where the scoped style is more idiomatic, but I can't think of other places we do this in JVector. Just thinking out loud -- don't feel like this has to change.
    public void testSaveLoadNVQ() throws Exception {

        int[][] testsConfigAndResults = {
                // Tuples of: nDimensions, nSubvectors, number of bots per dimension, and the expected number of bytes
"number of bots per dimension" can be removed from this comment (there's also a typo, but it will get removed anyway).
    // NVQ quantization instructions start here
    //---------------------------------------------

    static FloatVector const1f = FloatVector.broadcast(FloatVector.SPECIES_PREFERRED, 1.f);
same static final discussion here as in VectorSimdOps
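i.e., something like this, assuming the broadcast constant never changes after class initialization:

```java
static final FloatVector const1f = FloatVector.broadcast(FloatVector.SPECIES_PREFERRED, 1.f);
```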
    /**
     * Quantize a subvector as an 8-bit quantized subvector.
     * All values of the vector must be in [0, 1]. For example, the input vector has been
I'm confused by "All values of the vector..." and then by the references to bias/scale in this/nvqLoss/nvqUniformLoss. Are those remnants of a previous approach?
This PR adds a new feature that uses quantized vectors for re-ranking. The quantization is computed on subvectors of each vector in the index, using a non-uniform quantizer.
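For readers unfamiliar with the idea, here is a toy, self-contained sketch of non-uniform 8-bit quantization of a subvector using a logistic warp. It only borrows the parameter names (growthRate, midpoint, minValue, maxValue) from the nvqCosine8bit signature above; the actual NVQ formulation in this PR may differ.

```java
import java.util.Arrays;

// Toy illustration only -- not the implementation in this PR.
public class NonUniformQuantizationSketch {
    // Map each value to an 8-bit code, spending more codes near `midpoint`
    // by warping the normalized value through a logistic curve.
    static int[] quantize8bit(float[] subvector, float minValue, float maxValue,
                              float growthRate, float midpoint) {
        int[] codes = new int[subvector.length];
        for (int i = 0; i < subvector.length; i++) {
            float u = (subvector[i] - minValue) / (maxValue - minValue);   // normalize to [0, 1]
            double warped = 1.0 / (1.0 + Math.exp(-growthRate * (u - midpoint)));
            codes[i] = (int) Math.round(warped * 255);                     // 8-bit code in [0, 255]
        }
        return codes;
    }

    public static void main(String[] args) {
        float[] sub = {0.10f, 0.40f, 0.45f, 0.90f};
        System.out.println(Arrays.toString(quantize8bit(sub, 0f, 1f, 6f, 0.5f)));
    }
}
```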