Merge pull request #5 from svupper/main

Update to new llama.cpp quantize cmd
bofenghuang · Apr 4, 2023 · a7836ca · a7836ca
2 parents 10a65e1 + 6574401
commit a7836ca
Showing 1 changed file with 1 addition and 1 deletion.
diff --git a/README.md b/README.md
@@ -127,7 +127,7 @@ tree models
 python convert-pth-to-ggml.py ./models/7B/ 1
 
 # further quantize the model to 4-bit
-python quantize.py 7B
+./quantize ./models/7B/ggml-model-f16.bin ./models/7B/ggml-model-q4_0.bin 2
 ```
 
 ### 5. Run the inference