How to quantize to gguf using llama.cpp correctly #29

snowyu · 2024-08-09T03:48:16Z

@asirgogogo I tried convert_hf_to_gguf.py but got the error "ERROR:hf-to-gguf:Model IndexForCausalLM is not supported".
The old examples/convert_legacy_llama.py can convert the model to GGUF, but the resulting GGUF file only outputs meaningless repeated characters.

saber258 · 2024-10-26T06:05:27Z

llama.cpp needs some changes in convert.py to account for this model's structure. You will have to add those changes yourself, or wait for the index-1.9B researchers to release their revised convert.py.
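Before patching anything, it can help to confirm which architecture name the checkpoint declares, since that is the name quoted in the "is not supported" error. The snippet below is a minimal sketch, assuming the model directory contains a Hugging Face style config.json with an `architectures` field; the function name `read_architectures` and the example path are hypothetical, not part of llama.cpp.

```python
import json
import sys
from pathlib import Path


def read_architectures(model_dir):
    """Return the `architectures` list from <model_dir>/config.json.

    Assumes a Hugging Face style checkpoint layout; returns an empty
    list if the field is missing.
    """
    config_path = Path(model_dir) / "config.json"
    with config_path.open(encoding="utf-8") as f:
        config = json.load(f)
    return config.get("architectures", [])


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage (hypothetical path): python check_arch.py ./Index-1.9B-Chat
    print(read_architectures(sys.argv[1]))
```

If the printed name is IndexForCausalLM, the converter has no handler registered for it, which matches the error above. The fact that the legacy script runs but produces garbage output suggests the tensors are being mapped as if the model were a plain LLaMA, though that is an inference from this thread rather than something confirmed here.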
