How to quantize to gguf using llama.cpp correctly #29

snowyu · 2024-08-09T03:48:16Z

@asirgogogo I tried convert_hf_to_gguf.py but get errror "ERROR:hf-to-gguf:Model IndexForCausalLM is not supported".
The old examples/convert_legacy_llama.py can convert to gguf. but this gguf output meaningless repeated characters only.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to quantize to gguf using llama.cpp correctly #29

How to quantize to gguf using llama.cpp correctly #29

snowyu commented Aug 9, 2024

How to quantize to gguf using llama.cpp correctly #29

How to quantize to gguf using llama.cpp correctly #29

Comments

snowyu commented Aug 9, 2024