Llama.cpp

Riguz（留言 | 贡献）2024年7月19日 (五) 02:48的版本（→‎Convert Hugging Face Model to GGUF）

(差异) ←上一版本 | 最后版本 (差异) | 下一版本→ (差异)

Build llama.cpp

git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
make

Convert Hugging Face Model to GGUF

pip install -r requirements.txt
python convert_hf_to_gguf.py --help
#M1 MPS does not support bf16
python convert_hf_to_gguf.py ~/Documents/MODELS/Qwen2-0.5B --outfile ~/Documents/MODELS/qwen2-0.5b-fp16.gguf --outtype f16

↑ https://github.com/ggerganov/llama.cpp/blob/master/docs/build.md

检索自“https://riguz.com/index.php?title=Llama.cpp&oldid=4329”