GGUF Export
Export Unsloth models to GGUF format.
Basic Export
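A minimal sketch of a basic export, assuming `model` and `tokenizer` have already been loaded through Unsloth. It reuses the `save_pretrained_gguf` call shown under Advanced Options with only the required arguments. The `export_gguf` helper and the `SUPPORTED_METHODS` set are hypothetical names introduced for illustration. The set covers only the three methods listed on this page, not everything Unsloth may accept.

```python
# Hypothetical helper, not part of Unsloth: it only wraps the
# save_pretrained_gguf call documented on this page.
SUPPORTED_METHODS = {"q4_k_m", "q5_k_m", "q8_0"}  # methods listed on this page


def export_gguf(model, tokenizer, output_dir="output_dir", quantization_method="q4_k_m"):
    """Export `model` to GGUF in `output_dir` using one of the listed methods."""
    if quantization_method not in SUPPORTED_METHODS:
        raise ValueError(
            f"unknown method {quantization_method!r}; "
            f"expected one of {sorted(SUPPORTED_METHODS)}"
        )
    # Same argument order as the call under Advanced Options:
    # output directory first, then the tokenizer.
    model.save_pretrained_gguf(
        output_dir,
        tokenizer,
        quantization_method=quantization_method,
    )
    return output_dir
```

With a model loaded through Unsloth, `export_gguf(model, tokenizer)` would request a `q4_k_m` export into `output_dir`. Rejecting unknown method names up front is a design choice of this sketch, so that a typo fails before any conversion work starts.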
Quantization Options
| Method | Size (bits per weight) | Quality | Use Case |
|---|---|---|---|
| q4_k_m | 4.8 | Good | Recommended |
| q5_k_m | 5.7 | Better | Higher quality |
| q8_0 | 8.5 | Excellent | Maximum quality |
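The bits-per-weight (bpw) figures in the table translate into an approximate file size: parameters times bpw, divided by 8 bits per byte. The sketch below works this out for a hypothetical 7-billion-parameter model. The function name `estimate_gguf_size_gb` is introduced here for illustration, and the estimate ignores file metadata and any tensors stored at a different precision, so real files will differ somewhat.

```python
# bpw values copied from the table above
BPW = {"q4_k_m": 4.8, "q5_k_m": 5.7, "q8_0": 8.5}


def estimate_gguf_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size in gigabytes (10**9 bytes) of a quantized model."""
    total_bits = n_params * bits_per_weight
    return total_bits / 8 / 1e9  # bits -> bytes -> GB


# Assumed example: a model with 7 billion parameters
for method, bpw in BPW.items():
    print(f"{method}: ~{estimate_gguf_size_gb(7e9, bpw):.1f} GB")
# q4_k_m: ~4.2 GB
# q5_k_m: ~5.0 GB
# q8_0: ~7.4 GB
```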
Advanced Options
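model.save_pretrained_gguf(
    "output_dir",                    # directory the GGUF export is written to
    tokenizer,
    quantization_method="q4_k_m",    # see Quantization Options
    # Optional arguments
    push_to_hub=False,
    token=None,                      # Hugging Face token, needed when pushing to the Hub
    save_method="merged_16bit",      # or "lora"
)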