
Does AutoGGUF support quantization for SD3 or Flux and other DIT drawing models? #79

Open
Amazon90 opened this issue Jan 20, 2025 · 1 comment
Amazon90 commented Jan 20, 2025

Your GUI tool is very beginner-friendly for those new to model quantization, but I'm unsure whether it supports quantizing drawing (image-generation) models. I hope you can provide an answer.

https://github.com/city96/ComfyUI-GGUF/blob/main/tools/README.md

@leafspark
Owner

It should support quantizing DIT models like SD3 and Flux. Here's how:

1. Convert your safetensors model to GGUF format:

   ```
   python convert.py --src model.safetensors
   ```

2. Build and install the patched llama.cpp version, then copy its binaries into a new folder inside the `llama_bin` folder in your AutoGGUF directory.

3. Launch AutoGGUF; the new folder should appear in the backend selection.

4. Quantize normally, but:

   - Skip text model parameters
   - Use the Extra Arguments box for vision model settings
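The backend-folder setup in the steps above can be sketched in the shell. The directory names here (`AutoGGUF`, `patched-llamacpp`) and the llama.cpp build output path are assumptions for illustration; adjust them to your actual layout:

```shell
# Create a new backend folder under llama_bin
# (folder names are placeholders, not AutoGGUF requirements).
mkdir -p AutoGGUF/llama_bin/patched-llamacpp

# Copy the patched llama.cpp binaries into it, e.g.:
# cp llama.cpp/build/bin/* AutoGGUF/llama_bin/patched-llamacpp/

# AutoGGUF should now list this folder in its backend selection.
ls AutoGGUF/llama_bin
```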

If there are prebuilt binaries in a repo you'd like to use, you can also set `AUTOGGUF_BACKEND_REPO` to `owner/repo` to pull new releases from that repository instead (the binaries must be packaged in a single zip file, same as in the llama.cpp repo).
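A minimal sketch of the environment-variable route; `owner/repo` is the placeholder from the text, not a real repository:

```shell
# Point AutoGGUF at a GitHub repo whose releases ship the llama.cpp
# binaries as a single zip file ("owner/repo" is a placeholder --
# substitute the actual repository before launching AutoGGUF).
export AUTOGGUF_BACKEND_REPO="owner/repo"
echo "$AUTOGGUF_BACKEND_REPO"
```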

@leafspark leafspark self-assigned this Jan 27, 2025