Quantized using this PR: https://github.com/leejet/stable-diffusion.cpp/pull/447
Normal K-quants do not work properly with SD3.5-Large models because around 90% of the weights are in tensors whose shape doesn't match the 256 superblock size of K-quants, so they can't be quantized this way. Mixing quantization types lets us take advantage of the better fidelity of K-quants to some extent while keeping the model file size relatively small.
Only the second layers of both MLPs in each MMDiT block of SD3.5-Large models have the correct shape to be compatible with K-quants. These still account for about 10% of all the parameters.
## Files:
### Mixed Types: