UD-Q4_K_XL Potential Quantization Bug?

#5
by dcarnazzola - opened

I’m getting no output from Mistral Small 4 in llama.cpp exactly as described here: https://github.com/ggml-org/llama.cpp/issues/20668

One of the commenters indicated that some weights seemed far too great in magnitude, potentially blowing the matmul kernels.

This seems to be a quite critical issue

Sign up or log in to comment