gemma-4-31b-it-qat-q4_0-assistant.gguf

Naive Q4 quant of the QAT MTP drafter for Gemma 4 31B IT QAT, for use with llama.cpp PR 23398

Downloads last month
5,859
GGUF
Model size
0.5B params
Architecture
gemma4-assistant
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Simplepotat/gemma-4-31b-it-qat-q4_0-assistant-gguf