Performance report for Q6_k on RTX 3090 and RTX 4090D
1
#6 opened 21 days ago
by
SlavikF
Thanks! This model (Q4) runs at 110 t/s on my RTX 3060 12gb (llama-server, no MTP)
🔥 1
2
#5 opened 25 days ago
by
BoriskaML
Works at LM Studio, deserves a like
❤️🔥 1
#4 opened 25 days ago
by
havem0ney
Нет информации по качеству квантов
🤯 1
2
#3 opened 26 days ago
by
Catlilface
В ollama сообщает, что не доступен tool calling
1
#2 opened 26 days ago
by
Remzalp
плюс медведь жена и миска пельмени
👀 3
#1 opened 26 days ago
by
Debich