ai-sage
/
GigaChat3.1-10B-A1.8B-GGUF

Text Generation
Transformers
GGUF
Russian
English
instruct
Mixture of Experts
multilingual
tool-use
long-context
conversational

Performance report for Q6_K on RTX 3090 and RTX 4090D

1
#6 opened 21 days ago by
SlavikF

Thanks! This model (Q4) runs at 110 t/s on my RTX 3060 12 GB (llama-server, no MTP)

🔥 1
2
#5 opened 25 days ago by
BoriskaML

Works in LM Studio, deserves a like

❤️🔥 1
#4 opened 25 days ago by
havem0ney

No information on the quality of the quants

🤯 1
2
#3 opened 26 days ago by
Catlilface

Ollama reports that tool calling is not available

1
#2 opened 26 days ago by
Remzalp

Plus a bear, a wife, and a bowl of pelmeni

👀 3
#1 opened 26 days ago by
Debich