Inference Providers
Active filters: gptq
Xu-Ouyang/pythia-12b-deduped-int4-step100000-GPTQ-wikitext2
Text Generation
• 12B • Updated Nkumah7/gemma-11-2b-it-ptt-lora-exp-v1-merged-4bit-gptq
Text Generation
• 3B • Updated rinna/llama-3-youko-8b-gptq
Text Generation
• 8B • Updated • 4
rinna/llama-3-youko-8b-instruct-gptq
Text Generation
• 8B • Updated • 1
• 1
rinna/llama-3-youko-70b-gptq
Text Generation
• 71B • Updated rinna/llama-3-youko-70b-instruct-gptq
Text Generation
• 71B • Updated Xu-Ouyang/pythia-1.4b-deduped-int3-step14000-GPTQ-wikitext2
Text Generation
• 1B • Updated Xu-Ouyang/pythia-1.4b-deduped-int3-step29000-GPTQ-wikitext2
Text Generation
• 1B • Updated Xu-Ouyang/pythia-1.4b-deduped-int3-step43000-GPTQ-wikitext2
Text Generation
• 1B • Updated Xu-Ouyang/pythia-1.4b-deduped-int3-step57000-GPTQ-wikitext2
Text Generation
• 1B • Updated ChenMnZ/Llama-2-13b-EfficientQAT-w2g128-GPTQ
Text Generation
• 13B • Updated • 1
ChenMnZ/Llama-2-13b-EfficientQAT-w2g128-BitBLAS
Text Generation
• 51B • Updated • 1
ChenMnZ/Llama-2-13b-EfficientQAT-w2g64-BitBLAS
Text Generation
• 51B • Updated • 2
ChenMnZ/Llama-2-13b-EfficientQAT-w2g64-GPTQ
Text Generation
• 13B • Updated ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-BitBLAS
Text Generation
• 51B • Updated Xu-Ouyang/pythia-2.8b-deduped-int4-step129000-GPTQ-wikitext2
Text Generation
• 3B • Updated ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-GPTQ
Text Generation
• 13B • Updated ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-BitBLAS
Text Generation
• 274B • Updated • 1
ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-GPTQ
Text Generation
• 69B • Updated ChenMnZ/Llama-2-70b-EfficientQAT-w2g64-GPTQ
Text Generation
• 69B • Updated ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-BitBLAS
Text Generation
• 275B • Updated • 1
ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-GPTQ
Text Generation
• 69B • Updated • 1
Xu-Ouyang/pythia-2.8b-deduped-int3-step14000-GPTQ-wikitext2
Text Generation
• 3B • Updated • 1
Xu-Ouyang/pythia-12b-deduped-int3-step14000-GPTQ-wikitext2
Text Generation
• 11B • Updated • 4
ChenMnZ/Llama-2-7b-EfficientQAT-w2g128-GPTQ
Text Generation
• 7B • Updated • 1
ChenMnZ/Llama-2-7b-EfficientQAT-w2g64-GPTQ
Text Generation
• 7B • Updated • 5
• 1
Xu-Ouyang/pythia-2.8b-deduped-int3-step29000-GPTQ-wikitext2
Text Generation
• 3B • Updated • 6
ModelCloud/gemma-2-27b-it-gptq-4bit
Text Generation
• 28B • Updated • 114
• 12
ChenMnZ/Llama-2-7b-EfficientQAT-w4g128-GPTQ
Text Generation
• 7B • Updated • 2
ChenMnZ/Llama-3-70b-EfficientQAT-w2g128-GPTQ
Text Generation
• 71B • Updated • 1