Active filters: vLLM
mistralai/Mistral-Small-4-119B-2603
119B • Updated • 82.4k
• 354
QuantTrio/Qwen3.5-27B-AWQ
Image-Text-to-Text
• 28B • Updated • 364k
• 41
RohitUltimate/Qwen3.5_VL_2B_12k
Image-Text-to-Text
• 2B • Updated • 107
• 7
mistralai/Mistral-Small-4-119B-2603-eagle
Updated • 383
• 45
QuantTrio/Qwopus3.5-27B-v3-AWQ
Image-Text-to-Text
• 27B • Updated • 17.4k
• 9
Image-Text-to-Text
• 5B • Updated • 40.3k
• 8
unsloth/Mistral-Small-4-119B-2603-GGUF
119B • Updated • 46.2k
• 58
QuantTrio/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-AWQ
Image-Text-to-Text
• 28B • Updated • 27.7k
• 12
QuantTrio/gemma-4-31B-it-AWQ-6Bit
Image-Text-to-Text
• 31B • Updated • 8.23k
• 8
QuantTrio/gemma-4-31B-it-AWQ
Image-Text-to-Text
• 31B • Updated • 57k
• 5
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int4
Text Generation
• 4B • Updated • 129k
• 3
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ
Text Generation
• 31B • Updated • 776k
• 42
QuantTrio/GLM-4.7-Flash-AWQ
Text Generation
• 31B • Updated • 94.3k
• 11
QuantTrio/MiniMax-M2.5-AWQ
Text Generation
• 229B • Updated • 76k
• 14
QuantTrio/Qwen3.5-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated • 159k
• 17
mistralai/Mistral-Small-4-119B-2603-NVFP4
Updated • 3.52k
• 81
QuantTrio/Qwopus3.5-27B-v3-AWQ-6Bit
Image-Text-to-Text
• 27B • Updated • 1.53k
• 2
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
• 9B • Updated • 71
• 6
model-scope/glm-4-9b-chat-GPTQ-Int8
Text Generation
• 9B • Updated • 16
• 2
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
• 73B • Updated • 89
• 2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
• 69B • Updated • 75
prithivMLmods/Nu2-Lupi-Qwen-14B
Text Generation
• 15B • Updated • 6
• 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF
15B • Updated • 121
• 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF
15B • Updated • 231
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int4
Text Generation
• 0.6B • Updated • 60
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int8
Text Generation
• 0.6B • Updated • 8
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
• 2B • Updated • 148
• 1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
• 2B • Updated • 12
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
• 33B • Updated • 27.9k
• 4
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
• 33B • Updated • 234
• 4