Active filters: w4a16
Lorbus/Qwen3.6-27B-int4-AutoRound
Image-Text-to-Text
• 6B • Updated • 1.76M
• 114
spectator2026/MiMo-V2.5-AWQ-int4
Text Generation
• 53B • Updated • 2.85k
• 3
LordNeel/DeepSeek-V4-Flash-Acti-MTP-W4A16-FP8
Text Generation
• 44B • Updated • 2.36k
• 12
canada-quant/DeepSeek-V4-Flash-W4A16-FP8-MTP
Text Generation
• 51B • Updated • 11.4k
• 6
philbert440/Qwen3.6-40B-DeckardUncensored-OpusDistilled-HermesCalibrated-W4A16-AWQ
Image-Text-to-Text
• 40B • Updated • 1.26k
• 5
atbender/Qwen3.5-REAP-262B-A17B-W4A16
Text Generation
• 40B • Updated • 75
• 4
alonsoko/gemma-4-31b-it-abliterated-heretic-AWQ-W4A16
Image-Text-to-Text
• 32B • Updated • 25.9k
• 11
ebircak/gemma-4-31B-it-4bit-W4A16-AWQ
Text Generation
• 32B • Updated • 39.3k
• 3
atbender/Qwen3.6-VL-REAP-26B-A3B-W4A16
Text Generation
• 1B • Updated • 5.3k
• 5
lyf/Qwen3.6-27B-heretic-v2-mtp-int4-AutoRound
Image-Text-to-Text
• 3B • Updated • 2.01k
• 8
webhie/Qwen3.6-27B-int4-AutoRound-Code
Image-Text-to-Text
• 6B • Updated • 30.9k
• 9
alexxorm/Huihui-Qwen3.6-27B-abliterated-AWQ
Image-Text-to-Text
• 28B • Updated • 3.6k
• 2
shisa-ai/Qwen3.6-35B-A3B-PARO-full4096-e5-packed
Text Generation
• 6B • Updated • 76
• 1
LeaderboardModel1/gemma-4-E4B-it-OBLITERATED-autoround-W4A16
Text Generation
• Updated • 1
spectator2026/Infinity-Parser2-Pro-AWQ-W4A16
Image-Text-to-Text
• 35B • Updated • 80
• 1
groxaxo/LocateAnything-3B-AutoRound-W4A16
Image-Text-to-Text
• 1B • Updated • 3
• 1
RedHatAI/Qwen2-VL-72B-Instruct-FP8-dynamic
Image-Text-to-Text
• 73B • Updated • 23
RedHatAI/Mixtral-8x22B-v0.1-quantized.w4a16
141B • Updated • 21
RedHatAI/Mixtral-8x7B-v0.1-quantized.w4a16
47B • Updated • 221
RedHatAI/QwQ-32B-Preview-quantized.w4a16
33B • Updated • 6
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w4a16
Text Generation
• 71B • Updated • 12
RedHatAI/granite-3.1-8b-instruct-quantized.w4a16
Text Generation
• 8B • Updated • 1.16k
• 1
RedHatAI/granite-3.1-2b-instruct-quantized.w4a16
Text Generation
• 3B • Updated • 38.1k
RedHatAI/DeepSeek-V2.5-1210-quantized.w4a16
Text Generation
• 238B • Updated • 38
RedHatAI/DeepSeek-Coder-V2-Instruct-0724-quantized.w4a16
Text Generation
• 238B • Updated • 43
• 1
RedHatAI/granite-3.1-2b-base-quantized.w4a16
Text Generation
• 3B • Updated • 13
RedHatAI/granite-3.1-8b-base-quantized.w4a16
Text Generation
• 8B • Updated • 98
• 1
RedHatAI/Qwen2-VL-72B-Instruct-quantized.w4a16
Image-Text-to-Text
• 74B • Updated • 9
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w4a16
Text Generation
• 24B • Updated • 422
• 1
RedHatAI/Phi-3-vision-128k-instruct-W4A16-G128
Text Generation
• 4B • Updated • 35
• 1