Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

241

Base only

Active filters: w4a16

Lorbus/Qwen3.6-27B-int4-AutoRound

Image-Text-to-Text • 6B • Updated Apr 22 • 1.76M • 114

spectator2026/MiMo-V2.5-AWQ-int4

Text Generation • 53B • Updated 3 days ago • 2.85k • 3

LordNeel/DeepSeek-V4-Flash-Acti-MTP-W4A16-FP8

Text Generation • 44B • Updated May 11 • 2.36k • 12

canada-quant/DeepSeek-V4-Flash-W4A16-FP8-MTP

Text Generation • 51B • Updated 13 days ago • 11.4k • 6

philbert440/Qwen3.6-40B-DeckardUncensored-OpusDistilled-HermesCalibrated-W4A16-AWQ

Image-Text-to-Text • 40B • Updated 16 days ago • 1.26k • 5

atbender/Qwen3.5-REAP-262B-A17B-W4A16

Text Generation • 40B • Updated Mar 31 • 75 • 4

alonsoko/gemma-4-31b-it-abliterated-heretic-AWQ-W4A16

Image-Text-to-Text • 32B • Updated 25 days ago • 25.9k • 11

ebircak/gemma-4-31B-it-4bit-W4A16-AWQ

Text Generation • 32B • Updated Apr 7 • 39.3k • 3

atbender/Qwen3.6-VL-REAP-26B-A3B-W4A16

Text Generation • 1B • Updated Apr 19 • 5.3k • 5

lyf/Qwen3.6-27B-heretic-v2-mtp-int4-AutoRound

Image-Text-to-Text • 3B • Updated Apr 28 • 2.01k • 8

webhie/Qwen3.6-27B-int4-AutoRound-Code

Image-Text-to-Text • 6B • Updated about 1 month ago • 30.9k • 9

alexxorm/Huihui-Qwen3.6-27B-abliterated-AWQ

Image-Text-to-Text • 28B • Updated 29 days ago • 3.6k • 2

shisa-ai/Qwen3.6-35B-A3B-PARO-full4096-e5-packed

Text Generation • 6B • Updated 26 days ago • 76 • 1

LeaderboardModel1/gemma-4-E4B-it-OBLITERATED-autoround-W4A16

Text Generation • Updated 22 days ago • 1

spectator2026/Infinity-Parser2-Pro-AWQ-W4A16

Image-Text-to-Text • 35B • Updated 13 days ago • 80 • 1

groxaxo/LocateAnything-3B-AutoRound-W4A16

Image-Text-to-Text • 1B • Updated 3 days ago • 3 • 1

RedHatAI/Qwen2-VL-72B-Instruct-FP8-dynamic

Image-Text-to-Text • 73B • Updated Mar 31, 2025 • 23

RedHatAI/Mixtral-8x22B-v0.1-quantized.w4a16

141B • Updated Jan 3, 2025 • 21

RedHatAI/Mixtral-8x7B-v0.1-quantized.w4a16

47B • Updated Mar 1, 2025 • 221

RedHatAI/QwQ-32B-Preview-quantized.w4a16

33B • Updated Jan 3, 2025 • 6

RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w4a16

Text Generation • 71B • Updated Jan 3, 2025 • 12

RedHatAI/granite-3.1-8b-instruct-quantized.w4a16

Text Generation • 8B • Updated 2 days ago • 1.16k • 1

RedHatAI/granite-3.1-2b-instruct-quantized.w4a16

Text Generation • 3B • Updated Feb 28, 2025 • 38.1k

RedHatAI/DeepSeek-V2.5-1210-quantized.w4a16

Text Generation • 238B • Updated Jan 11, 2025 • 38

RedHatAI/DeepSeek-Coder-V2-Instruct-0724-quantized.w4a16

Text Generation • 238B • Updated Jan 12, 2025 • 43 • 1

RedHatAI/granite-3.1-2b-base-quantized.w4a16

Text Generation • 3B • Updated Feb 28, 2025 • 13

RedHatAI/granite-3.1-8b-base-quantized.w4a16

Text Generation • 8B • Updated 2 days ago • 98 • 1

RedHatAI/Qwen2-VL-72B-Instruct-quantized.w4a16

Image-Text-to-Text • 74B • Updated Mar 31, 2025 • 9

RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w4a16

Text Generation • 24B • Updated 2 days ago • 422 • 1

RedHatAI/Phi-3-vision-128k-instruct-W4A16-G128

Text Generation • 4B • Updated Feb 10, 2025 • 35 • 1