Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

431

Base only

Active filters: torchao

bramy05/qwen3-8b-heretic-fp8-e4m3fn-wo

8B • Updated Feb 20 • 4

Gunulhona/Gemma-3-4B-AWQ-INT4

Image-Text-to-Text • Updated Feb 23 • 3

Gunulhona/Gemma-3-27B-v2-AWQ-INT4

Image-Text-to-Text • Updated Feb 23 • 2

CEIA-POSITIVO/Qwen-1.7B-capado_q4-torchao

Text Generation • Updated Mar 3 • 1

CEIA-POSITIVO/Qwen-1.7B-capado_QAT-torchao

Text Generation • Updated Mar 3 • 2

appy1234/gemma-3-27b-it-INT4

Image-Text-to-Text • 6B • Updated Mar 11 • 2

wantsleep/MiniPLM-Qwen-500M-INT4

Text Generation • 0.2B • Updated Mar 14 • 1

wantsleep/Qwen3-8B-INT4

Text Generation • 2B • Updated Mar 14 • 3

wantsleep/Qwen3-0.6B-INT4

Text Generation • 0.2B • Updated Mar 14

wantsleep/Eagle3-Qwen3-8B-zh-INT4

Text Generation • 0.9B • Updated Mar 15 • 3

namgyu-youn/gemma-3-27b-it-AWQ-INT4-v2

Image-Text-to-Text • Updated Apr 8 • 5

caiovicentino1/Qwen3.5-9B-EOQ-Dynamic-BitPacked

5B • Updated Apr 6 • 13 • 1

mingxilei/functiongemma-270m-it-ft-QAT-torchao

Text Generation • Updated Apr 1 • 3

UdayG01/orpheus_3b-hi-ft-qat-torchao

Updated Apr 2 • 5

AlekseyCalvin/LYRICAL_POET_Gemma4e2b_v1_fp8

Image-Text-to-Text • 5B • Updated Apr 9 • 5

SocialLocalMobile/Qwen3.6-35B-A3B-HQQ-INT4

Image-Text-to-Text • 20B • Updated Apr 17 • 4

ghidav/qwen3-1.7b-nvfp4

Text Generation • Updated Apr 17 • 1.62k

Jessylg27/tribev2-lite-qv

Updated Apr 19 • 51 • 2

Jessylg27/tribev2-balanced-qv

Updated Apr 19 • 13

Jessylg27/tribev2-high-quality-qv

Updated Apr 19 • 22

luongnguyenminhan/qwen3_0.6B_4bit-torchao

Text Generation • Updated Apr 26 • 2

amd/Llama-3.1-8B-Instruct-da8w8-torchao-v0.16.0

Text Generation • Updated Apr 30 • 2.35k • 1

amd/Qwen2.5-VL-7B-Instruct-da8w8-torchao-v0.16.0

Image-Text-to-Text • Updated May 4 • 9

amd/Phi-4-da8w8-torchao-v0.16.0

Text Generation • Updated May 4 • 2k

amd/Qwen3-14B-Instruct-da8w8-torchao-v0.16.0

Text Generation • Updated May 4 • 1.76k

lmq1909/checkpoint-100e-1k-multitask-int4-torchao

Text Generation • Updated May 5 • 4

hansacha/qwen-image-2512-fp8-diffusers

Updated May 6 • 101

hansacha/qwen-image-edit-2511-fp8-diffusers

Updated May 6 • 93 • 1

bytkim/Qwen3.6-27B-int4-MTP-1k

6B • Updated May 7 • 4

bytkim/Qwen3.6-27B-int4-MTP2-1k

6B • Updated May 7 • 6