Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

431

Base only

Active filters: torchao

DreamerRimmer/Huihui-Qwen3-4B-Instruct-2507-abliterated-ao-int4wo

Text Generation • Updated Nov 22, 2025 • 1

Float16-cloud/typhoon-ocr1.5-2b-int8

Image-Text-to-Text • Updated Nov 23, 2025 • 11 • 2

shawntao/Qwen2.5-VL-3B-Instruct-torchao-int8_weight_only

Updated Nov 24, 2025 • 1

namgyu-youn/Qwen3-0.6B-INT8-INT4-SINQ

Updated Dec 2, 2025 • 4

liangel/Qwen3-8B-FP8

Text Generation • 8B • Updated Dec 9, 2025 • 4

liangel/Qwen3-8B-INT4

Text Generation • 8B • Updated Dec 3, 2025 • 1

liangel/Qwen3-8B-AWQ-INT4

Text Generation • 8B • Updated Dec 9, 2025 • 2

liangel/Qwen3-8B-INT8-INT4

Text Generation • 8B • Updated Dec 3, 2025 • 5

namgyu-youn/Qwen3-30B-A3B-Thinking-2507-INT8-INT4-SINQ

Updated Nov 30, 2025 • 1

rahul7star/Xd

Updated Nov 27, 2025

rahul7star/gemma-3bit

Updated Dec 2, 2025 • 1

pytorch/Phi-4-mini-instruct-parq-2w-4e-shared

Text Generation • Updated Dec 16, 2025 • 413

pytorch/Phi-4-mini-instruct-parq-3w-4e-shared

Text Generation • Updated Dec 16, 2025 • 420

metascroy/Qwen3-4B-int8-int4-unsloth-v3

Text Generation • Updated Dec 5, 2025 • 7

metascroy/Ministral-3-3B-Instruct-2512-int8-int4-unsloth

Image-Text-to-Text • Updated Dec 4, 2025 • 1

metascroy/Qwen3-4B-int8-int4-unsloth-v4-torchao

Text Generation • Updated Dec 5, 2025 • 14

Ba2han/fp8-translate-model-torchao

Updated Dec 7, 2025 • 2

pytorch/Phi-4-mini-instruct-parq-4w-4e-shared-gsm

Text Generation • Updated Dec 16, 2025 • 421

imdatta0/qwen3_4b_sft_executorch-torchao

Text Generation • Updated Dec 10, 2025 • 1

Novaciano/NSFW-3.2-1B-torchao-int8_weight_only

Updated Dec 10, 2025 • 1

Novaciano/NSFW-3.2-1B-torchao-int8_dynamic_activation_int8_weight

Updated Dec 10, 2025 • 1

Novaciano/NSFW_RP-3.2-1B-torchao-int8_weight_only

Updated Dec 10, 2025 • 1

Novaciano/NSFW_RP-3.2-1B-torchao-int8_dynamic_activation_int8_weight

Updated Dec 10, 2025 • 1

doublemathew/Qwen3-1.7B-int8-int4-unsloth-v3

Text Generation • Updated Dec 10, 2025 • 3

metascroy/Qwen3-0.6B-int8-int4-unsloth

Text Generation • Updated Dec 10, 2025 • 17

Entity-27th/Qwen3-RP-Pruned

Text Generation • 7B • Updated Dec 13, 2025 • 7

doublemathew/Qwen3-4B-int8-int4-unsloth-v3

Text Generation • Updated Dec 15, 2025 • 3

namgyu-youn/Qwen3-8B-INT4

Updated Dec 18, 2025 • 4

doublemathew/Qwen3-4B-int8-int4-unsloth

Text Generation • Updated Dec 16, 2025 • 7

namgyu-youn/Qwen3-8B-W8A8-FP

Text Generation • Updated Dec 18, 2025 • 1