Inference Providers
Active filters: torchao
DreamerRimmer/Huihui-Qwen3-4B-Instruct-2507-abliterated-ao-int4wo
Text Generation
• Updated • 1
Float16-cloud/typhoon-ocr1.5-2b-int8
Image-Text-to-Text
• Updated • 11
• 2
shawntao/Qwen2.5-VL-3B-Instruct-torchao-int8_weight_only
namgyu-youn/Qwen3-0.6B-INT8-INT4-SINQ
Text Generation
• 8B • Updated • 4
Text Generation
• 8B • Updated • 1
liangel/Qwen3-8B-AWQ-INT4
Text Generation
• 8B • Updated • 2
liangel/Qwen3-8B-INT8-INT4
Text Generation
• 8B • Updated • 5
namgyu-youn/Qwen3-30B-A3B-Thinking-2507-INT8-INT4-SINQ
pytorch/Phi-4-mini-instruct-parq-2w-4e-shared
Text Generation
• Updated • 413
pytorch/Phi-4-mini-instruct-parq-3w-4e-shared
Text Generation
• Updated • 420
metascroy/Qwen3-4B-int8-int4-unsloth-v3
Text Generation
• Updated • 7
metascroy/Ministral-3-3B-Instruct-2512-int8-int4-unsloth
Image-Text-to-Text
• Updated • 1
metascroy/Qwen3-4B-int8-int4-unsloth-v4-torchao
Text Generation
• Updated • 14
Ba2han/fp8-translate-model-torchao
pytorch/Phi-4-mini-instruct-parq-4w-4e-shared-gsm
Text Generation
• Updated • 421
imdatta0/qwen3_4b_sft_executorch-torchao
Text Generation
• Updated • 1
Novaciano/NSFW-3.2-1B-torchao-int8_weight_only
Novaciano/NSFW-3.2-1B-torchao-int8_dynamic_activation_int8_weight
Novaciano/NSFW_RP-3.2-1B-torchao-int8_weight_only
Novaciano/NSFW_RP-3.2-1B-torchao-int8_dynamic_activation_int8_weight
doublemathew/Qwen3-1.7B-int8-int4-unsloth-v3
Text Generation
• Updated • 3
metascroy/Qwen3-0.6B-int8-int4-unsloth
Text Generation
• Updated • 17
Entity-27th/Qwen3-RP-Pruned
Text Generation
• 7B • Updated • 7
doublemathew/Qwen3-4B-int8-int4-unsloth-v3
Text Generation
• Updated • 3
namgyu-youn/Qwen3-8B-INT4
doublemathew/Qwen3-4B-int8-int4-unsloth
Text Generation
• Updated • 7
namgyu-youn/Qwen3-8B-W8A8-FP
Text Generation
• Updated • 1