Active filters: exllama
zakoman/TinyLlama-1.1B-intermediate-step-1195k-token-2.5T-exl2
Text Generation
• Updated • 7
brucethemoose/Yi-34B-200K-RPMerge
Text Generation
• 34B • Updated • 86
• 60
brucethemoose/Yi-34B-200K-RPMerge-exl2-31bpw
Text Generation
• Updated • 6
brucethemoose/Yi-34B-200K-RPMerge-exl2-40bpw
Text Generation
• Updated • 15
• 20
brucethemoose/Yi-34B-200K-RPMerge-exl2-267bpw
Text Generation
• Updated • 9
• 1
nold/Yi-34B-200K-RPMerge-GGUF
34B • Updated • 22
• 6
LoneStriker/Yi-34B-200K-RPMerge-AWQ
Text Generation
• 34B • Updated • 10
• 1
LoneStriker/Yi-34B-200K-RPMerge-GPTQ
Text Generation
• Updated • 7
• 3
bartowski/Yi-34B-200K-RPMerge-exl2
Text Generation
• Updated • 7
MarinaraSpaghetti/brucethemoose_Yi-34B-200K-RPMerge-4.65bpw-h6-exl2
Text Generation
• Updated • 10
• 2
JayhC/Yi-34B-200K-RPMerge-6bpw-h6-exl2
Text Generation
• Updated • 2
backyardai/Yi-34B-200K-RPMerge-GGUF
34B • Updated • 141
• 2
mradermacher/Yi-34B-200K-RPMerge-GGUF
34B • Updated • 45
mradermacher/Yi-34B-200K-RPMerge-i1-GGUF
34B • Updated • 147
roleplaiapp/phi-4-2.5bpw-exl2
Text Generation
• Updated • 3
roleplaiapp/phi-4-6.0bpw-exl2
Text Generation
• Updated • 3
roleplaiapp/phi-4-8.0bpw-exl2
Text Generation
• Updated • 3
roleplaiapp/phi-4-4.0bpw-exl2
Text Generation
• Updated • 32
roleplaiapp/phi-4-Q3_K_S-GGUF
Text Generation
• 15B • Updated • 14
• 1
roleplaiapp/phi-4-Q5_K_M-GGUF
Text Generation
• 15B • Updated • 21
Casual-Autopsy/Maginum-Cydoms-24B-exl3
Text Generation
• Updated • 10
• 1
blockblockblock/Qwen3.5-4B-8bpw-exl3
Text Generation
• Updated • 140
blockblockblock/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-8bpw-exl3
Text Generation
• 5B • Updated • 253
blockblockblock/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-4bpw-exl3
Text Generation
• Updated • 254
blockblockblock/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2-8bpw-exl3
Text Generation
• 5B • Updated • 230
• 1
blockblockblock/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2-6bpw-exl3
Text Generation
• 4B • Updated • 26
blockblockblock/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2-5bpw-exl3
Text Generation
• Updated • 26
blockblockblock/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2-4bpw-exl3
Text Generation
• Updated • 33