Inference Providers
Active filters: vptq
VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-0-woft
9B • Updated • 1
VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-16384-woft
9B • Updated • 2
• 2
VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-256-woft
13B • Updated • 2
VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-65536-woft
10B • Updated • 8
• 1
VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-65536-woft
17B • Updated • 2
• 2
VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-1024-woft
8B • Updated • 1
VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-4096-woft
8B • Updated • 9
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k65536-256-woft
42B • Updated • 2
• 1
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k65536-65536-woft
55B • Updated • 6
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-256-woft
9B • Updated • 2
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-65536-woft
8B • Updated • 13
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-65536-woft
11B • Updated • 4
• 5
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-1024-woft
6B • Updated • 18
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-0-woft
7B • Updated • 2
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-16384-woft
7B • Updated • 13
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-256-woft
6B • Updated • 5
• 1
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v8-k65536-256-woft
9B • Updated • 2
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v16-k65536-16384-woft
7B • Updated • 2
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v8-k65536-0-woft
7B • Updated • 2
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v16-k65536-65536-woft
8B • Updated • 7
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v8-k65536-65536-woft
11B • Updated • 2
• 1
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v16-k65536-1024-woft
6B • Updated • 4
• 1
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v12-k65536-4096-woft-vllm
2B • Updated • 2
alpindale/Meta-Llama-3.1-405B-Instruct-v16-k65536-256-woft-perm
24B • Updated • 3
• 2
Tiberiw/TinyLlama-1.1B-VPTQ
0.3B • Updated • 3
Tiberiw/TinyLlama-1.1B-VPTQ-v8-k65536-256
0.3B • Updated • 2
Tiberiw/TinyLlama-1.1B-VPTQ-v8-k65536-65536
0.4B • Updated • 1
Tiberiw/TinyLlama-1.1B-VPTQ-v8-k65536-0
0.3B • Updated • 3
Tiberiw/TinyLlama-1.1B-VPTQ-v16-k256-0
0.1B • Updated • 1
Tiberiw/TinyLlama-1.1B-VPTQ-v8-k16384-0
0.2B • Updated • 1