inference-optimization/Meta-Llama-3-8B-Instruct-spinquantR1R2R4-w4a16-gptq 2B • Updated about 6 hours ago
inference-optimization/Meta-Llama-3-8B-Instruct-spinquantR1R2R4-w4a16-gptq 2B • Updated about 6 hours ago
inference-optimization/Meta-Llama-3-8B-Instruct-spinquantR1R2R4-w4a16-qmod 2B • Updated about 6 hours ago
inference-optimization/Meta-Llama-3-8B-Instruct-spinquantR1R2R4-w4a16-qmod 2B • Updated about 6 hours ago
inference-optimization/Meta-Llama-3-8B-Instruct-spinquantR1R2R4-nvfp4-qmod 5B • Updated about 6 hours ago
inference-optimization/Meta-Llama-3-8B-Instruct-spinquantR1R2R4-nvfp4-qmod 5B • Updated about 6 hours ago
inference-optimization/Meta-Llama-3-8B-Instruct-spinquantR1R2R4-nvfp4-gptq 5B • Updated about 6 hours ago
inference-optimization/Meta-Llama-3-8B-Instruct-spinquantR1R2R4-nvfp4-gptq 5B • Updated about 6 hours ago