Commit History
sync : ggml (#2001) cbbfa9e unverified
ggml : reuse quantum structs across backends (llama/5943) bb0625f unverified
Better 1.5 bit quantization (llama/5971) f3a62cc unverified
ggml : remove old quantization functions (llama/5942) 11a2545 unverified
ggml : add ggml-common.h to deduplicate shared code (llama/5940) 0a37735 unverified
ggml : make i-quants work with super-blocks of 64 (CPU,Metal) (llama/5760) 9a07f42 unverified
IQ4_XS: a 4.25 bpw quantization (llama/5747) 0ee1bfb unverified
IQ3_S: a much better alternative to Q3_K (llama/5676) 32589c9 unverified
sync : llama.cpp (ggml/0) f8e8d34 unverified
1.5 bit quantization (llama/5453) 9c3aa6a unverified
ggml : add mmla kernels for quantized GEMM (llama/4966) 0d50a29 unverified
snadampal committed on