ggml : sync latest llama.cpp (view_src + alloc improvements) (#1247) 8bb66c1 unverified ggerganov HF Staff commited on Sep 5, 2023
ggml : sync (ggml-alloc, GPU, eps, etc.) (#1220) d41ba35 unverified ggerganov HF Staff commited on Sep 5, 2023
ggml : fix compilation errors incurred by -Werror (#1227) 45ef7b5 unverified ChangSeok Oh commited on Aug 30, 2023
ggml : fix compiling when SSE3 is available but not SSSE3 (#1210) b7995b7 unverified Przemysław Pawełczyk commited on Aug 27, 2023
ci : more platforms coverage (#1101) c4448fa unverified alonfaraj Alon Faraj commited on Jul 16, 2023
Revert "ggml : do not use _GNU_SOURCE gratuitously (#1027)" 1e5ddb0 unverified ggerganov HF Staff commited on Jul 2, 2023
ggml : sync latest repo (mostly refactoring changes) d97fd69 unverified ggerganov HF Staff commited on Jul 2, 2023
ggml : do not use _GNU_SOURCE gratuitously (#1027) 3a69cdf unverified Przemysław Pawełczyk commited on Jun 25, 2023
ggml : sync ggml (clBLAST + tensor names) f50d3b3 unverified ggerganov HF Staff commited on May 2, 2023
whisper : add integer quantization support (#540) a5f8f3c unverified ggerganov HF Staff commited on Apr 30, 2023
ggml : use vzip instead of vuzp for consistency 741db99 unverified ggerganov HF Staff commited on Apr 29, 2023
ggml : sync with ggml repo (warning fixes + asserts) caf2759 unverified ggerganov HF Staff commited on Apr 29, 2023
ggml : sync latest ggml + llama.cpp updates (quantization) ede1268 unverified ggerganov HF Staff commited on Apr 29, 2023
ggml, ci : fix build on whisper.android (ARM_NEON) + add CI (#764) dedf05b unverified jhenhong commited on Apr 15, 2023
ggml : fix q4_1 dot product types (#759) 984a856 unverified novag ggerganov HF Staff commited on Apr 14, 2023
ggml : sync latest changes from ggml and llama.cpp 3bd52ce unverified ggerganov HF Staff commited on Apr 13, 2023
ggml : backport llama.cpp updates (close #709) bf6b4f8 unverified ggerganov HF Staff commited on Apr 10, 2023
talk-llama : add new example + sync ggml from llama.cpp (#664) a8c74e6 unverified ggerganov HF Staff commited on Mar 27, 2023
whisper : reduce memory usage during inference (#431) 3aa9e6c unverified ggerganov HF Staff commited on Feb 4, 2023
bench : add memcpy and ggml_mul_mat benchmarks a660ed9 unverified ggerganov HF Staff commited on Jan 18, 2023
ggml : remove obsolete zeroing + comment fixes (#390) 9c35c0d unverified ggerganov HF Staff commited on Jan 8, 2023
ggml : correct behaviour of ggml_vec_sum_f32 (#390) ffffc6e unverified Abitofevrything commited on Jan 8, 2023
ggml : improve vec_dot_f16 unrolling in flash_attn_f16 6e57274 unverified ggerganov HF Staff commited on Jan 8, 2023
ggml : fix bug in new soft max computation c59ce76 unverified ggerganov HF Staff commited on Jan 7, 2023
ggml : when using BLAS start only 1 CPU thread 6c4692f unverified ggerganov HF Staff commited on Jan 7, 2023
ggml : fix running tasks with variable number of threads 2078d85 unverified ggerganov HF Staff commited on Jan 7, 2023
ggml : unroll ggml_vec_dot_f16 in ggml_compute_forward_flash_attn_f16 f07fecd unverified ggerganov HF Staff commited on Jan 7, 2023
ggml : speed-up soft max via Accelerate + unroll fdaf59a unverified ggerganov HF Staff commited on Jan 7, 2023
ggml : use vDSP_sve and vDSP_maxv from Accelerate ed14a8b unverified ggerganov HF Staff commited on Jan 7, 2023
ggml : add SSE3 and fp16 conversion lookup table (#368) 2c3f7d4 unverified Abitofevrything ggerganov HF Staff commited on Jan 6, 2023