metal : optimize ggml_mul_mat_id (faster Mixtral PP) (llama/4725) 8bc6274 ggerganov HF Staff commited on Jan 2, 2024
metal : enable shader debugging (cmake option) (llama/4705) 7dd37dc ggerganov HF Staff commited on Jan 2, 2024
sync : ggml (ggml_scale, ggml_row_size, etc.) (#1677) aa86ade unverified ggerganov HF Staff commited on Dec 22, 2023
sync : ggml (Metal fixes, new ops, tests) (#1633) a0d4b48 unverified ggerganov HF Staff commited on Dec 13, 2023
sync : ggml (new ops, new backend, etc) (#1602) 895e87a unverified ggerganov HF Staff commited on Dec 7, 2023
whisper : add full CUDA and Metal offloading (#1472) da4acca unverified ggerganov HF Staff commited on Nov 12, 2023
sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422) 7006035 unverified ggerganov HF Staff Chris Raethke commited on Nov 3, 2023
metal : add F32 support + update bench output 02d7878 unverified ggerganov HF Staff commited on Sep 15, 2023
whisper : Metal and ggml-alloc support (#1270) 714ee6b unverified ggerganov HF Staff commited on Sep 15, 2023
sync : ggml (HBM + Metal + style) (#1264) 88deeba unverified ggerganov HF Staff commited on Sep 8, 2023
ggml : sync latest llama.cpp (view_src + alloc improvements) (#1247) 8bb66c1 unverified ggerganov HF Staff commited on Sep 5, 2023
ggml : sync (ggml-alloc, GPU, eps, etc.) (#1220) d41ba35 unverified ggerganov HF Staff commited on Sep 5, 2023
ggml : sync latest repo (mostly refactoring changes) d97fd69 unverified ggerganov HF Staff commited on Jul 2, 2023