ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions (llama/12154) 05466a9 Rémy O commited on Mar 6, 2025
ggml : portability fixes for VS 2017 (llama/12150) 49e3343 mgroeber9110 Marcus Groeber commited on Mar 4, 2025
ggml : upgrade init_tensor API to return a ggml_status (llama/11854) d6b6852 William Tambellini slaren commited on Feb 28, 2025
ggml-cpu: Support s390x SIMD Instruction Set (llama/12019) 4aa54ec Aaron Teo Jinyang He junchao-zhao commited on Feb 22, 2025
ggml-cpu: Add CPU backend support for KleidiAI library (llama/11390) 9de6d81 Charles Xu commited on Feb 20, 2025
cleanup: fix compile warnings associated with gnu_printf (llama/11811) ef6a968 bandoti commited on Feb 12, 2025
vulkan: Make Vulkan optional at runtime (ggml/11493). (llama/11494) 762f497 Danny Milosavljevic jeffbolznv commited on Feb 10, 2025
CUDA: use mma PTX instructions for FlashAttention (llama/11583) f328957 JohannesGaessler Diego Devesa commited on Feb 2, 2025
CUDA: backwards pass for misc. ops, add tests (llama/11257) 2fbcec1 JohannesGaessler commited on Jan 16, 2025
RoPE: fix back, CUDA support for back + noncont. (llama/11240) 131a21e JohannesGaessler commited on Jan 15, 2025
GGUF: C++ refactor, backend support, misc fixes (skip) (llama/11030) 92311a3 JohannesGaessler commited on Jan 14, 2025
llama: add support for QRWKV6 model architecture (llama/11001) 4a6b7e0 mollysama ggerganov HF Staff compilade commited on Jan 10, 2025
GGUF: C++ refactor, backend support, misc fixes (llama/11030) 21c5b64 JohannesGaessler commited on Jan 7, 2025
llama : add Qwen2VL support + multimodal RoPE (llama/10361) 219d12b RzZ ggerganov HF Staff commited on Dec 14, 2024
Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (llama/10693) 83a0899 lhez Skyler Szot Shangqing Gu Alexander Angus Hongqiang Wang Max Krasnyansky commited on Dec 13, 2024
ggml: load all backends from a user-provided search path (llama/10699) c6de218 Gilad S Diego Devesa commited on Dec 11, 2024
ggml : refactor online repacking (llama/10446) 163128e Djip007 ggerganov HF Staff commited on Dec 7, 2024
ggml-cpu: support IQ4_NL_4_4 by runtime repack (llama/10541) bf73242 shupeif commited on Nov 28, 2024
ggml : add support for dynamic loading of backends (llama/10469) b73266f Diego Devesa ggerganov HF Staff commited on Nov 25, 2024
backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (llama/9921) 3541ee8 Charles Xu Diego Devesa commited on Nov 15, 2024
ggml : build backends as libraries (llama/10256) 3dc93f3 Diego Devesa ggerganov HF Staff R0CKSTAR commited on Nov 14, 2024
Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (llama/10133) f58e658 Zhiyuan Li ggerganov HF Staff Diego Devesa pacominev Yuri Khrustalev Meng, Hengyu commited on Nov 7, 2024
ggml : move CPU backend to a separate file (llama/10144) 0f447f2 Diego Devesa commited on Nov 3, 2024
llama : add simple-chat example (llama/10124) 41ff26f Diego Devesa Xuan Son Nguyen commited on Nov 1, 2024
llama : use smart pointers for ggml resources (llama/10117) 6b82135 Diego Devesa commited on Nov 1, 2024
kompute: add backend registry / device interfaces (llama/10045) b612415 slpnix commited on Oct 30, 2024
llama : refactor model loader with backend registry (llama/10026) 582a21e Diego Devesa commited on Oct 30, 2024
Adapt to dynamically loadable backends mechanism (llama/9970) f8d4728 leo-pony commited on Oct 22, 2024
Add SYCL Backend registry, device and Event Interfaces (llama/9705) f35cae5 Ouadie EL FAROUKI commited on Oct 18, 2024
vulkan : add backend registry / device interfaces (llama/9721) df2cb6e Diego Devesa commited on Oct 17, 2024
rpc : add backend registry / device interfaces (llama/9812) 4ac768e Diego Devesa commited on Oct 10, 2024
ggml : add backend registry / device interfaces to BLAS backend (llama/9752) 7f269bb Diego Devesa commited on Oct 7, 2024
ggml : add metal backend registry / device (llama/9713) b6adf19 ggerganov HF Staff slaren commited on Oct 7, 2024
ggml : alloc ggml_contexts on the heap (#2525) 3ccf40a unverified ggerganov HF Staff commited on Oct 31, 2024
ggml-backend : add device and backend reg interfaces (llama/9707) 9d74d85 Diego Devesa commited on Oct 3, 2024
ggml-backend : add device and backend reg interfaces (llama/9707) 1bdb50a Diego Devesa JohannesGaessler commited on Oct 2, 2024
ggml/ex: calculate accuracy in graph, adapt MNIST (ggml/980) 52069b8 JohannesGaessler commited on Oct 3, 2024
ggml: refactor cross entropy loss CPU impl. (ggml/976) 2a0805f JohannesGaessler commited on Oct 2, 2024