metal : simplify kernel arguments using a struct (ggml/3229) (llama/12194) 092277a BB-fat alexju committed on Mar 7, 2025
cuda/cpu: Increase support for fp16 unary operations (ggml/1125) 67e8c32 cmdr2 committed on Feb 28, 2025
metal : copy kernels for quant to F32/F16 conversions (llama/12017) 6c8e7ec Garf ggerganov HF Staff committed on Feb 25, 2025
metal : fix the crash caused by the lack of residency set support on Intel Macs. (llama/11904) afbd891 Hale Chan committed on Feb 16, 2025
metal : avoid breaking build when metal API predates TARGET_OS_VISION (llama/11690) 5bdb244 charles-dyfis-net committed on Feb 6, 2025
metal : adjust support conditions for norm operators (llama/11671) 5eb35ab ggerganov HF Staff committed on Feb 5, 2025
CUDA: non-contiguous (RMS) norm support (llama/11659) 4c2e171 JohannesGaessler ggerganov HF Staff committed on Feb 4, 2025
metal: Handle null returned from MTLCreateSystemDefaultDevice() (llama/11441) 4e38ed4 Ihar Hrachyshka committed on Jan 27, 2025
ggml : do not install metal source when embed library (ggml/1054) 9615cf2 ggerganov HF Staff committed on Jan 3, 2025
llama : add Qwen2VL support + multimodal RoPE (llama/10361) 219d12b RzZ ggerganov HF Staff committed on Dec 14, 2024
metal : Extend how Llama.cpp locates metal resources (llama/10676) 44e7250 Robert Ormandi ggerganov HF Staff committed on Dec 7, 2024
ggml: add `GGML_SET` Metal kernel + i32 CPU kernel (ggml/1037) dd775d5 PABannier committed on Dec 4, 2024
metal : fix group_norm support condition (llama/0) 20ee62d ggerganov HF Staff committed on Nov 27, 2024
metal : enable mat-vec kernels for bs <= 4 (llama/10491) 6d07dee ggerganov HF Staff committed on Nov 25, 2024
ggml : add support for dynamic loading of backends (llama/10469) b73266f Diego Devesa ggerganov HF Staff committed on Nov 25, 2024
feat: add `GGML_UNARY_OP_ARGMAX` Metal kernel (ggml/1019) c7e59ef PABannier Diego Devesa committed on Dec 2, 2024
metal : add `GGML_OP_CONV_TRANSPOSE_1D` kernels (ggml/1026) 9c845f4 PABannier committed on Nov 28, 2024
metal : fix offset integer overflows in im2col (ggml/1015) efbd100 pacominev committed on Nov 18, 2024
metal : refactor kernel args into structs (llama/10238) 15659b4 ggerganov HF Staff committed on Nov 17, 2024
ggml : build backends as libraries (llama/10256) 3dc93f3 Diego Devesa ggerganov HF Staff R0CKSTAR committed on Nov 14, 2024