ggml : add AVX512DQ requirement for AVX512 builds (llama/9622) 14b5848 Eric Zhang commited on Sep 24, 2024
log : add CONT level for continuing previous log entry (llama/9610) a29a4c5 ggerganov HF Staff commited on Sep 24, 2024
threads: improve ggml_barrier scaling with large number of threads (llama/9598) aca04d5 Max Krasnyansky commited on Sep 23, 2024
ggml : AVX512 gemm for Q4_0_8_8 (llama/9532) 7349efc Srihari-mcw ggerganov HF Staff commited on Sep 23, 2024
metal : use F32 prec for K*Q in vec FA (llama/9595) 99c4239 ggerganov HF Staff commited on Sep 23, 2024
Revert "[SYCL] fallback mmvq (ggml/9088)" (llama/9579) 5aceb3d Akarshan Biswas commited on Sep 23, 2024
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (llama/9526) 8ec75c3 R0CKSTAR commited on Sep 22, 2024
ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (llama/9573) 673df39 slaren commited on Sep 21, 2024
Update CUDA graph on scale change plus clear nodes/params (llama/9550) 6b63eb1 agray3 commited on Sep 21, 2024
CUDA: fix sum.cu compilation for CUDA < 11.7 (llama/9562) b305ecf JohannesGaessler commited on Sep 20, 2024
ggml : fix n_threads_cur initialization with one thread (llama/9538) af82b69 slaren Max Krasnyansky commited on Sep 18, 2024
threadpool : skip polling for unused threads (llama/9461) 9d11a7a Max Krasnyansky commited on Sep 17, 2024
ggml : link MATH_LIBRARY not by its full path (llama/9339) 07d57ec Michael Podvitskiy commited on Sep 16, 2024
cmake : do not hide GGML options + rename option (llama/9465) 8c32d36 ggerganov HF Staff commited on Sep 16, 2024
ggml : ggml_type_name return "NONE" for invalid values (llama/9458) 8a1bb27 Yuri Khrustalev commited on Sep 14, 2024
cmake : use list(APPEND ...) instead of set() + dedup linker (llama/9463) 5497c27 ggerganov HF Staff Michael Podvitskiy commited on Sep 14, 2024
riscv : modify Makefile and add a RISCV_VECT to print log info (llama/9442) f77ad34 Ahmad Tameem commited on Sep 12, 2024
cann: Fix error when running a non-exist op (llama/9424) 74dcc66 Xinpeng Dou commited on Sep 12, 2024
CUDA: fix --split-mode row race condition (llama/9413) b021272 JohannesGaessler commited on Sep 11, 2024
metal : fix compile warning with GGML_METAL_NDEBUG (llama/0) dfc0bf0 ggerganov HF Staff commited on Sep 10, 2024
ggml : vector length agnostic SVE support (llama/9290) 189a444 Prashant Vithule ggerganov HF Staff commited on Sep 9, 2024
CUDA: fix variable name conflict for Windows build (llama/9382) 44d4193 JohannesGaessler commited on Sep 9, 2024
Overlap cmdbuffer creation and cmdbuffer execution in Vulkan backend by submitting smaller cmdbuffers early. (llama/9118) e374798 Markus Tavenrath commited on Sep 8, 2024
add check malloc result on device (llama/9346) cf68be7 Neo Zhang Jianyu arthw commited on Sep 8, 2024
ggml/examples: add backend support for numerical optimization (ggml/949) 5c178b0 JohannesGaessler ggerganov HF Staff slaren commited on Sep 20, 2024
examples : add null threadpool args where needed (ggml/0) 0bb7364 ggerganov HF Staff commited on Sep 8, 2024
metal : update support condition for im2col + fix warning (llama/0) ed9150a ggerganov HF Staff commited on Sep 8, 2024
ggml : always check bounds on get_rows operations (llama/9354) a13c99b slaren commited on Sep 7, 2024
ggml : fix missing `cpu_set_t` on emscripten (llama/9336) d8c76ac Xuan Son Nguyen commited on Sep 7, 2024
ggml-quants : ternary packing for TriLMs and BitNet b1.58 (llama/8151) d1c244a compilade commited on Sep 6, 2024