ggml-backend : only offload from host buffers (fix) (llama/11124) 9ac3c7e Diego Devesa committed on Jan 7, 2025
ggml-backend : only offload from host buffers (llama/11120) 1ca87a8 Diego Devesa committed on Jan 7, 2025
SYCL: Use get_multi_ptr instead of deprecated get_pointer in wkv6 (llama/11087) 4ed93cc qnixsynapse committed on Jan 7, 2025
Vulkan: Add device-specific blacklist for coopmat for the AMD proprietary driver (llama/11074) 4d90c3d OccamRazor committed on Jan 4, 2025
Support for models with non-512-aligned tensors over RPC. (llama/11047) 895a3a2 Billy462 Diego Devesa committed on Jan 4, 2025
whisper : fix gpu device selection (#2728) 87b427e ggerganov committed on Jan 13, 2025
server : generate unique tmp filenames (#2718) 89d94b1 NETZkultur committed on Jan 13, 2025
whisper : add whisper_full_get_segment_no_speech_prob_from_state (#2716) cb32a92 Sandro Hanea committed on Jan 9, 2025
docs: Fix main -> whisper-cli in download scripts (#2707) 4abfe5a Adam Jones committed on Jan 6, 2025
cli : fix segfault on missing argument (#2700) 245a91f Yusuf Redžić committed on Jan 4, 2025
ggml : do not install metal source when embed library (ggml/1054) 9615cf2 ggerganov committed on Jan 3, 2025
ggml : fixes for AVXVNNI instruction set with MSVC and Clang (llama/11027) d13ac16 Srihari-mcw slaren committed on Dec 31, 2024
vulkan: optimize mul_mat for small values of N (llama/10991) 5fc8eea jeffbolznv committed on Dec 30, 2024
vulkan: im2col and matmul optimizations for stable diffusion (llama/10942) beef268 jeffbolznv committed on Dec 29, 2024
vulkan: Use push constant offset to handle misaligned descriptors (llama/10987) 04e729a jeffbolznv committed on Dec 29, 2024
ggml : more perfo with llamafile tinyblas on x86_64 (llama/10714) b284406 Djip007 committed on Dec 24, 2024
ggml : use wstring for backend search paths (llama/10960) 656e8b1 Diego Devesa committed on Dec 24, 2024
ggml : fix run-time on FreeBSD in get_executable_path() (llama/10948) 83b02bc yuri@FreeBSD committed on Dec 23, 2024
vulkan: optimize coopmat2 dequant functions (llama/10855) 5e70c43 jeffbolznv committed on Dec 21, 2024
ggml-cpu: replace NEON asm with intrinsics in ggml_gemv_q4_0_4x8_q8_0() (llama/10874) 21f8a02 Adrien Gallouët committed on Dec 20, 2024
SYCL: Migrate away from deprecated ggml_tensor->backend (llama/10840) a67a8ec qnixsynapse committed on Dec 20, 2024
ggml : add test for SVE and disable when it fails (llama/10906) c90c972 Diego Devesa committed on Dec 20, 2024
ggml : improve inputs log sched_print_assignments (ggml/1053) 4427ede danbev committed on Dec 19, 2024
readme : fix real-time audio input example build instructions (#2692) 43720b1 Samuel Durante committed on Jan 2, 2025
docs : replace Core ML with OpenVINO (#2686) db94c1c Konosuke Sakai committed on Jan 2, 2025