Spaces:
Running
Running
Commit History
Vulkan: Fix float16 use on devices without float16 support + fix subgroup_size_control validation error (llama/11161) 5ad3f1d
SYCL: Refactor ggml_sycl_compute_forward (llama/11121) fa23a38
fix: add missing msg in static_assert (llama/11143) 8c60d6a
llamafile : ppc64le MMA INT8 implementation (llama/10912) 6f18eed
amritahs-ibm commited on
Disable GL_KHR_cooperative_matrix Vulkan extension if not available. (llama/11117) 623b74d
fix: Vulkan shader gen binary path when Cross-compiling (llama/11096) 966a7bb
ag2s20150909 commited on
GGUF: C++ refactor, backend support, misc fixes (llama/11030) 21c5b64
ggml-backend : only offload from host buffers (fix) (llama/11124) 9ac3c7e
Diego Devesa commited on
ggml-backend : only offload from host buffers (llama/11120) 1ca87a8
Diego Devesa commited on
rpc : code cleanup (llama/11107) a0fb22d
SYCL: Use get_multi_ptr instead of deprecated get_pointer in wkv6 (llama/11087) 4ed93cc
CUDA: add BF16 support (llama/11093) 961ef57
Vulkan: Add device-specific blacklist for coopmat for the AMD proprietary driver (llama/11074) 4d90c3d
Support for models with non-512-aligned tensors over RPC. (llama/11047) 895a3a2
fix: Vulkan shader gen binary path (llama/11037) 7008fb8
Gilad S. commited on
ggml : allow loading backend with env variable (ggml/1059) 48aa6d0
ggml : do not install metal source when embed library (ggml/1054) 9615cf2
metal : avoid uint (llama/11019) b788516
ggml : fixes for AVXVNNI instruction set with MSVC and Clang (llama/11027) d13ac16
Srihari-mcw slaren commited on
vulkan: optimize mul_mat for small values of N (llama/10991) 5fc8eea
vulkan: im2col and matmul optimizations for stable diffusion (llama/10942) beef268
vulkan: Use push constant offset to handle misaligned descriptors (llama/10987) 04e729a
vulkan: multi-row k quants (llama/10846) 3bf5be1
Eve commited on
examples, ggml : fix GCC compiler warnings (llama/10983) d7cf559
Peter commited on
ggml : more perfo with llamafile tinyblas on x86_64 (llama/10714) b284406
Djip007 commited on
ggml : use wstring for backend search paths (llama/10960) 656e8b1
Diego Devesa commited on
ggml : fix arm enabled features check (llama/10961) 06cddad
Diego Devesa commited on
ggml : fix const usage in SSE path (llama/10962) 38e6172
Diego Devesa commited on
ggml : fix run-time on FreeBSD in get_executable_path() (llama/10948) 83b02bc
yuri@FreeBSD commited on
vulkan: build fixes for 32b (llama/10927) f1e76ce
vulkan: optimize coopmat2 dequant functions (llama/10855) 5e70c43
ggml-cpu: replace NEON asm with intrinsics in ggml_gemv_q4_0_4x8_q8_0() (llama/10874) 21f8a02
Adrien Gallouët commited on
SYCL: Migrate away from deprecated ggml_tensor->backend (llama/10840) a67a8ec
ggml : add test for SVE and disable when it fails (llama/10906) c90c972
Diego Devesa commited on
ggml: fix arm build with gcc (llama/10895) 43d87cd
Adrien Gallouët commited on
ggml : fix arm build (llama/10890) e58e7a9
Diego Devesa Adrien Gallouët commited on
tts : add OuteTTS support (llama/10784) 8d0f0ac
tests: add tests for GGUF (llama/10830) e7722cb
ggml : improve inputs log sched_print_assignments (ggml/1053) 4427ede
files : remove old sources 1da9474
ggml : update ggml_backend_cpu_device_supports_op (llama/10867) 2f11d1e
vulkan: bugfixes for small subgroup size systems + llvmpipe test (llama/10809) 9220b51
Eve commited on
rwkv6: add wkv6 support for Vulkan backend (llama/10829) c7285d6
Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (llama/10693) 83a0899
lhez Skyler Szot Shangqing Gu Alexander Angus Hongqiang Wang Max Krasnyansky commited on
Fix crash caused by ggml_backend_load_all when launching on Android Activity (llama/10812) e1df33d
谢乃闻 Diego Devesa commited on
vulkan: small mul_mat_vec optimizations (llama/10665) ec98109
Eve commited on