Commit History

ggml : do not define GGML_USE_CUDA when building with GGML_BACKEND_DL (llama/11211)
79f750d

rgerganov committed on

Vulkan: Fix float16 use on devices without float16 support + fix subgroup_size_control validation error (llama/11161)
5ad3f1d

OccamRazor committed on

SYCL: Refactor ggml_sycl_compute_forward (llama/11121)
fa23a38

qnixsynapse committed on

fix: add missing msg in static_assert (llama/11143)
8c60d6a

hydaitw committed on

llamafile : ppc64le MMA INT8 implementation (llama/10912)
6f18eed

amritahs-ibm committed on

Disable GL_KHR_cooperative_matrix Vulkan extension if not available. (llama/11117)
623b74d

mbaudier committed on

fix: Vulkan shader gen binary path when Cross-compiling (llama/11096)
966a7bb

ag2s20150909 committed on

GGUF: C++ refactor, backend support, misc fixes (llama/11030)
21c5b64

JohannesGaessler committed on

ggml-backend : only offload from host buffers (fix) (llama/11124)
9ac3c7e

Diego Devesa committed on

ggml-backend : only offload from host buffers (llama/11120)
1ca87a8

Diego Devesa committed on

rpc : code cleanup (llama/11107)
a0fb22d

rgerganov committed on

SYCL: Use get_multi_ptr instead of deprecated get_pointer in wkv6 (llama/11087)
4ed93cc

qnixsynapse committed on

CUDA: add BF16 support (llama/11093)
961ef57

JohannesGaessler committed on

Vulkan: Add device-specific blacklist for coopmat for the AMD proprietary driver (llama/11074)
4d90c3d

OccamRazor committed on

Support for models with non-512-aligned tensors over RPC. (llama/11047)
895a3a2

Billy462, Diego Devesa committed on

fix: Vulkan shader gen binary path (llama/11037)
7008fb8

Gilad S. committed on

ggml : allow loading backend with env variable (ggml/1059)
48aa6d0

rgerganov committed on

ggml : do not install metal source when embed library (ggml/1054)
9615cf2

ggerganov (HF Staff) committed on

metal : avoid uint (llama/11019)
b788516

ggerganov (HF Staff) committed on

ggml : fixes for AVXVNNI instruction set with MSVC and Clang (llama/11027)
d13ac16

Srihari-mcw, slaren committed on

vulkan: optimize mul_mat for small values of N (llama/10991)
5fc8eea

jeffbolznv committed on

vulkan: im2col and matmul optimizations for stable diffusion (llama/10942)
beef268

jeffbolznv committed on

vulkan: Use push constant offset to handle misaligned descriptors (llama/10987)
04e729a

jeffbolznv committed on

vulkan: multi-row k quants (llama/10846)
3bf5be1

Eve committed on

examples, ggml : fix GCC compiler warnings (llama/10983)
d7cf559

Peter committed on

ggml : more perfo with llamafile tinyblas on x86_64 (llama/10714)
b284406

Djip007 committed on

ggml : use wstring for backend search paths (llama/10960)
656e8b1

Diego Devesa committed on

ggml : fix arm enabled features check (llama/10961)
06cddad

Diego Devesa committed on

ggml : fix const usage in SSE path (llama/10962)
38e6172

Diego Devesa committed on

ggml : fix run-time on FreeBSD in get_executable_path() (llama/10948)
83b02bc

yuri@FreeBSD committed on

vulkan: build fixes for 32b (llama/10927)
f1e76ce

jeffbolznv committed on

vulkan: optimize coopmat2 dequant functions (llama/10855)
5e70c43

jeffbolznv committed on

ggml-cpu: replace NEON asm with intrinsics in ggml_gemv_q4_0_4x8_q8_0() (llama/10874)
21f8a02

Adrien Gallouët committed on

SYCL: Migrate away from deprecated ggml_tensor->backend (llama/10840)
a67a8ec

qnixsynapse committed on

ggml : add test for SVE and disable when it fails (llama/10906)
c90c972

Diego Devesa committed on

ggml: fix arm build with gcc (llama/10895)
43d87cd

Adrien Gallouët committed on

ggml : fix arm build (llama/10890)
e58e7a9

Diego Devesa, Adrien Gallouët committed on

tts : add OuteTTS support (llama/10784)
8d0f0ac

ggerganov (HF Staff) committed on

tests: add tests for GGUF (llama/10830)
e7722cb

JohannesGaessler committed on

ggml : improve inputs log sched_print_assignments (ggml/1053)
4427ede

danbev committed on

files : remove old sources
1da9474

ggerganov (HF Staff) committed on

ggml : update ggml_backend_cpu_device_supports_op (llama/10867)
2f11d1e

ggerganov (HF Staff) committed on

vulkan: bugfixes for small subgroup size systems + llvmpipe test (llama/10809)
9220b51

Eve committed on

rwkv6: add wkv6 support for Vulkan backend (llama/10829)
c7285d6

Zhiyuan Li, mollysama committed on

llama : add Qwen2VL support + multimodal RoPE (llama/10361)
219d12b

RzZ, ggerganov (HF Staff) committed on

Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (llama/10693)
83a0899

lhez, Skyler Szot, Shangqing Gu, Alexander Angus, Hongqiang Wang, Max Krasnyansky committed on

Fix crash caused by ggml_backend_load_all when launching on Android Activity (llama/10812)
e1df33d

谢乃闻, Diego Devesa committed on

vulkan: small mul_mat_vec optimizations (llama/10665)
ec98109

Eve committed on

SYCL: Reduce most of the compiler warnings (llama/10748)
050e6ce

qnixsynapse, Abhilash Majumder committed on