Commit History

metal : fox offset integer overflows in im2col (ggml/1015)
efbd100

pacominev commited on

Vulkan: Fix device info output format specifiers (llama/10366)
8000df9

OccamRazor commited on

metal : add `GGML_UNARY_OP_ELU` kernel (ggml/1018)
5959420

PABannier commited on

CUDA: fix MMV kernel being used for FP16 src1 (llama/10357)
af4dff1

JohannesGaessler commited on

CMake: fix typo in comment [no ci] (llama/10360)
d324d0b

JohannesGaessler commited on

llama : only use default buffer types for the KV cache (llama/10358)
9e9c0ad

Diego Devesa commited on

metal : refactor kernel args into structs (llama/10238)
15659b4

ggerganov HF Staff commited on

ggml : fix undefined reference to 'getcpu' (llama/10354)
2f9b147

FirstTimeEZ commited on

CUDA: remove DMMV, consolidate F16 mult mat vec (llama/10318)
e446f60

JohannesGaessler commited on

CMake: default to -arch=native for CUDA build (llama/10320)
66edfb6

JohannesGaessler commited on

ggml : fix possible buffer use after free in sched reserve (llama/9930)
4703ea3

Diego Devesa commited on

ggml : inttypes.h -> cinttypes (llama/0)
6ba2c8f

ggerganov HF Staff commited on

ggml : adapt AMX to tensor->grad removal (llama/0)
8a67e9f

ggerganov HF Staff commited on

ggml : fix compile warnings (llama/0)
80d6ec0

ggerganov HF Staff commited on

llamafile : fix include path (llama/0)
e443f89

ggerganov HF Staff commited on

vulkan: Optimize some mat-vec mul quant shaders (llama/10296)
dc0e685

jeffbolznv commited on

ggml : optimize Q4_0 into Q4_0_X_Y repack (llama/10324)
abf6f22

Dan Johansson commited on

Make updates to fix issues with clang-cl builds while using AVX512 flags (llama/10314)
2868c2b

Srihari-mcw commited on

ggml: new optimization interface (ggml/988)
dd33ace

JohannesGaessler commited on

ggml : remove duplicated sources from the last sync (ggml/1017)
026d20b

ggerganov HF Staff commited on

ggml : fix some build issues
c5ba1d1

slaren commited on

sync : leftovers (ggml/0)
0f6c498

ggerganov HF Staff commited on

cmake : restore CMakeLists.txt (llama/10256)
51a70ff

ggerganov HF Staff commited on

AVX BF16 and single scale quant optimizations (llama/10212)
e6ffed3

Eve commited on

sycl: Use syclcompat::dp4a (llama/10267)
ce0dc30

Romain Biessy commited on

backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (llama/9921)
3541ee8

Charles Xu Diego Devesa commited on

ggml : build backends as libraries (llama/10256)
3dc93f3

Diego Devesa ggerganov HF Staff R0CKSTAR commited on

scripts : update sync
1741306

ggerganov HF Staff commited on

release : v1.7.2
414329d
unverified

ggerganov HF Staff commited on

sycl: fix example build (#2570)
a0dcffc
unverified

Stefan Sydow commited on

ci : use local ggml in Android build (#2567)
72b7501
unverified

ggerganov HF Staff commited on

ggml : tmp workaround for whisper.cpp (skip) (#2565)
ef26f48
unverified

ggerganov HF Staff commited on

update : readme
d1fa03c
unverified

ggerganov HF Staff commited on

scripts : fix sync path
9a2f912
unverified

ggerganov HF Staff commited on

whisper.swiftui : switch Mac dest to Mac (Designed for iPad) (#2562)
13f2beb
unverified

jhenhong commited on

cmake : fix ppc64 check (#0)
f3c3fca

ggerganov HF Staff commited on

whisper : include ggml-cpu.h (#0)
cb35171

ggerganov HF Staff commited on

build : fixes
11d19cb

ggerganov HF Staff commited on

talk-llama : sync llama.cpp
6bb34fb

ggerganov HF Staff commited on

whisper : fix build (#0)
dfd316d

ggerganov HF Staff commited on

sync : ggml
9e83be6

ggerganov HF Staff commited on

sycl : Fixes to broken builds and test-backend-ops (llama/10257)
9cfb13b

Alberto Cabrera Pérez commited on

vulkan: Optimize contiguous copies (llama/10254)
9974bd6

jeffbolznv commited on

vulkan: Throttle the number of shader compiles during the build step. (llama/10222)
9677a2f

jeffbolznv commited on

metal : more precise Q*K in FA vec kernel (llama/10247)
9160e8f

ggerganov HF Staff commited on

vulkan: Fix newly added tests for permuted mul_mat and 1D im2col (llama/10226)
76b8073

jeffbolznv commited on

metal : reorder write loop in mul mat kernel + style (llama/10231)
661360d

ggerganov HF Staff commited on

metal : fix build and some more comments (llama/10229)
93fc215

ggerganov HF Staff commited on

metal : fix F32 accumulation in FA vec kernel (llama/10232)
228e0b2

ggerganov HF Staff commited on

metal : hide debug messages from normal log
efefcbb

ggerganov HF Staff commited on