whisper.cpp / ggml /src /ggml-musa

Commit History

CUDA: app option to compile without FlashAttention (llama/12025)
fbc5f16

JohannesGaessler commited on

MUSA: support ARM64 and enable dp4a .etc (llama/11843)
ab96dac

Bodhi Bodhi Hu commited on

CUDA: use mma PTX instructions for FlashAttention (llama/11583)
f328957

JohannesGaessler Diego Devesa commited on

ggml : remove old files (skip) (#0)
6284570
unverified

ggerganov HF Staff commited on

ggml : sync remnants (skip) (#0)
451937f
unverified

ggerganov HF Staff commited on

mtgpu: Add MUSA_DOCKER_ARCH in Dockerfiles && update cmake and make (llama/10516)
f2a87fc

R0CKSTAR commited on

ggml : add support for dynamic loading of backends (llama/10469)
b73266f

Diego Devesa ggerganov HF Staff commited on

ggml : sync resolve (skip) (#0)
d4d67dc

ggerganov HF Staff commited on

CUDA: remove DMMV, consolidate F16 mult mat vec (llama/10318)
e446f60

JohannesGaessler commited on

ggml : build backends as libraries (llama/10256)
3dc93f3

Diego Devesa ggerganov HF Staff R0CKSTAR commited on