whisper.cpp / ggml /src /ggml-cuda /CMakeLists.txt

Commit History

CUDA: compress mode option and default to size (llama/12029)
4ec988a

Green-Sky commited on

CUDA: app option to compile without FlashAttention (llama/12025)
fbc5f16

JohannesGaessler commited on

CUDA: correct the lowest Maxwell supported by CUDA 12 (llama/11984)
6641178

PureJourney JohannesGaessler commited on

cuda : add ampere to the list of default architectures (llama/11870)
1d19dec

Diego Devesa commited on

CUDA: use mma PTX instructions for FlashAttention (llama/11583)
f328957

JohannesGaessler Diego Devesa commited on

ggml : sync remnants (skip) (#0)
451937f
unverified

ggerganov HF Staff commited on

ggml : sync resolve (skip) (#0)
d4d67dc

ggerganov HF Staff commited on