whisper.cpp / ggml-cuda /softmax.cu

Commit History

ggml : full ALiBi support (llama/7192)
192bda4

ggerganov HF Staff commited on

Fix more int overflow during quant (PPL/CUDA). (llama/6563)
531387f

dranger003 commited on

sync : ggml (#2001)
cbbfa9e
unverified

ggerganov HF Staff commited on