whisper.cpp

Running

App Files Files Community

whisper.cpp

Commit History

ggml : add AVX512DQ requirement for AVX512 builds (llama/9622)

14b5848

Eric Zhang commited on Sep 24, 2024

log : add CONT level for continuing previous log entry (llama/9610)

a29a4c5

ggerganov HF Staff commited on Sep 24, 2024

threads: fix msvc build without openmp (llama/9615)

97b3eb5

Max Krasnyansky commited on Sep 24, 2024

cuda: add q8_0->f32 cpy operation (llama/9571)

6201c74

Nekotekina commited on Sep 24, 2024

threads: improve ggml_barrier scaling with large number of threads (llama/9598)

aca04d5

Max Krasnyansky commited on Sep 23, 2024

ggml : AVX512 gemm for Q4_0_8_8 (llama/9532)

7349efc

Srihari-mcw

ggerganov HF Staff commited on Sep 23, 2024

metal : use F32 prec for K*Q in vec FA (llama/9595)

99c4239

ggerganov HF Staff commited on Sep 23, 2024

Revert "[SYCL] fallback mmvq (ggml/9088)" (llama/9579)

5aceb3d

Akarshan Biswas commited on Sep 23, 2024

musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (llama/9526)

8ec75c3

R0CKSTAR commited on Sep 22, 2024

Fix merge error in #9454 (llama/9589)

3142fa9

mollysama commited on Sep 22, 2024

CUDA: enable Gemma FA for HIP/Pascal (llama/9581)

97cb7ce

JohannesGaessler commited on Sep 22, 2024

RWKV v6: RWKV_WKV op CUDA implementation (llama/9454)

8d3e707

mollysama commited on Sep 22, 2024

ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (llama/9573)

673df39

slaren commited on Sep 21, 2024

Update CUDA graph on scale change plus clear nodes/params (llama/9550)

6b63eb1

agray3 commited on Sep 21, 2024

examples : adapt to ggml.h changes (ggml/0)

91c7734

ggerganov HF Staff commited on Sep 20, 2024

ggml : refactoring (llama/#0)

1b62c96

ggerganov HF Staff commited on Sep 20, 2024

ggml : fix builds (llama/0)

524a01b

ggerganov HF Staff commited on Sep 20, 2024

ggml : fix trailing whitespace (llama/0)

214f95e

ggerganov HF Staff commited on Sep 20, 2024

CUDA: fix sum.cu compilation for CUDA < 11.7 (llama/9562)

b305ecf

JohannesGaessler commited on Sep 20, 2024

ggml : fix n_threads_cur initialization with one thread (llama/9538)

af82b69

slaren Max Krasnyansky commited on Sep 18, 2024

threadpool : skip polling for unused threads (llama/9461)

9d11a7a

Max Krasnyansky commited on Sep 17, 2024

ggml : link MATH_LIBRARY not by its full path (llama/9339)

07d57ec

Michael Podvitskiy commited on Sep 16, 2024

cmake : do not hide GGML options + rename option (llama/9465)

8c32d36

ggerganov HF Staff commited on Sep 16, 2024

ggml : IQ4_NL sgemm + Q4_0 AVX optimization (llama/9422)

f2986f6

Eve commited on Sep 16, 2024

metal : handle zero-sized allocs (llama/9466)

868283e

ggerganov HF Staff commited on Sep 16, 2024

common : reimplement logging (llama/9418)

e893c97

ggerganov HF Staff commited on Sep 15, 2024

cmake : correct order of sycl flags (llama/9497)

45ddbb5

Michael Podvitskiy commited on Sep 15, 2024

cmake : try to fix sycl+intel build (llama/9487)

dd66fc9

Michael Podvitskiy commited on Sep 15, 2024

ggml : ggml_type_name return "NONE" for invalid values (llama/9458)

8a1bb27

Yuri Khrustalev commited on Sep 14, 2024

cmake : use list(APPEND ...) instead of set() + dedup linker (llama/9463)

5497c27

ggerganov HF Staff Michael Podvitskiy commited on Sep 14, 2024

cann: Add host buffer type for Ascend NPU (llama/9406)

7cbca42

Dou Xinpeng commited on Sep 12, 2024

riscv : modify Makefile and add a RISCV_VECT to print log info (llama/9442)

f77ad34

Ahmad Tameem commited on Sep 12, 2024

cann: Fix error when running a non-exist op (llama/9424)

74dcc66

Xinpeng Dou commited on Sep 12, 2024

CUDA: fix --split-mode row race condition (llama/9413)

b021272

JohannesGaessler commited on Sep 11, 2024

musa: remove Clang builtins mapping (llama/9421)

ba2469d

R0CKSTAR commited on Sep 11, 2024

sycl : update support conditions (llama/9394)

9a876d1

Alberto Cabrera Pérez commited on Sep 11, 2024

metal : fix compile warning with GGML_METAL_NDEBUG (llama/0)

dfc0bf0

ggerganov HF Staff commited on Sep 10, 2024

rpc : fix segfault with nkvo (llama/9389)

66ce884

rgerganov slaren commited on Sep 9, 2024

ggml : vector length agnostic SVE support (llama/9290)

189a444

Prashant Vithule

ggerganov HF Staff commited on Sep 9, 2024

CUDA: fix variable name conflict for Windows build (llama/9382)

44d4193

JohannesGaessler commited on Sep 9, 2024

Overlap cmdbuffer creation and cmdbuffer execution in Vulkan backend by submitting smaller cmdbuffers early. (llama/9118)

e374798

Markus Tavenrath commited on Sep 8, 2024

cuda : fix FA Q src index (1 -> 0) (llama/9374)

8cfb955

ggerganov HF Staff commited on Sep 8, 2024

add check malloc result on device (llama/9346)

cf68be7

Neo Zhang Jianyu arthw commited on Sep 8, 2024

ggml/examples: add backend support for numerical optimization (ggml/949)

5c178b0

JohannesGaessler

ggerganov HF Staff slaren commited on Sep 20, 2024

examples : add null threadpool args where needed (ggml/0)

0bb7364

ggerganov HF Staff commited on Sep 8, 2024

metal : update support condition for im2col + fix warning (llama/0)

ed9150a

ggerganov HF Staff commited on Sep 8, 2024

ggml : always check bounds on get_rows operations (llama/9354)

a13c99b

slaren commited on Sep 7, 2024

ggml : fix missing `cpu_set_t` on emscripten (llama/9336)

d8c76ac

Xuan Son Nguyen commited on Sep 7, 2024

Improve Vulkan shader build system (llama/9239)

9746f77

Markus Tavenrath commited on Sep 6, 2024

ggml-quants : ternary packing for TriLMs and BitNet b1.58 (llama/8151)

d1c244a

compilade commited on Sep 6, 2024

Commit History

ggml : add AVX512DQ requirement for AVX512 builds (llama/9622) 14b5848

log : add CONT level for continuing previous log entry (llama/9610) a29a4c5

threads: fix msvc build without openmp (llama/9615) 97b3eb5

cuda: add q8_0->f32 cpy operation (llama/9571) 6201c74

threads: improve ggml_barrier scaling with large number of threads (llama/9598) aca04d5

ggml : AVX512 gemm for Q4_0_8_8 (llama/9532) 7349efc

metal : use F32 prec for K*Q in vec FA (llama/9595) 99c4239

Revert "[SYCL] fallback mmvq (ggml/9088)" (llama/9579) 5aceb3d

musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (llama/9526) 8ec75c3

Fix merge error in #9454 (llama/9589) 3142fa9

CUDA: enable Gemma FA for HIP/Pascal (llama/9581) 97cb7ce

RWKV v6: RWKV_WKV op CUDA implementation (llama/9454) 8d3e707

ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (llama/9573) 673df39

Update CUDA graph on scale change plus clear nodes/params (llama/9550) 6b63eb1

examples : adapt to ggml.h changes (ggml/0) 91c7734

ggml : refactoring (llama/#0) 1b62c96

ggml : fix builds (llama/0) 524a01b

ggml : fix trailing whitespace (llama/0) 214f95e

CUDA: fix sum.cu compilation for CUDA < 11.7 (llama/9562) b305ecf

ggml : fix n_threads_cur initialization with one thread (llama/9538) af82b69

threadpool : skip polling for unused threads (llama/9461) 9d11a7a

ggml : link MATH_LIBRARY not by its full path (llama/9339) 07d57ec

cmake : do not hide GGML options + rename option (llama/9465) 8c32d36

ggml : IQ4_NL sgemm + Q4_0 AVX optimization (llama/9422) f2986f6

metal : handle zero-sized allocs (llama/9466) 868283e

common : reimplement logging (llama/9418) e893c97

cmake : correct order of sycl flags (llama/9497) 45ddbb5

cmake : try to fix sycl+intel build (llama/9487) dd66fc9

ggml : ggml_type_name return "NONE" for invalid values (llama/9458) 8a1bb27

cmake : use list(APPEND ...) instead of set() + dedup linker (llama/9463) 5497c27

cann: Add host buffer type for Ascend NPU (llama/9406) 7cbca42

riscv : modify Makefile and add a RISCV_VECT to print log info (llama/9442) f77ad34

cann: Fix error when running a non-exist op (llama/9424) 74dcc66

CUDA: fix --split-mode row race condition (llama/9413) b021272

musa: remove Clang builtins mapping (llama/9421) ba2469d

sycl : update support conditions (llama/9394) 9a876d1

metal : fix compile warning with GGML_METAL_NDEBUG (llama/0) dfc0bf0

rpc : fix segfault with nkvo (llama/9389) 66ce884

ggml : vector length agnostic SVE support (llama/9290) 189a444

CUDA: fix variable name conflict for Windows build (llama/9382) 44d4193

Overlap cmdbuffer creation and cmdbuffer execution in Vulkan backend by submitting smaller cmdbuffers early. (llama/9118) e374798

cuda : fix FA Q src index (1 -> 0) (llama/9374) 8cfb955

add check malloc result on device (llama/9346) cf68be7

ggml/examples: add backend support for numerical optimization (ggml/949) 5c178b0

examples : add null threadpool args where needed (ggml/0) 0bb7364

metal : update support condition for im2col + fix warning (llama/0) ed9150a

ggml : always check bounds on get_rows operations (llama/9354) a13c99b

ggml : fix missing `cpu_set_t` on emscripten (llama/9336) d8c76ac

Improve Vulkan shader build system (llama/9239) 9746f77

ggml-quants : ternary packing for TriLMs and BitNet b1.58 (llama/8151) d1c244a

ggml : add AVX512DQ requirement for AVX512 builds (llama/9622)

14b5848

log : add CONT level for continuing previous log entry (llama/9610)

a29a4c5

threads: fix msvc build without openmp (llama/9615)

97b3eb5

cuda: add q8_0->f32 cpy operation (llama/9571)

6201c74

threads: improve ggml_barrier scaling with large number of threads (llama/9598)

aca04d5

ggml : AVX512 gemm for Q4_0_8_8 (llama/9532)

7349efc

metal : use F32 prec for K*Q in vec FA (llama/9595)

99c4239

Revert "[SYCL] fallback mmvq (ggml/9088)" (llama/9579)

5aceb3d

musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (llama/9526)

8ec75c3

Fix merge error in #9454 (llama/9589)

3142fa9

CUDA: enable Gemma FA for HIP/Pascal (llama/9581)

97cb7ce

RWKV v6: RWKV_WKV op CUDA implementation (llama/9454)

8d3e707

ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (llama/9573)

673df39

Update CUDA graph on scale change plus clear nodes/params (llama/9550)

6b63eb1

examples : adapt to ggml.h changes (ggml/0)

91c7734

ggml : refactoring (llama/#0)

1b62c96

ggml : fix builds (llama/0)

524a01b

ggml : fix trailing whitespace (llama/0)

214f95e

CUDA: fix sum.cu compilation for CUDA < 11.7 (llama/9562)

b305ecf

ggml : fix n_threads_cur initialization with one thread (llama/9538)

af82b69

threadpool : skip polling for unused threads (llama/9461)

9d11a7a

ggml : link MATH_LIBRARY not by its full path (llama/9339)

07d57ec

cmake : do not hide GGML options + rename option (llama/9465)

8c32d36

ggml : IQ4_NL sgemm + Q4_0 AVX optimization (llama/9422)

f2986f6

metal : handle zero-sized allocs (llama/9466)

868283e

common : reimplement logging (llama/9418)

e893c97

cmake : correct order of sycl flags (llama/9497)

45ddbb5

cmake : try to fix sycl+intel build (llama/9487)

dd66fc9

ggml : ggml_type_name return "NONE" for invalid values (llama/9458)

8a1bb27

cmake : use list(APPEND ...) instead of set() + dedup linker (llama/9463)

5497c27

cann: Add host buffer type for Ascend NPU (llama/9406)

7cbca42

riscv : modify Makefile and add a RISCV_VECT to print log info (llama/9442)

f77ad34

cann: Fix error when running a non-exist op (llama/9424)

74dcc66

CUDA: fix --split-mode row race condition (llama/9413)

b021272

musa: remove Clang builtins mapping (llama/9421)

ba2469d

sycl : update support conditions (llama/9394)

9a876d1

metal : fix compile warning with GGML_METAL_NDEBUG (llama/0)

dfc0bf0

rpc : fix segfault with nkvo (llama/9389)

66ce884

ggml : vector length agnostic SVE support (llama/9290)

189a444

CUDA: fix variable name conflict for Windows build (llama/9382)

44d4193

Overlap cmdbuffer creation and cmdbuffer execution in Vulkan backend by submitting smaller cmdbuffers early. (llama/9118)

e374798

cuda : fix FA Q src index (1 -> 0) (llama/9374)

8cfb955

add check malloc result on device (llama/9346)

cf68be7

ggml/examples: add backend support for numerical optimization (ggml/949)

5c178b0

examples : add null threadpool args where needed (ggml/0)

0bb7364

metal : update support condition for im2col + fix warning (llama/0)

ed9150a

ggml : always check bounds on get_rows operations (llama/9354)

a13c99b

ggml : fix missing `cpu_set_t` on emscripten (llama/9336)

d8c76ac

Improve Vulkan shader build system (llama/9239)

9746f77

ggml-quants : ternary packing for TriLMs and BitNet b1.58 (llama/8151)

d1c244a