whisper.cpp

whisper.cpp / ggml / src / ggml-metal

Commit History

metal : simplify kernel arguments using a struct (ggml/3229) (llama/12194)

092277a

BB-fat alexju committed on Mar 7, 2025

metal : fix default.metallib build (llama/12224)

838efb6

danbev committed on Mar 7, 2025

ggml : fix GGMLMetalClass ODR (llama/12200)

2094cb7

pacominev committed on Mar 5, 2025

cuda/cpu: Increase support for fp16 unary operations (ggml/1125)

67e8c32

cmdr2 committed on Feb 28, 2025

metal : copy kernels for quant to F32/F16 conversions (llama/12017)

6c8e7ec

Garf ggerganov HF Staff committed on Feb 25, 2025

metal : fix the crash caused by the lack of residency set support on Intel Macs. (llama/11904)

afbd891

Hale Chan committed on Feb 16, 2025

metal : optimize dequant q6_K kernel (llama/11892)

376cbe6

Adrian Kretz committed on Feb 15, 2025

repo : update links to new url (llama/11886)

9705bb5

ggerganov HF Staff committed on Feb 15, 2025

metal : avoid breaking build when metal API predates TARGET_OS_VISION (llama/11690)

5bdb244

charles-dyfis-net committed on Feb 6, 2025

metal : adjust support conditions for norm operators (llama/11671)

5eb35ab

ggerganov HF Staff committed on Feb 5, 2025

CUDA: non-contiguous (RMS) norm support (llama/11659)

4c2e171

JohannesGaessler ggerganov HF Staff committed on Feb 4, 2025

metal : use residency set for other platforms (llama/11648)

0e58088

jhenjie committed on Feb 4, 2025

metal: Handle null returned from MTLCreateSystemDefaultDevice() (llama/11441)

4e38ed4

Ihar Hrachyshka committed on Jan 27, 2025

metal : use residency sets (llama/11427)

9da4d68

ggerganov HF Staff committed on Jan 26, 2025

metal : fix out-of-bounds write (llama/11314)

1101050

ggerganov HF Staff committed on Jan 21, 2025

ggml : do not install metal source when embed library (ggml/1054)

9615cf2

ggerganov HF Staff committed on Jan 3, 2025

metal : avoid uint (llama/11019)

b788516

ggerganov HF Staff committed on Jan 3, 2025

llama : add Qwen2VL support + multimodal RoPE (llama/10361)

219d12b

RzZ ggerganov HF Staff committed on Dec 14, 2024

metal : Extend how Llama.cpp locates metal resources (llama/10676)

44e7250

Robert Ormandi ggerganov HF Staff committed on Dec 7, 2024

ggml: add `GGML_SET` Metal kernel + i32 CPU kernel (ggml/1037)

dd775d5

PABannier committed on Dec 4, 2024

ggml : add `GGML_PAD_REFLECT_1D` operation (ggml/1034)

154bbc0

PABannier committed on Dec 3, 2024

ggml : move AMX to the CPU backend (llama/10570)

3732429

Diego Devesa committed on Dec 3, 2024

metal : small-batch mat-mul kernels (llama/10581)

58b0822

ggerganov HF Staff committed on Dec 3, 2024

metal : fix group_norm support condition (llama/0)

20ee62d

ggerganov HF Staff committed on Nov 27, 2024

metal : enable mat-vec kernels for bs <= 4 (llama/10491)

6d07dee

ggerganov HF Staff committed on Nov 25, 2024

ggml : add support for dynamic loading of backends (llama/10469)

b73266f

Diego Devesa ggerganov HF Staff committed on Nov 25, 2024

metal : minor code formatting

385a521

ggerganov HF Staff committed on Nov 25, 2024

feat: add `GGML_UNARY_OP_ARGMAX` Metal kernel (ggml/1019)

c7e59ef

PABannier Diego Devesa committed on Dec 2, 2024

metal : add `GGML_OP_CONV_TRANSPOSE_1D` kernels (ggml/1026)

9c845f4

PABannier committed on Nov 28, 2024

ggml : sync resolve (skip) (#0)

d4d67dc

ggerganov HF Staff committed on Nov 19, 2024

metal : fix offset integer overflows in im2col (ggml/1015)

efbd100

pacominev committed on Nov 18, 2024

metal : add `GGML_UNARY_OP_ELU` kernel (ggml/1018)

5959420

PABannier committed on Nov 18, 2024

metal : refactor kernel args into structs (llama/10238)

15659b4

ggerganov HF Staff committed on Nov 17, 2024

ggml: new optimization interface (ggml/988)

dd33ace

JohannesGaessler committed on Nov 16, 2024

sync : leftovers (ggml/0)

0f6c498

ggerganov HF Staff committed on Nov 15, 2024

ggml : build backends as libraries (llama/10256)

3dc93f3

Diego Devesa ggerganov HF Staff R0CKSTAR committed on Nov 14, 2024
