Spaces:
Running
Running
Commit History
sync : ggml (#2001) cbbfa9e unverified
metal : build metallib + fix embed path (llama/6015) 27311ef unverified
ggml : reuse quantum structs across backends (llama/5943) bb0625f unverified
metal : move mm_id indices to shared mem (llama/5982) 1350705 unverified
ggml : introduce ggml_status (ggml/750) 151c676 unverified
IQ4_XS: a 4.25 bpw quantization (llama/5747) 0ee1bfb unverified
code : normalize enum names (llama/5697) 93e0830 unverified
IQ3_S: a much better alternative to Q3_K (llama/5676) 32589c9 unverified
Introduce backend GUIDs (ggml/743) a7eb9f6 unverified
UEXTM.com slaren commited on
sync : llama.cpp (ggml/0) f8e8d34 unverified
1.5 bit quantization (llama/5453) 9c3aa6a unverified
ggml : add ALiBi support for ggml_soft_max_ext (llama/5488) 26c019a unverified
metal : use autoreleasepool to avoid memory leaks (llama/5437) c276f12 unverified
metal : option to embed MSL source into compiled binary (#1842) a46b62a unverified
Didzis Gosko commited on
metal : add im2col F32 dst support (llama/5132) 26aec77 unverified
SOTA 3-bit quants (llama/5196) 4649943 unverified
ggml : add max buffer sizes to opencl and metal backends (llama/5181) 3d354d0 unverified
slaren commited on
metal : free metal objects (llama/5161) ea7167a unverified
Paul Tsochantaris commited on
ci : fix yolo URLs + fix metal capture (ggml/712) 588f789 unverified
metal : add debug capture backend function (ggml/694) ece88c3 unverified
ggml : add Vulkan backend (llama/2059) 5a97aba unverified
metal : remove unused `n_buffers` and `buffers` (llama/5129) a3e87d3 unverified
Paul Tsochantaris commited on
metal : show compile log messages ae08f31 unverified
metal : disable support for MUL_MAT F32 x F16 7fbc01f unverified
ggml : sync ggml-metal.m b4085c3 unverified
metal : create autorelease pool during library build (llama/4970) 9027276 unverified
metal : log `recommendedMaxWorkingSetSize` on iOS 16+ (llama/4936) e2cc0e5 unverified
ggml : introduce GGML_CALL function annotation (llama/4850) 7815f68 unverified
metal : correctly set SIMD support flags on iOS (llama/4923) 1cf2fa9 unverified
metal : remove old API (llama/4919) d6abb6a unverified
metal : disable log for loaded kernels (llama/4794) 2305485 unverified
metal : refactor kernel loading code (llama/4794) 53e6bf8 unverified
llama : ggml-backend integration (llama/4766) 362430b unverified
ggml : SOTA 2-bit quants (add IQ2_XS) (llama/4856) 5e827d5 unverified
metal : put encoder debug group behind a define (llama/4873) 6e822b8 unverified
Paul Tsochantaris commited on
metal : fix deprecation warning (ggml/690) b1e29bc unverified
metal : wrap each operation in debug group (ggml/690) b5e360f unverified
Jack Mousseau commited on