Commit History
code : normalize enum names (llama/5697) 93e0830 unverified
IQ3_S: a much better alternative to Q3_K (llama/5676) 32589c9 unverified
Introduce backend GUIDs (ggml/743) a7eb9f6 unverified
UEXTM.com slaren committed on
talk, talk-llama : pass text_to_speak as a file (#1865) 3fd8b4d unverified
Tamotsu Takahashi committed on
whisper : add SYCL support (#1863) 9a168fc unverified
Abhilash Majumder committed on
talk-llama : sync llama.cpp 53d0282 unverified
sync : ggml 4f680fe unverified
ggml : always define ggml_fp16_t as uint16_t (llama/5666) bc567d3 unverified
ci : fix whitespace d75e158 unverified
ggml : 32-bit arm compat (#1891) 7551157 unverified
sync : ggml 096caf3 unverified
sync : llama.cpp (ggml/0) f8e8d34 unverified
conext add name (llama/5624) 3c39d4b unverified
Update ggml_sycl_op_mul_mat_vec_q (llama/5502) 963ffd5 unverified
AidanBeltonS Abhilash Majumder committed on
Refactor validation and enumeration platform checks into functions to clean up ggml_vk_instance_init() 8637c17 unverified
Add check for VK_KHR_portability_enumeration for MoltenVK support 85caa3f unverified
Add preprocessor checks for Apple devices. b8e3b87 unverified
Resolve ErrorIncompatibleDriver with Vulkan on MacOS. 0bc3433 unverified
Allow for Vulkan build with Accelerate. 7d255ac unverified
cuda : ignore peer access already enabled errors (llama/5597) a817d85 unverified
slaren committed on
ggml : compute forward no longer pass src tensors (ggml/729) 4e31c82 unverified
Siddharth Ramakrishnan siddharthvader committed on
ggml : fix conv_2d batch mode (ggml/737) 99ece5c unverified
openvino : fix convert-whisper-to-openvino.py (#1890) dfd53cc unverified
st-gr Stefan Grundmann committed on
main : fix file existence check in main.cpp (#1889) 9162df9 unverified
talk-llama : sync llama.cpp 542accf unverified
make : fix CUBLAS link with WSL (#1878) b3c9e81 unverified
LBlue committed on
sync : ggml cb5b2be unverified
ggml : resolve merge conflicts (ggml/0) 7ee6ffa unverified
common : add IQ1_S (ggml/0) 39c054e unverified
ci : enable -Werror for CUDA builds (llama/5579) df03a10 unverified
cuda, metal : fix nans in soft_max (llama/5574) 44164ac unverified
ggml : android and old glibc NUMA incompatibility bugfixes (llama/5557) 0206c2d unverified
ggml : restore vec dot stride arg names (llama/5453) de4041f unverified
ci : fix wikitext url + compile warnings (llama/5569) 49f0106 unverified
metal : fix unused warnings (llama/0) d12cda5 unverified
ggml, common, examples, tests : fixed type arguments in printf (llama/5528) 2f3a004 unverified
1.5 bit quantization (llama/5453) 9c3aa6a unverified
ggml : add ALiBi support for ggml_soft_max_ext (llama/5488) 26c019a unverified
cmake : fix VULKAN and ROCm builds (llama/5525) ae570e4 unverified
ggml : add numa options (llama/5377) 7c952d2 unverified
cuda : print message when initialization fails (llama/5512) 1f047ca unverified
slaren committed on
vulkan: Find optimal memory type but with fallback (llama/5381) 24e2319 unverified
Early return for zero size calls to get_tensor. (llama/5482) f1f5c00 unverified
ggml-quants : fix compiler warnings (shadow variable) (llama/5472) e538f25 unverified
ggml-sycl: Replace 3d ops with macro (llama/5458) 12970f1 unverified
Abhilash Majumder committed on