readme : update links and make commands (#2489) 3767b95 unverified toboil-features commited on Oct 17, 2024
make : fix GGML_VULKAN=1 build (#2485) 110c8bd unverified ggerganov HF Staff commited on Oct 16, 2024
whisper : add dtw preset for large-v3-turbo (#2481) eae3cdd unverified rotemdan commited on Oct 15, 2024
convert : handle max_target_positions (#2477) c36e329 unverified CrispStrobe commited on Oct 14, 2024
readme : update the Quick Start section (#2475) 1d23a03 unverified SalmanFaroz commited on Oct 14, 2024
whisper : add OpenVINO init with state (#2464) 6d5166f unverified Sandro Hanea Sandro Hanea commited on Oct 8, 2024
vulkan : retry allocation with fallback flags (#2451) 9e91cbc unverified SRHMorris fdsffdsafds commited on Oct 6, 2024
whisper : remove mel leftover constants (396089f) 505ec31 unverified ggerganov HF Staff commited on Oct 5, 2024
whisper : zero-out the KV cache upon clear (#2445) b8af443 ggerganov HF Staff commited on Oct 5, 2024
ggml-backend : add device and backend reg interfaces (llama/9707) 9d74d85 Diego Devesa commited on Oct 3, 2024
Fixed dequant precision issues in Q4_1 and Q5_1 (llama/9711) 5239c28 Ouadie EL FAROUKI commited on Oct 3, 2024
ggml-backend : add device and backend reg interfaces (llama/9707) 1bdb50a Diego Devesa JohannesGaessler commited on Oct 2, 2024
Initial cmake support of SYCL for AMD GPUs (llama/9658) 7d7ac98 Alberto Cabrera Pérez commited on Oct 2, 2024
ggml/ex: calculate accuracy in graph, adapt MNIST (ggml/980) 52069b8 JohannesGaessler commited on Oct 3, 2024
ggml: refactor cross entropy loss CPU impl. (ggml/976) 2a0805f JohannesGaessler commited on Oct 2, 2024
whisper : fix excessive memory usage (#2443) afe3785 unverified ggerganov HF Staff commited on Oct 5, 2024
examples : update dr_wav.h to newer version (#2449) d678325 unverified Rahul Vadhyar commited on Oct 4, 2024
metal : reduce command encoding overhead (llama/9698) 43d5a06 ggerganov HF Staff commited on Oct 2, 2024
test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974) 76aa810 JohannesGaessler commited on Sep 30, 2024
ggml : define missing HWCAP flags (llama/9684) 1d52105 ggerganov HF Staff Willy Tarreau commited on Sep 29, 2024
ggml : add run-time detection of neon, i8mm and sve (llama/9331) 12c0e23 Dan Johansson commited on Sep 28, 2024
Enable use to the rebar feature to upload buffers to the device. (llama/9251) 760f8c2 Markus Tavenrath commited on Sep 28, 2024
ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels (llama/9217) 50395aa Charles Xu commited on Sep 25, 2024
cann: fix crash when llama-bench is running on multiple cann devices (llama/9627) 068c697 dou112 commited on Sep 25, 2024
vulkan : fix build for GGML_VULKAN_RUN_TESTS, add TFLOPS to log (ggml/961) 85e2387 jeffbolznv commited on Sep 27, 2024
vulkan : argsort barriers must be under uniform control flow (ggml/951) b2602d7 smeso commited on Sep 26, 2024
ggml : fix GGML_MAX_N_THREADS + improve formatting (ggml/969) ad34655 ggerganov HF Staff commited on Sep 24, 2024
server : ffmpeg overwrite leftover temp file (#2431) 2dafb8e unverified dynafire commited on Oct 2, 2024