Commit History
Fix SYCL `im2col` and `convert` Overflow with Large Dims (llama/9052) 5f43886
rpc : print error message when failed to connect endpoint (llama/9042) d54b156
rpc : prevent crashes on invalid input (llama/9040) 656ae00
ggml : dynamic ggml_sched_max_splits based on graph_size (llama/9047) e0dc1ad
cmake : remove unused option GGML_CURL (llama/9011) 12634fc
ggml : move rope type enum to ggml.h (llama/8949) 9d45f48
ggml: fix div-by-zero (llama/9003) d9ee26f
DavidKorczynski committed on
Optimize Vulkan backend for better CPU performance and less GPU synchronization overhead. (llama/8943) 11bc9e6
feat: ref. cross entropy, add CUDA, fix grad test (ggml/929) e1e87a3
ggml: remove bad assert (ggml/928) ba483f7
examples: add MNIST training + missing ops 0828065
sync : ggml vulkan (ggml/0) c4c7e49
ggml : fix typo in ggml-quants.c comment (ggml/922) f158bc0
feat: add new `sin` and `cos` operators (ggml/919) f541d31
ggml : support forward pass broadcasting in ggml_sub (ggml/914) 0af2d37
metal : fix uninitialized abort_callback (llama/8968) f971b60
slaren committed on
rpc : sanitize tensor data + warnings (llama/0) 87d58fe
cann : add Ascend NPU support (#2336) 94baae9
ggml : add CANN backend (llama/0) 7c34a03
ggml-backend : fix async copy from CPU (llama/8897) 050174c
slaren committed on
Updated SYCL device filtering (llama/8901) 64976cd
Ouadie EL FAROUKI committed on
CUDA/HIP: fix tests/test-backend-ops (llama/8896) f14c1ad
CUDA: fix padding logic for FP16/FP32 (llama/8884) 643bcdb
ggml : add epsilon as a parameter for group_norm (llama/8818) d003891
ggml : fix overflows in elu function (llama/8866) a12468a
Justine Tunney committed on
ggml : reading the runtime sve config of the cpu (llama/8709) c26339f
jdomke domke committed on
Fix conversion of unnormalized BF16->BF16 weights (llama/7843) 8b10f59
Fixing wrong VDR iq4nl value (llama/8812) 30eb7bc
Ouadie EL FAROUKI committed on
ggml-cuda: Adding support for unified memory (llama/8035) 686bb18
Build: Only include execinfo.h on linux systems that support it (llama/8783) 0019ddb
Alex O'Connell committed on
cuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X (llama/8800) 73e80d1
slaren committed on
added android implementation of ggml_print_backtrace_symbols (llama/8751) 314d58a
cann: update cmake (llama/8765) 345a58d
Add `TIMESTEP_EMBEDDING` OP (llama/8707) 52eea23
ggml: bugfix: fix the inactive elements is agnostic for risc-v vector (llama/8748) 6989631
cuda : organize vendor-specific headers into vendors directory (llama/8746) ec2f307
R0CKSTAR committed on
add conv support (llama/8688) f0d6f5c
feat: Support Moore Threads GPU (llama/8383) a35db11
ggml : ignore more msvc warnings (ggml/906) 1b11fde
metal : fix struct name (ggml/912) 14cf8db
metal : add abort callback (ggml/905) b822172
vulkan : implement Stable Diffusion operators (ggml/904) 124c156
ggml : move c parameter comment to ggml_rope_ext (ggml/901) 6d34596
ggml : resolve sync conflicst (ggml/0) 82658f5
ggml : add ggml-aarch64 (ggml/0) 0062819
Dibakar Gope committed on
ggml : reduce hash table reset cost (llama/8698) 9808fbf
slaren committed on
ggml: handle ggml_init failure to fix NULL pointer deref (llama/8692) dc51517
DavidKorczynski committed on