bench : add memcpy and ggml_mul_mat benchmarks a660ed9 unverified ggerganov HF Staff commited on Jan 18, 2023
ggml : remove obsolete zeroing + comment fixes (#390) 9c35c0d unverified ggerganov HF Staff commited on Jan 8, 2023
ggml : correct behaviour of ggml_vec_sum_f32 (#390) ffffc6e unverified Abitofevrything commited on Jan 8, 2023
ggml : improve vec_dot_f16 unrolling in flash_attn_f16 6e57274 unverified ggerganov HF Staff commited on Jan 8, 2023
ggml : fix bug in new soft max computation c59ce76 unverified ggerganov HF Staff commited on Jan 7, 2023
ggml : when using BLAS start only 1 CPU thread 6c4692f unverified ggerganov HF Staff commited on Jan 7, 2023
ggml : fix running tasks with variable number of threads 2078d85 unverified ggerganov HF Staff commited on Jan 7, 2023
ggml : unroll ggml_vec_dot_f16 in ggml_compute_forward_flash_attn_f16 f07fecd unverified ggerganov HF Staff commited on Jan 7, 2023
ggml : speed-up soft max via Accelerate + unroll fdaf59a unverified ggerganov HF Staff commited on Jan 7, 2023
ggml : use vDSP_sve and vDSP_maxv from Accelerate ed14a8b unverified ggerganov HF Staff commited on Jan 7, 2023
ggml : add SSE3 and fp16 conversion lookup table (#368) 2c3f7d4 unverified Abitofevrything ggerganov HF Staff commited on Jan 6, 2023
ggml : add void to argument-less functions f06f912 unverified ggerganov HF Staff commited on Jan 5, 2023
ggml : define MIN / MAX only if not defined (minor) 2117da6 unverified ggerganov HF Staff commited on Jan 5, 2023
ggml : improve f16 acceleration for POWER9 ppc64le f92a260 Thomas Fitzsimmons commited on Dec 30, 2022
ggml : barrier refactor + static functions 7b501c1 unverified ggerganov HF Staff commited on Dec 28, 2022
ggml : use vaddvq_f32 for slightly more efficient reduce 550fbf8 unverified ggerganov HF Staff commited on Dec 23, 2022
minor : small code cleanups (#302) 142f526 unverified Andy Maloney ggerganov HF Staff commited on Dec 22, 2022
Check for both __ARM_NEON and __ARM_FEATURE_FMA so that the project can be compiled for armv7a. 1fff54f Kevin Brothaler commited on Dec 20, 2022
ggml : implement ggml_compute_forward_dup_f16() special cases b3b8141 unverified ggerganov HF Staff commited on Dec 16, 2022
ggml : make more compatible with c99 (#262) 52bc68d unverified ggerganov HF Staff commited on Dec 16, 2022
ggml : make compatible with c99 (#262) d9c1974 unverified ggerganov HF Staff commited on Dec 13, 2022
ggml : add alternative cblas_sgemm call 2f68de6 unverified ggerganov HF Staff commited on Dec 8, 2022
ggml : use macros to inline FP16 <-> FP32 conversions 23e5614 unverified ggerganov HF Staff commited on Dec 6, 2022
ggml : remove inline specifier from fp16 <-> fp32 converters cdd3359 unverified ggerganov HF Staff commited on Dec 1, 2022
ggml : fix cross-compile Linux -> Window with mingw (#168) 29fe0ee unverified ggerganov HF Staff commited on Nov 23, 2022
ggml: change inline ggml_fp16_to_fp32, ggml_fp16_t ggml_fp32_to_fp16 b2f844a katsu560 commited on Nov 23, 2022
ggml : multi-thread the ggml_add operator c36d8ed unverified ggerganov HF Staff commited on Nov 3, 2022
ggml : fix the check for NEON support (#7) 1f7e8fa unverified ggerganov HF Staff commited on Nov 2, 2022