Q4_0 is a bit faster for pure CPU inference on my experience, would be nice to have. Thank you.
Β· Sign up or log in to comment