See https://x.com/anemll/status/1902040540704862686 for more info on iOS including TestFlight link Demo DeepHermes 3B model in LUT4 ( high quantization )