is there float16 version that can run on Android ?
#3
by vastheaven - opened
This int4 quantized version's performance is much lower than the gemma3 1b running on Ollama. I need a standard version that can run on Android demo
vastheaven changed discussion title from is this int4 quantized version? to is there float16 version ?
vastheaven changed discussion title from is there float16 version ? to is there float16 version that can run on Android ?