Size discrepancies
#1
by aoleg - opened
The "quality" models are smaller than the "balanced" ones. Is this by design? Is the quality of the smaller "quality" quants actually higher than that of the larger "balanced" ones?
I was also wondering why APEX mini is larger by 1GB on Qwen 3.6 vs 3.5.