Size discrepancies

#1
by aoleg - opened

The "quality" models are smaller than the "balanced" ones. Is this by design? Is the quality of the smaller "quality" quants actually higher than that of the larger "balanced" ones?

I was also wondering why APEX mini is larger by 1GB on Qwen 3.6 vs 3.5.

Sign up or log in to comment