Commit History

v45: SFT+chained GRPO with ITN — 95.9% number accuracy, 97.0% filler-free, deletion behavior matches v36
a9fa15c
verified

juanquivilla committed on

v36: full-FT GRPO with substantive-deletion-aware reward — filler-free 96.9%, sub-del-15-long 0.64%
74443d8
verified

juanquivilla committed on

v15: 5-bit MLX quant (237MB, ROUGE-L ~0.955)
0ac2ec8
verified

juanquivilla committed on

5-bit MLX: ROUGE-L 0.926, 233MB, 56% exact, 99% filler-free
fcae46e
verified

juanquivilla committed on