xiaolesu
/

OsmosisProofling-SFT-NT-GRPO-NT-Overlap

Model card Files Files and versions

xiaolesu commited on 10 days ago

Commit

14c72f0

·

verified ·

1 Parent(s): 4dc4fd2

Update README.md

Files changed (1) hide show

README.md +8 -0

README.md CHANGED Viewed

@@ -1,3 +1,11 @@
 ### xiaolesu/OsmosisProofling-SFT-NT-GRPO-NT-Overlap
 Experimental checkpoint from "Data Overlap as a Post-Training Hyperparameter for Autoformalization." This is the **SFT+GRPO with 100% overlap** variant (Qwen3-8B, thinking disabled) -- the control condition where GRPO reuses SFT data entirely. See the [paper repo](https://github.com/suxls/data-overlap-autoformalization) for details, results, and all artifacts.

 ### xiaolesu/OsmosisProofling-SFT-NT-GRPO-NT-Overlap
 Experimental checkpoint from "Data Overlap as a Post-Training Hyperparameter for Autoformalization." This is the **SFT+GRPO with 100% overlap** variant (Qwen3-8B, thinking disabled) -- the control condition where GRPO reuses SFT data entirely. See the [paper repo](https://github.com/suxls/data-overlap-autoformalization) for details, results, and all artifacts.
+## 📄 Paper
+This model is part of the experiments in:
+**SFT-GRPO Data Overlap as a Post-Training Hyperparameter for Autoformalization**
+Xiaole Su, Kasey Zhang, Andy Lyu
+https://arxiv.org/abs/2604.13515