### xiaolesu/OsmosisProofling-SFT-NT-GRPO-NT-Overlap

Experimental checkpoint from "Data Overlap as a Post-Training Hyperparameter for Autoformalization." This is the **SFT+GRPO with 100% overlap** variant (Qwen3-8B, thinking disabled): the control condition in which the GRPO stage reuses the SFT data in full. See the [paper repo](https://github.com/suxls/data-overlap-autoformalization) for details, results, and all artifacts.
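To make the "100% overlap" condition concrete, here is a minimal sketch of one way to measure overlap between two training sets. It assumes overlap is the fraction of GRPO examples whose identifier also appears in the SFT set. The paper may define it differently, and the function name `overlap_fraction` and the example IDs are hypothetical, not taken from the paper repo.

```python
def overlap_fraction(sft_ids, grpo_ids):
    """Fraction of distinct GRPO example IDs that also appear in the SFT set.

    Assumed definition for illustration only; not the paper's code.
    """
    grpo = set(grpo_ids)
    if not grpo:
        raise ValueError("GRPO set is empty")
    return len(grpo & set(sft_ids)) / len(grpo)


# Hypothetical example IDs.
sft_ids = ["p1", "p2", "p3", "p4"]

# Control condition for this checkpoint: GRPO reuses the SFT data entirely.
full_overlap = overlap_fraction(sft_ids, ["p1", "p2", "p3", "p4"])

# Opposite extreme: GRPO trains on data disjoint from the SFT set.
no_overlap = overlap_fraction(sft_ids, ["p5", "p6"])

print(full_overlap, no_overlap)  # 1.0 0.0
```

Under this assumed definition, the checkpoint described here corresponds to a value of 1.0.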

## 📄 Paper

This model is part of the experiments in:

**SFT-GRPO Data Overlap as a Post-Training Hyperparameter for Autoformalization**  
Xiaole Su, Kasey Zhang, Andy Lyu  
https://arxiv.org/abs/2604.13515