Add model card and metadata for Rethinking Generalization in Reasoning SFT

#1
by nielsr HF Staff - opened

This PR adds a model card for the research presented in the paper Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability.

The model card includes:

  • Relevant metadata: pipeline_tag, library_name, and license.
  • Links to the paper and the official GitHub repository.
  • A summary of the key findings regarding reasoning SFT generalization.
  • Citation information for researchers.
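
For context, metadata fields such as these sit in the YAML front matter at the top of README.md. Below is a minimal sketch of how the three fields could be rendered, using only the Python standard library. The values `text-generation`, `transformers`, and `apache-2.0` are placeholder assumptions, not the values this PR sets, and `render_front_matter` is a hypothetical helper introduced for illustration.

```python
def render_front_matter(metadata: dict[str, str]) -> str:
    """Render flat string metadata as a YAML front-matter block."""
    lines = ["---"]
    for key, value in metadata.items():
        # Simple scalar values only; no quoting or nesting is handled here.
        lines.append(f"{key}: {value}")
    lines.append("---")
    return "\n".join(lines) + "\n"


# Placeholder values (assumptions); substitute those used in the actual model card.
example_metadata = {
    "pipeline_tag": "text-generation",
    "library_name": "transformers",
    "license": "apache-2.0",
}

front_matter = render_front_matter(example_metadata)
print(front_matter)
```

The rendered block would go at the very top of README.md, ahead of the model card body.
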
Cannot merge
This branch has merge conflicts in the following files:
  • README.md
