jasonrqh/Qwen3-8B_Math-CoT-20k_lr5e-5_ep8_bs256
Text Generation
Transformers
Safetensors
English
reasoning
sft
chain-of-thought
arxiv:2604.06628
License: mit
Revision: f451c72
Size: 131 GB
1 contributor
History: 10 commits
Latest commit: nielsr (HF Staff), "Add model card and metadata for Rethinking Generalization in Reasoning SFT", f451c72 (verified), 13 days ago
Name            Size      Last commit                                                                  Updated
step10/         -         Add files using upload-large-folder tool                                     16 days ago
step160/        -         Add files using upload-large-folder tool                                     15 days ago
step20/         -         Add files using upload-large-folder tool                                     15 days ago
step320/        -         Add files using upload-large-folder tool                                     15 days ago
step40/         -         Add files using upload-large-folder tool                                     15 days ago
step480/        -         Add files using upload-large-folder tool                                     15 days ago
step640/        -         Add files using upload-large-folder tool                                     15 days ago
step80/         -         Add files using upload-large-folder tool                                     15 days ago
.gitattributes  1.99 kB   Add files using upload-large-folder tool                                     15 days ago
README.md       2.29 kB   Add model card and metadata for Rethinking Generalization in Reasoning SFT   13 days ago