purbeshmitra/vanillaGRPO
Text Generation
Transformers
Safetensors
openai/gsm8k
HuggingFaceH4/MATH-500
HuggingFaceH4/aime_2024
English
arxiv: 2507.02851
License: apache-2.0
2 contributors
History: 12 commits
purbeshmitra, nielsr (HF Staff): Add pipeline tag and update library_name (#1) · ba38ea0 · verified · 10 months ago
assets · Rename multiround.png to assets/multiround.png · 10 months ago
.gitattributes · Safe · 1.75 kB · Rename multiround.png to assets/multiround.png · 10 months ago
README.md · 2.6 kB · Add pipeline tag and update library_name (#1) · 10 months ago
adapter_config.json · Safe · 876 Bytes · Upload 3 files · 10 months ago
adapter_model.safetensors · Safe · 148 MB · xet · Upload 3 files · 10 months ago