Lansechen/Qwen2.5-3B-Distill-om220k-fem32768-batch32-epoch3-8192 Text Generation • 3B • Updated Mar 22, 2025 • 4
Lansechen/Qwen2.5-3B-Distill-om220k-fhm32768-batch32-epoch3-8192 Text Generation • 3B • Updated Mar 24, 2025 • 3
cg666/Qwen-2.5-Base-3B-gen8-scale-MATH-lighteval-olympiads_aime-unique-ghpo-beta0-epoch3 Text Generation • 3B • Updated Apr 11, 2025 • 5
cg666/Qwen-2.5-Base-3B-gen8-scale-math_selected-grpo-beta0-epoch3 Text Generation • 3B • Updated Apr 10, 2025 • 4
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine Text Generation • 3B • Updated Apr 10, 2025 • 2
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-v2 Text Generation • 3B • Updated Apr 11, 2025 • 1
cg666/openr1-Qwen-2.5-Base-3B-gen8-scale-NuminaMath-TIR-100-grpo-beta0-epoch2 Text Generation • 3B • Updated Apr 11, 2025 • 2
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW Text Generation • 3B • Updated Apr 11, 2025 • 6 •
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-default Text Generation • 3B • Updated Apr 12, 2025 • 3 •
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW-RP Text Generation • 3B • Updated Apr 15, 2025 • 2
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW-RP-v2 Text Generation • 3B • Updated Apr 16, 2025 • 4
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW-RP-v3 Text Generation • 3B • Updated Apr 17, 2025 • 2