Active filters: open-r1
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata
Text Generation
• 8B • Updated • 4
schwamaths/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• Updated
ibndias/Qwen2.5-1.5B-Open-R1-GRPO1st
Text Generation
• 2B • Updated • 1
schwamaths/Qwen2.5-1.5B-Instruct-Open-R1-GRPO
Text Generation
• Updated • 5
weltonwang88/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated
Jiawen006/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 1
mradermacher/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-GGUF
2B • Updated • 23
AdAstraAbyssoque/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 1
JeffP111/Qwen2.5-3B-GRPO-Countdown
Text Generation
• 3B • Updated • 1
susumuota/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 2
susumuota/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 7
calledice666/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
DominicX/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 1
Loong-Ma/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 2
bushou/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 1
DeeLearning/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 7
KevinWugk/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 1
didao1234/Qwen2.5-Math-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
princepride/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 1
daltunay/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 2
herman66/Qwen2.5-0.5B-Open-R1-Distill
Text Generation
• 0.5B • Updated • 2 • 1
tenacioustommy/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 3B • Updated • 6
Maker-0409/Qwen-2.5-7B-Simple-RL
Text Generation
• 8B • Updated • 4
whooray/Qwen2.5-1.5B-Open-R1-Distill-ko
Text Generation
• 2B • Updated • 5
brishtiteveja/BanglaLLM-Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 7
byteXWJ/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated
Qucy/Qwen2.5-0.5B-Open-R1-Distill
Text Generation
• 0.5B • Updated
ununtrium/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-gsm8k2
Text Generation
• 2B • Updated • 1
ashokvaktariya1/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated