Active filters: open-r1
Studyboy/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 1
KKHYA/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
zzzch/Qwen2.5-0.5B-Open-R1-GRPO
Text Generation
• 0.6B • Updated • 10
atmatechai/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 1
zean/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
ztt0821/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 1
Studyboy/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 3B • Updated • 1
lewtun/smollm2-distill-default-chat-template
Text Generation
• 2B • Updated • 2
flyingbugs/Qwen2.5-7B-Open-R1-Distill-mixed
Text Generation
• 8B • Updated • 2
flyingbugs/Qwen2.5-7B-Open-R1-Distill-bi
Text Generation
• 8B • Updated • 1
wnj13/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 4 • 2
coolcoolad/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-bs17k-batch32-epoch5
Text Generation
• 7B • Updated • 1
yfliao/Qwen-2.5-7B-Simple-RL
Text Generation
• 8B • Updated
yfliao/Qwen-2.5-1.5B-Simple-RL
Text Generation
• 2B • Updated
Bradley/Qwen-2.5-1.5B-Simple-RL-ga1
Text Generation
• 2B • Updated • 8
Bradley/Qwen-2.5-1.5B-Simple-RL-ga8
Text Generation
• 2B • Updated
rkumar1999/Llama-3.1-8B-Instruct-Open-R1-Distill
Text Generation
• Updated • 2
ununtrium/Qwen2.5-1.5B-Open-R1-GRPO-2rewards
Text Generation
• 2B • Updated • 1
SWY666/Qwen-2.5-7B-Simple-RL-with-reward-model
Text Generation
• 8B • Updated • 1
a-F1/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 5
coolcui/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 2
zhangyitony/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 3
foxiwift/Qwen2.5-7B-Open-R1-Distill-AR
Text Generation
• Updated
Lansechen/Qwen2.5-3B-Instruct-Distill-ot114k-batch32
Text Generation
• 3B • Updated • 6
zhangyitony/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Text Generation
• 2B • Updated • 10
jeff-gao/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 2
coolcui/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Text Generation
• 2B • Updated • 1
mathczh/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 24
Sunnylululu/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 18