Inference Providers
Active filters: grpo
justinj92/Qwen2.5-1.5B-Thinking-v1.1
Text Generation
• 2B • Updated • 6
• 2
jainamit/qwen-2.5-3b-r1-countdown
Text Generation
• 3B • Updated • 1
GitBag/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 1
justinj92/Qwen2.5-1.5B-Thinking-v1.1-Q8_0-GGUF
2B • Updated • 12
justinj92/Qwen2.5-1.5B-Thinking-v1.1-Q5_K_M-GGUF
2B • Updated • 18
Text Generation
• 8B • Updated • 3
mradermacher/Qwen2.5-1.5B-Thinking-GGUF
2B • Updated • 37
• 1
mradermacher/DeepSeek-R1-Qwen-2.5-1.5b-GGUF
2B • Updated • 542
• 1
Text Generation
• Updated • 26
• peulsilva/reasoning-qwen-epoch0
Text Generation
• 0.5B • Updated • 1
peulsilva/reasoning-qwen-epoch1
Text Generation
• 0.5B • Updated • 4
spinech/qwen2.5-3b-r1-arc-train-synthetic
Text Generation
• 3B • Updated • 5
peulsilva/reasoning-qwen-epoch2
Text Generation
• 0.5B • Updated • 2
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math
Text Generation
• 8B • Updated • 11
Text Generation
• 8B • Updated • 3
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math
Text Generation
• 2B • Updated • 2
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math
Text Generation
• 2B • Updated • 9
peulsilva/reasoning-qwen-epoch3
Text Generation
• 0.5B • Updated • 1
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO-GGUF
8B • Updated • 119
skzxjus/Qwen2.5-7B-Open-R1-GRPO
Text Generation
• 8B • Updated • 8
AndreasX1206/Qwen2-0.5B-countdown
Text Generation
• 0.5B • Updated • 1
• mradermacher/Qwen-0.5B-GRPO-GGUF
0.5B • Updated • 28
alicogniai/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 1
ununtrium/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 4
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO-i1-GGUF
8B • Updated • 776
yuta0x89/llmjp13b-numinacot-epoch2-GRPO
Text Generation
• 14B • Updated • 3
yeshsurya/Qwen2.5-7B-Math-with_50stepGRPO
Text Generation
• 8B • Updated • 1
mradermacher/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math-GGUF
2B • Updated • 80
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-GGUF
8B • Updated • 192
mradermacher/DeepSeek-R1-Qwen-2.5-1.5b-Latest-Unstructured-To-Structured-GGUF
2B • Updated • 187
• 1