arvindcr4/tinker-rl-frontier_gsm8k_nemotron-120b-nemotron-120b Reinforcement Learning • Updated 1 day ago
arvindcr4/tinker-rl-frontier_gsm8k_deepseek-v3.1-deepseek-v3.1 Reinforcement Learning • Updated 1 day ago
arvindcr4/tinker-rl-w1_qwen3-8b-base-qwen3-8b-base-s42-run1 Reinforcement Learning • Updated 1 day ago
arvindcr4/tinker-rl-w1_qwen3-8b-base-qwen3-8b-base-s42-run2 Reinforcement Learning • Updated 1 day ago