Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW-noRP-0427-updatePW Text Generation • 3B • Updated Apr 28, 2025 • 6
anmolagarwal999/Qwen2.5-3B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_10 Text Generation • 3B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-3B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_30 Text Generation • 3B • Updated May 6, 2025 • 3
anmolagarwal999/Qwen2.5-3B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_42 Text Generation • 3B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-3B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_60 Text Generation • 3B • Updated May 6, 2025 • 2
anmolagarwal999/Qwen2.5-3B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_80 Text Generation • 3B • Updated May 6, 2025 • 1
anmolagarwal999/Qwen2.5-3B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_90 Text Generation • 3B • Updated May 6, 2025 • 3
anmolagarwal999/Qwen2.5-3B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_110 Text Generation • 3B • Updated May 6, 2025 • 2