wordle-grpo-Qwen3-1.7B-test / model-00001-of-00002.safetensors

Commit History

Training in progress, step 10
62473b1
verified

taozhang9527 committed on