arxiv:2601.10201
Jiarui Yao
FlippyDora
AI & ML interests
None yet
Recent Activity
upvoted a paper about 10 hours ago
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models upvoted a paper about 10 hours ago
Rethinking the Divergence Regularization in LLM RL upvoted a paper about 15 hours ago
Lean4Agent: Formal Modeling and Verification for Agent Workflow and Trajectory