Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
Paper
• 2509.07980
• Published • 105
Robot Learning from a Physical World Model
Paper
• 2511.07416
• Published • 32
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning
Paper
• 2511.06805
• Published • 13
GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms
Paper
• 2511.17592
• Published • 121
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
Paper
• 2511.19399
• Published • 63
OmniRefiner: Reinforcement-Guided Local Diffusion Refinement
Paper
• 2511.19990
• Published • 4
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Paper
• 2511.22699
• Published • 245
TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows
Paper
• 2512.05150
• Published • 76
LightRAG: Simple and Fast Retrieval-Augmented Generation
Paper
• 2410.05779
• Published • 39
DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Paper
• 2503.14476
• Published • 146
VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation
Paper
• 2601.02256
• Published • 33
NitroGen: An Open Foundation Model for Generalist Gaming Agents
Paper
• 2601.02427
• Published • 46
Aligning Text, Code, and Vision: A Multi-Objective Reinforcement Learning Framework for Text-to-Visualization
Paper
• 2601.04582
• Published • 10
Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering
Paper
• 2601.09697
• Published • 8
NanoQuant: Efficient Sub-1-Bit Quantization of Large Language Models
Paper
• 2602.06694
• Published • 15
Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems
Paper
• 2602.08847
• Published • 29