Pioneer Agent: Continual Improvement of Small Language Models in Production Paper • 2604.09791 • Published 9 days ago • 2
Toward Autonomous Long-Horizon Engineering for ML Research Paper • 2604.13018 • Published 5 days ago • 31
Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2604.12374 • Published 5 days ago • 29
SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences? Paper • 2604.10718 • Published 7 days ago • 4
The Depth Ceiling: On the Limits of Large Language Models in Discovering Latent Planning Paper • 2604.06427 • Published 12 days ago • 11
ParseBench: A Document Parsing Benchmark for AI Agents Paper • 2604.08538 • Published 10 days ago • 7
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published 6 days ago • 26
The ATOM Report: Measuring the Open Language Model Ecosystem Paper • 2604.07190 • Published 11 days ago • 5
MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models Paper • 2603.25744 • Published 23 days ago • 13
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning Paper • 2603.22057 • Published 26 days ago • 46
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models Paper • 2603.17051 • Published Mar 17 • 109
When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning Paper • 2603.21289 • Published 27 days ago • 35
Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought Paper • 2603.22847 • Published 25 days ago • 26
Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models Paper • 2603.24844 • Published 24 days ago • 10