view article Article The Open Source Community is backing OpenEnv for Agentic RL +15 burtenshaw, spisakjo, lysandre, darktex, willcb, qjoy, pawalt, cwing-nv, danielhanchen, andrewzhou, shimmyshimmer, Hamid-Nazeri, Sanyam, zkwentz, emre0, lewtun, sergiopaniego • 6 days ago • 78
ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research Paper • 2606.07591 • Published 17 days ago • 87
view article Article Designing the hf CLI as an agent-optimized way to work with the Hub celinah, Wauplin • 10 days ago • 56
CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows? Paper • 2605.16679 • Published 30 days ago • 53
Watch Before You Answer: Learning from Visually Grounded Post-Training Paper • 2604.05117 • Published Apr 6 • 36
view article Article EMO: Pretraining mixture of experts for emergent modularity allenai • May 8 • 38
view article Article Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law mishig • May 11 • 24
view article Article Introducing the agentic robotics appstore for 10,000 Reachy Minis clem • May 6 • 36
MediaTech Collection Collection of public datasets from the French administration, chunked, vectorized and ready to use in AI projects. • 9 items • Updated Feb 4 • 10
Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows Paper • 2604.28139 • Published Apr 30 • 42
view article Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • Apr 24 • 50