ERNIE-Image Collection The serieas of image generation models, including text2img、img2img. • 2 items • Updated about 10 hours ago • 10
ClawBench: Can AI Agents Complete Everyday Online Tasks? Paper • 2604.08523 • Published 6 days ago • 252
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published 7 days ago • 163
VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning Paper • 2505.22019 • Published May 28, 2025 • 12
TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders Paper • 2604.07340 • Published 7 days ago • 15
view article Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs 7 days ago • 50
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models Paper • 2604.04707 • Published 9 days ago • 200
Marco-MoE Collection A suit of multilingual MoE models with highly-sparse architectures • 5 items • Updated 7 days ago • 14
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 13 days ago • 842
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published 20 days ago • 130