Learning Long-term Motion Embeddings for Efficient Kinematics Generation Paper • 2604.11737 • Published 3 days ago • 4
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published 3 days ago • 62
Strips as Tokens: Artist Mesh Generation with Native UV Segmentation Paper • 2604.09132 • Published 6 days ago • 47
QuanBench+: A Unified Multi-Framework Benchmark for LLM-Based Quantum Code Generation Paper • 2604.08570 • Published 22 days ago • 119
ELT: Elastic Looped Transformers for Visual Generation Paper • 2604.09168 • Published 6 days ago • 19
RefineAnything: Multimodal Region-Specific Refinement for Perfect Local Details Paper • 2604.06870 • Published 8 days ago • 38
Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory Paper • 2604.08995 • Published 6 days ago • 42
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 7 days ago • 230
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper • 2604.08377 • Published 7 days ago • 274
Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision Paper • 2604.04934 • Published 10 days ago • 42
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 10 days ago • 107
GaussianGPT: Towards Autoregressive 3D Gaussian Scene Generation Paper • 2603.26661 • Published 19 days ago • 26
Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models Paper • 2603.25750 • Published 27 days ago • 36
4DGS360: 360° Gaussian Reconstruction of Dynamic Objects from a Single Video Paper • 2603.21618 • Published 24 days ago • 15
Toward Physically Consistent Driving Video World Models under Challenging Trajectories Paper • 2603.24506 • Published 21 days ago • 6
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published 27 days ago • 66
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild Paper • 2603.17187 • Published 29 days ago • 138
WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation Paper • 2603.16871 • Published 29 days ago • 60