SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks
Paper • 2606.09669 • Published • 38
None defined yet.
Boosting Omni-Modal Language Models: Staged Post-Training with Visually Debiased Evaluation
Step-Audio-R1.5 Technical Report