Cross-Modal Emotion Transfer for Emotion Editing in Talking Face Video Paper • 2604.07786 • Published 13 days ago • 6
HandVQA: Diagnosing and Improving Fine-Grained Spatial Reasoning about Hands in Vision-Language Models Paper • 2603.26362 • Published 25 days ago
LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures Paper • 2507.06109 • Published Jul 8, 2025
Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects Paper • 2403.16428 • Published Mar 25, 2024
BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting Paper • 2504.09097 • Published Apr 12, 2025