QEVA: A Reference-Free Evaluation Metric for Narrative Video Summarization with Multimodal Question Answering Paper • 2604.24052 • Published 4 days ago
Visual Funnel: Resolving Contextual Blindness in Multimodal Large Language Models Paper • 2512.10362 • Published Dec 11, 2025 • 1