DiningBench: A Hierarchical Multi-view Benchmark for Perception and Reasoning in the Dietary Domain Paper • 2604.10425 • Published 13 days ago • 3
ViPER: Empowering the Self-Evolution of Visual Perception Abilities in Vision-Language Model Paper • 2510.24285 • Published Oct 28, 2025 • 3
FinRpt: Dataset, Evaluation System and LLM-based Multi-agent Framework for Equity Research Report Generation Paper • 2511.07322 • Published Nov 10, 2025