MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
Paper • arXiv:2408.13257