MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO
Paper • 2505.13031
This repository contains the MindOmni model described in the paper "MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO".