GRPO-CARE: Improving MLLM Reasoning with Consistency-Aware RL

June 26, 2025
Based on paper:2506.16141v1
Loading content...