Poster Sun, Jun 7, 2026 • 2:30 PM – 4:30 PM PDT ExHall A

R-C2: Cycle-Consistent Reinforcement Learning Improves Multimodal Reasoning

Zirui Zhang ⋅ Haoyu Dong ⋅ Kexin Pei ⋅ Chengzhi Mao

Abstract
