Skip to yearly menu bar Skip to main content


Poster Fri, Jun 5, 2026 • 3:00 PM – 5:00 PM PDT ExHall A & F

SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization

Xuankun Rong ⋅ Wenke Huang ⋅ Tingfeng Wang ⋅ Daiguo Zhou ⋅ Bo Du ⋅ Mang Ye

Abstract

Log in and register to view live content