Skip to yearly menu bar Skip to main content


Poster Sat, Jun 6, 2026 • 3:45 PM – 5:45 PM PDT ExHall A & F

UVU: Improving Multimodal Understanding via Vision-Language Unified Autoregressive Paradigm

Zhehan Kan ⋅ Xinghua Jiang ⋅ Yanlin Liu ⋅ Xiaochen Yang ⋅ ZHIXIANG WEI ⋅ Shifeng Liu ⋅ Yubo Zhu ⋅ Qingmin Liao ⋅ Wenming Yang ⋅ Xin Li ⋅ Yinsong Liu ⋅ Deqiang Jiang ⋅ Xing Sun

Abstract

Log in and register to view live content