

Poster Sat, Jun 6, 2026 • 10:45 AM – 12:45 PM PDT ExHall F

StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues

Zanxi Ruan ⋅ Songqun Gao ⋅ Qiuyu Kong ⋅ Yiming Wang ⋅ Marco Cristani

Abstract
