Poster Sat, Jun 6, 2026 • 10:45 AM – 12:45 PM PDT ExHall F

SpaceMind: Camera-Guided Modality Fusion for Spatial Reasoning in Vision-Language Models

Ruosen Zhao ⋅ Zhikang Zhang ⋅ Jialei Xu ⋅ Jiahao Chang ⋅ Dong Chen ⋅ Lingyun Li ⋅ Weijian Sun ⋅ Zizhuang Wei

Abstract
