Skip to yearly menu bar Skip to main content


Poster Sun, Jun 7, 2026 • 2:30 PM – 4:30 PM PDT ExHall A

Scenes as Tokens: Multi-Scale Normal Distributions Transform Tokenizer for General 3D Vision–Language Understanding

Yutao Tang ⋅ Cheng Zhao ⋅ Gaurav Mittal ⋅ Rohith Kukkala ⋅ Rama Chellappa ⋅ Cheng Peng ⋅ Mei Chen

Abstract

Log in and register to view live content