Poster Sat, Jun 6, 2026 • 3:45 PM – 5:45 PM PDT ExHall A & F

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Zhengyang Sun ⋅ Yu Chen ⋅ Xin Zhou ⋅ Xiaofan Li ⋅ Xiwu Chen ⋅ Dingkang Liang ⋅ Xiang Bai

Abstract
