

Poster

VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Shehan Munasinghe ⋅ Hanan Gani ⋅ Wenqi Zhu ⋅ Jiale Cao ⋅ Eric P. Xing ⋅ Fahad Shahbaz Khan ⋅ Salman Khan
2025 Poster

Abstract
