Poster Sat, Jun 6, 2026 • 10:45 AM – 12:45 PM PDT ExHall F

SPARROW: Learning Spatial Precision and Temporal Referential Consistency in Pixel-Grounded Video MLLMs

Mohamad Alansari ⋅ Naufal Suryanto ⋅ Divya Velayudhan ⋅ Sajid Javed ⋅ Naoufel Werghi ⋅ Muzammal Naseer

Abstract
