Skip to yearly menu bar Skip to main content


Poster

ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation

Ali Athar ⋅ Xueqing Deng ⋅ Liang-Chieh Chen
2025 Poster

Abstract

Chat is not available.