Skip to yearly menu bar Skip to main content


Poster

LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding

Hongyu Li ⋅ Jinyu Chen ⋅ Ziyu Wei ⋅ Shaofei Huang ⋅ Tianrui Hui ⋅ Jialin Gao ⋅ Xiaoming Wei ⋅ Si Liu
2025 Poster

Abstract

Chat is not available.