Workshop

Workshop on 3D-LLM/VLA: Bridging Language, Vision and Action in 3D Environments

Jianing "Jed" Yang ⋅ Shengyi Qian ⋅ Yining Hong ⋅ Valts Blukis ⋅ Xiaojian Ma ⋅ Yash Bhalgat ⋅ Iro Laina ⋅ Joyce Chai ⋅ David Fouhey

Project Page

Abstract

This workshop addresses a critical gap in current AI research by focusing on the integration of language and 3D perception, which is essential for developing embodied agents and robots, especially considering the recent rise of multimodal LLMs and vision-language-action (VLA) models.

The workshop will explore challenges and opportunities in this area, providing a platform for researchers to share their work, discuss future directions, and foster collaboration across disciplines including robotics, computer vision, natural language processing, and human-computer interaction.

Chat is not available.