Poster Sat, Jun 6, 2026 • 3:45 PM – 5:45 PM PDT ExHall A & F

AMusE: Audio-Visual Benchmark and Alignment Framework for Agentic Multi-Speaker Understanding

Sanjoy Chowdhury ⋅ Karren Dai Yang ⋅ Xudong Liu ⋅ Fartash Faghri ⋅ Pavan Kumar Anasosalu Vasu ⋅ Oncel Tuzel ⋅ Dinesh Manocha ⋅ Chun-Liang Li ⋅ Raviteja Vemulapalli

Abstract
