Skip to yearly menu bar Skip to main content


Poster Sun, Jun 7, 2026 • 10:45 AM – 12:45 PM PDT ExHall F

SegMo: Co-Designing Content-Aware Sparsity and Locally-Cohesive Segment Parallelism for Efficient VLM Inference

Haojuan Li ⋅ Ruohan Tang ⋅ Dongzhou Cheng ⋅ Zongpu Zhang ⋅ Jian Li ⋅ Jiaqi Wang

Abstract

Log in and register to view live content