Paper in Workshop: PixFoundation: Workshop on Pixel-level Vision Foundation Models
Hierarchical Semantic Segmentation with Autoregressive Language Modeling
Josh Myers-Dean · Brian Price · Yifei Fan · Danna Gurari
Abstract:
Hierarchical semantic segmentation entails progressively decomposing objects into smaller nested parts. Existing approaches require either multiple inference passes or multiple fixed decoders. We instead introduce HALLUMI, an autoregressive language modeling framework that performs the task in one inference pass, relying on special tokens to indicate parent-child relationships so the hierarchy can be recovered from the generated text. Experiments on SPIN, a hierarchical semantic segmentation dataset annotated down to the subpart level, show that HALLUMI achieves state-of-the-art results.
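To make the idea of recovering a hierarchy from generated text concrete, the following is a minimal sketch of how a flat token sequence with nesting markers could be parsed back into a tree. It is not HALLUMI's actual output format: the abstract does not specify the token vocabulary, so the `<child>` and `</child>` markers, the `Node` class, and the `parse_hierarchy` function are all hypothetical names chosen for illustration. Segmentation mask tokens and multi-word labels are left out.

```python
from dataclasses import dataclass, field
from typing import Optional

# Hypothetical special tokens (assumption): the abstract does not name the real ones.
OPEN, CLOSE = "<child>", "</child>"


@dataclass
class Node:
    """One object, part, or subpart label with its nested children."""
    name: str
    children: list["Node"] = field(default_factory=list)


def parse_hierarchy(tokens: list[str]) -> list[Node]:
    """Rebuild a forest of nested parts from a flat token sequence.

    OPEN means "the labels that follow are children of the previous label";
    CLOSE returns to the parent level.
    """
    roots: list[Node] = []
    stack: list[Node] = []        # chain of ancestors currently open
    last: Optional[Node] = None   # label that an OPEN would attach children to
    for tok in tokens:
        if tok == OPEN:
            if last is None:
                raise ValueError("OPEN marker without a preceding label")
            stack.append(last)
            last = None
        elif tok == CLOSE:
            if not stack:
                raise ValueError("CLOSE marker without a matching OPEN")
            last = stack.pop()
        else:
            node = Node(tok)
            # Attach to the innermost open parent, or to the top level.
            (stack[-1].children if stack else roots).append(node)
            last = node
    if stack:
        raise ValueError("sequence ended with an unclosed OPEN marker")
    return roots


# Illustrative sequence (made up, not taken from SPIN).
tokens = "car <child> wheel <child> rim tire </child> door </child> person".split()
forest = parse_hierarchy(tokens)
```

In this sketch a single left-to-right pass with a stack is enough to rebuild the object, part, and subpart levels, which mirrors why one generated sequence can carry the whole hierarchy.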