Learning to generate object segment proposals with multi-modal cues


Abstract

This paper presents a learning-based object segment proposal generation method for stereo images. Unlike existing methods, which mostly rely on low-level appearance cues and handcrafted similarity functions to group segments, our method uses learned deep features and hand-designed geometric features to represent a region, together with a learned similarity network that guides the grouping process. Given an initial segmentation hierarchy, we sequentially merge adjacent regions at each level according to the affinity measured by the similarity network. This merging process generates new segmentation hierarchies, from which a pool of region proposals is produced by taking region singletons, pairs, triplets, and 4-tuples. In addition, we learn a ranking network that predicts the objectness score of each proposal, and we diversify the ranking using Maximum Marginal Relevance measures. Experiments on the Cityscapes dataset show that our approach significantly outperforms both the baseline and the current state of the art.
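The grouping and re-ranking steps lend themselves to a short sketch. The Python snippet below is a minimal, hypothetical illustration only: `similarity`, `objectness`, and `iou` are stand-ins for the paper's learned similarity network, learned ranking network, and overlap measure, and the greedy pair-merging loop and all-regions tuple enumeration are just one plausible reading of the level-wise merging the abstract describes, not the authors' implementation.

```python
from itertools import combinations

def greedy_merge(members, adjacency, similarity):
    """Agglomerate regions guided by a pairwise affinity function.

    members:    dict region_id -> frozenset of base-superpixel ids
    adjacency:  iterable of (i, j) pairs of adjacent region ids
    similarity: stand-in for the learned similarity network; returns
                a scalar affinity for two member sets.
    Returns every region created along the way (the new hierarchy).
    """
    members = dict(members)
    edges = {frozenset(e) for e in adjacency}
    next_id = max(members) + 1
    while edges:
        # merge the most similar adjacent pair, as the network would guide
        i, j = max(edges, key=lambda e: similarity(*(members[k] for k in e)))
        members[next_id] = members[i] | members[j]
        pair = frozenset((i, j))
        # rewire: neighbours of i or j become neighbours of the merged region
        edges = {frozenset(next_id if k in pair else k for k in e)
                 for e in edges if e != pair}
        edges = {e for e in edges if len(e) == 2}  # drop collapsed self-loops
        next_id += 1
    return members

def enumerate_proposals(members, max_tuple=4):
    """Pool proposals from region singletons, pairs, triplets and 4-tuples.

    Purely illustrative: the paper draws tuples from the segmentation
    hierarchies, not from all (combinatorially many) region subsets."""
    proposals = set()
    for n in range(1, max_tuple + 1):
        for combo in combinations(list(members), n):
            proposals.add(frozenset().union(*(members[k] for k in combo)))
    return list(proposals)

def mmr_rank(proposals, objectness, iou, lam=0.7, top_k=100):
    """Diversified ranking via Maximum Marginal Relevance.

    objectness(p) stands in for the learned ranking network; iou(p, q)
    penalises overlap with already-selected proposals."""
    selected, pool = [], list(proposals)
    while pool and len(selected) < top_k:
        best = max(pool, key=lambda p: lam * objectness(p)
                   - (1 - lam) * max((iou(p, q) for q in selected), default=0.0))
        selected.append(best)
        pool.remove(best)
    return selected

if __name__ == "__main__":
    # toy run: four base superpixels in a row, a size-based dummy affinity
    members = {k: frozenset([k]) for k in range(4)}
    hierarchy = greedy_merge(members, [(0, 1), (1, 2), (2, 3)],
                             similarity=lambda a, b: -len(a | b))
    pool = enumerate_proposals(hierarchy, max_tuple=2)
    jaccard = lambda p, q: len(p & q) / len(p | q)  # stand-in overlap measure
    print(mmr_rank(pool, objectness=len, iou=jaccard, top_k=5))
```

In the MMR step, `lam` trades raw objectness against diversity: `lam=1.0` reduces to a plain score-sorted list, while smaller values push down proposals that heavily overlap ones already selected.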

Cite (APA)

Zhang, H., He, X., & Porikli, F. (2017). Learning to generate object segment proposals with multi-modal cues. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10111 LNCS, pp. 121–136). Springer Verlag. https://doi.org/10.1007/978-3-319-54181-5_8
