Learning dilation factors for semantic segmentation of street scenes

Yang He; Margret Keuper; Bernt Schiele; Mario Fritz

Conference Proceedings

Learning dilation factors for semantic segmentation of street scenes

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10496 LNCS 41-51

DOI: 10.1007/978-3-319-66709-6_4

10Citations

32Readers

Get full text

Abstract

Contextual information is crucial for semantic segmentation. However, finding the optimal trade-off between keeping desired fine details and at the same time providing sufficiently large receptive fields is non trivial. This is even more so, when objects or classes present in an image significantly vary in size. Dilated convolutions have proven valuable for semantic segmentation, because they allow to increase the size of the receptive field without sacrificing image resolution. However, in current state-of-the-art methods, dilation parameters are hand-tuned and fixed. In this paper, we present an approach for learning dilation parameters adaptively per channel, consistently improving semantic segmentation results on street-scene datasets like Cityscapes and Camvid.

Cite

CITATION STYLE

APA

He, Y., Keuper, M., Schiele, B., & Fritz, M. (2017). Learning dilation factors for semantic segmentation of street scenes. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10496 LNCS, pp. 41–51). Springer Verlag. https://doi.org/10.1007/978-3-319-66709-6_4

Learning dilation factors for semantic segmentation of street scenes

Abstract

Cite

Register to see more suggestions