EdgeNet: Semantic scene completion from a single RGB-D image

Aloisio Dourado; Teofilo E. de Campos; Hansung Kim; Adrian Hilton

Conference Proceedings

EdgeNet: Semantic scene completion from a single RGB-D image

Proceedings - International Conference on Pattern Recognition (2020) 503-510

DOI: 10.1109/ICPR48806.2021.9413252

25Citations

29Readers

Get full text

Abstract

Semantic scene completion is the task of predicting a complete 3D representation of volumetric occupancy with corresponding semantic labels for a scene from a single point of view. In this paper, we present EdgeNet, a new end-to-end neural network architecture that fuses information from depth and RGB, explicitly representing RGB edges in 3D space. Previous works on this task used either depth-only or depth with colour by projecting 2D semantic labels generated by a 2D segmentation network into the 3D volume, requiring a two step training process. Our EdgeNet representation encodes colour information in 3D space using edge detection and flipped truncated signed distance, which improves semantic completion scores especially in hard to detect classes. We achieved state-of-the-art scores on both synthetic and real datasets with a simpler and a more computationally efficient training pipeline than competing approaches.

Cite

CITATION STYLE

APA

Dourado, A., de Campos, T. E., Kim, H., & Hilton, A. (2020). EdgeNet: Semantic scene completion from a single RGB-D image. In Proceedings - International Conference on Pattern Recognition (pp. 503–510). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICPR48806.2021.9413252

EdgeNet: Semantic scene completion from a single RGB-D image

Abstract

Cite

Register to see more suggestions