Deep modular network architecture for depth estimation from single indoor images


This article is free to access.

Abstract

We propose a novel deep modular network architecture for indoor scene depth estimation from single RGB images. The proposed architecture consists of a main depth estimation network and two auxiliary semantic segmentation networks. Our insight is that the semantic and geometric structures of a scene are strongly correlated; we therefore utilize global (i.e., room layout) and mid-level (i.e., objects in a room) semantic structures to enhance depth estimation. The first auxiliary network, the layout network, estimates the room layout to infer the positions of the walls, floor, and ceiling. The second auxiliary network, the object network, estimates per-pixel class labels of the objects in a scene, such as furniture, to provide mid-level semantic cues. The estimated semantic structures are fed into the depth estimation network through newly proposed discriminator networks, which assess the reliability of the estimated structures. Evaluation results show that our architecture achieves significant performance improvements over previous approaches on the standard NYU Depth v2 indoor scene dataset.
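The abstract describes a fusion scheme in which auxiliary semantic features are blended into the depth features, weighted by reliability scores from the discriminator networks. The sketch below illustrates that gating idea in minimal form; the function and variable names are illustrative assumptions, not taken from the paper, and real features would be multi-channel tensors rather than flat lists.

```python
# Hypothetical sketch of confidence-gated feature fusion: each auxiliary
# branch's features are scaled by a reliability score in [0, 1] (as a
# discriminator network might produce) before being added to the main
# depth features. Names are illustrative, not from the paper.

def fuse(depth_feat, layout_feat, object_feat, layout_conf, object_conf):
    """Blend auxiliary features into the depth features, each branch
    scaled by its estimated reliability."""
    return [
        d + layout_conf * l + object_conf * o
        for d, l, o in zip(depth_feat, layout_feat, object_feat)
    ]

# Toy example: the layout branch is judged reliable (0.9) and contributes
# strongly; the object branch is judged unreliable (0.1) and is suppressed.
fused = fuse(
    depth_feat=[1.0, 2.0, 3.0, 4.0],
    layout_feat=[0.5, 0.5, 0.5, 0.5],
    object_feat=[2.0, 2.0, 2.0, 2.0],
    layout_conf=0.9,
    object_conf=0.1,
)
```

In this toy setting, an unreliable estimate (e.g., a poor layout prediction for a cluttered room) is effectively down-weighted rather than corrupting the depth features, which is the motivation the abstract gives for the discriminator networks.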

Cite

CITATION STYLE

APA

Ito, S., Kaneko, N., Shinohara, Y., & Sumi, K. (2019). Deep modular network architecture for depth estimation from single indoor images. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11129 LNCS, pp. 324–336). Springer Verlag. https://doi.org/10.1007/978-3-030-11009-3_19
