Adding the third dimension to spatial relation detection in 2D images

Brandon Birmingham; Adrian Muscat; Anja Belz

Conference Proceedings (Open Access)

INLG 2018 - 11th International Natural Language Generation Conference, Proceedings of the Conference (2018) 146-151

DOI: 10.18653/v1/w18-6517

8 Citations

67 Readers

Abstract

Detection of spatial relations between objects in images is currently a popular subject in image description research. A range of different language and geometric object features have been used in this context, but methods have not so far used explicit information about the third dimension (depth), except when manually added to annotations. The lack of such information hampers detection of spatial relations that are inherently 3D. In this paper, we use a fully automatic method for creating a depth map of an image and derive several different object-level depth features from it which we add to an existing feature set to test the effect on spatial relation detection. We show that performance increases are obtained from adding depth features in all scenarios tested.

Cite

Birmingham, B., Muscat, A., & Belz, A. (2018). Adding the third dimension to spatial relation detection in 2D images. In INLG 2018 - 11th International Natural Language Generation Conference, Proceedings of the Conference (pp. 146–151). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w18-6517
