Abstract
We present a single-view voxel model prediction method based on generative adversarial networks. Our method uses correspondences between 2D silhouettes and slices of a camera frustum to predict a voxel model of a scene containing multiple object instances. We exploit a pyramid-shaped voxel grid and a generator network with skip connections between 2D and 3D feature maps. We collected two datasets, VoxelCity and VoxelHome, to train our framework: 36,416 images of 28 scenes with ground-truth 3D models, depth maps, and 6D object poses. The datasets are publicly available (http://www.zefirus.org/Z_GAN). We evaluate our framework on 3D shape datasets and show that it delivers robust 3D scene reconstruction results that compete with and surpass the state of the art in scene reconstruction with multiple non-rigid objects.
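The skip connections between 2D and 3D feature maps described above can be illustrated with a minimal sketch. This is a hypothetical example, not the authors' implementation: it assumes the channel axis of a 2D encoder feature map is reinterpreted as the depth axis of a camera-frustum volume, so that each channel group aligns with one frustum slice (the slice whose silhouette it corresponds to). The function name `skip_2d_to_3d` and all shapes are illustrative assumptions.

```python
import numpy as np

def skip_2d_to_3d(feat_2d, depth_bins):
    """Reinterpret a (C, H, W) 2D feature map as a (D, C//D, H, W) 3D
    feature volume, where D = depth_bins frustum slices.

    Hypothetical sketch of a 2D-to-3D skip connection: the channel axis
    is split into depth_bins groups, one per slice of the camera frustum.
    depth_bins must divide the channel count evenly.
    """
    c, h, w = feat_2d.shape
    assert c % depth_bins == 0, "channels must split evenly into depth slices"
    return feat_2d.reshape(depth_bins, c // depth_bins, h, w)

# Example: a 64-channel 32x32 encoder feature map becomes 16 frustum
# slices, each with 4 feature channels.
feat = np.random.rand(64, 32, 32)
vol = skip_2d_to_3d(feat, depth_bins=16)
print(vol.shape)  # (16, 4, 32, 32)
```

In a generator network, such a reshaped volume could then be concatenated with the corresponding 3D decoder features, in the same spirit as the 2D skip connections of a U-Net.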
Citation
Knyaz, V. A., Kniaz, V. V., & Remondino, F. (2019). Image-to-voxel model translation with conditional adversarial networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11129 LNCS, pp. 601–618). Springer Verlag. https://doi.org/10.1007/978-3-030-11009-3_37