Current generative frameworks use end-to-end learning and generate images by sampling from a uniform noise distribution. However, these approaches ignore the most basic principle of image formation: images are the product of two factors: (a) Structure: the underlying 3D model; and (b) Style: the texture mapped onto the structure. In this paper, we factorize the image generation process and propose the Style and Structure Generative Adversarial Network (S2-GAN). Our S2-GAN has two components: the Structure-GAN generates a surface normal map, and the Style-GAN takes the surface normal map as input and generates the 2D image. Apart from the real vs. generated loss function, we use an additional loss that enforces consistency between the input normal map and surface normals computed from the generated image. The two GANs are first trained independently and then merged via joint learning. We show that the S2-GAN model is interpretable, generates more realistic images, and can be used to learn unsupervised RGBD representations.
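To make the two-stage factorization concrete, the following is a minimal PyTorch sketch of the generation pipeline: one generator maps noise to a surface normal map (structure), and a second generator conditions on that map, plus its own noise, to render an RGB image (style). The module names, layer sizes, and 32x32 resolution are illustrative assumptions, not the paper's architecture; the discriminators, the surface-normal consistency loss, and joint training are omitted.

```python
import torch
import torch.nn as nn

# Illustrative sketch only: shallow DCGAN-style networks standing in for
# the paper's deeper Structure-GAN and Style-GAN generators.

class StructureGenerator(nn.Module):
    """Maps a latent noise vector to a 3-channel surface normal map."""
    def __init__(self, z_dim=100):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(z_dim, 256, 4, 1, 0), nn.BatchNorm2d(256), nn.ReLU(True),
            nn.ConvTranspose2d(256, 128, 4, 2, 1), nn.BatchNorm2d(128), nn.ReLU(True),
            nn.ConvTranspose2d(128, 64, 4, 2, 1), nn.BatchNorm2d(64), nn.ReLU(True),
            nn.ConvTranspose2d(64, 3, 4, 2, 1), nn.Tanh(),  # normal components in [-1, 1]
        )

    def forward(self, z):
        return self.net(z.view(z.size(0), -1, 1, 1))

class StyleGenerator(nn.Module):
    """Conditions on a surface normal map (plus noise) to render an RGB image."""
    def __init__(self, z_dim=100):
        super().__init__()
        self.encode = nn.Sequential(  # downsample the normal map to a feature grid
            nn.Conv2d(3, 64, 4, 2, 1), nn.LeakyReLU(0.2, True),
            nn.Conv2d(64, 128, 4, 2, 1), nn.BatchNorm2d(128), nn.LeakyReLU(0.2, True),
        )
        self.decode = nn.Sequential(  # fuse noise with structure features, upsample to RGB
            nn.ConvTranspose2d(128 + z_dim, 128, 4, 2, 1), nn.BatchNorm2d(128), nn.ReLU(True),
            nn.ConvTranspose2d(128, 3, 4, 2, 1), nn.Tanh(),
        )

    def forward(self, normals, z):
        h = self.encode(normals)
        # Broadcast the style noise over the spatial grid before fusing.
        z_map = z.view(z.size(0), -1, 1, 1).expand(-1, -1, h.size(2), h.size(3))
        return self.decode(torch.cat([h, z_map], dim=1))

if __name__ == "__main__":
    g_struct, g_style = StructureGenerator(), StyleGenerator()
    z_structure, z_style = torch.randn(4, 100), torch.randn(4, 100)
    normals = g_struct(z_structure)        # (4, 3, 32, 32) surface normal maps
    images = g_style(normals, z_style)     # (4, 3, 32, 32) rendered RGB images
```

Because the two stages are decoupled, one can fix the normal map and resample only the style noise to vary texture while holding scene structure constant, which is the sense in which the model is interpretable.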
Wang, X., & Gupta, A. (2016). Generative image modeling using style and structure adversarial networks. In Computer Vision – ECCV 2016, Lecture Notes in Computer Science, Vol. 9908, pp. 318–335. Springer. https://doi.org/10.1007/978-3-319-46493-0_20