RMGN: A Regional Mask Guided Network for Parser-free Virtual Try-on

6Citations
Citations of this article
24Readers
Mendeley users who have this article in their library.

Abstract

Virtual try-on (VTON) aims at fitting target clothes to reference person images, which is widely adopted in e-commerce. Existing VTON approaches can be narrowly categorized into Parser-Based (PB) and Parser-Free (PF) by whether relying on the parser information to mask the persons' clothes and synthesize try-on images. Although abandoning parser information has improved the applicability of PF methods, the ability of detail synthesizing has also been sacrificed. As a result, the distraction from original cloth may persist in synthesized images, especially in complicated postures and high resolution applications. To address the aforementioned issue, we propose a novel PF method named Regional Mask Guided Network (RMGN). More specifically, a regional mask is proposed to explicitly fuse the features of target clothes and reference persons so that the persisted distraction can be eliminated. A posture awareness loss and a multi-level feature extractor are further proposed to handle the complicated postures and synthesize high resolution images. Extensive experiments demonstrate that our proposed RMGN outperforms both state-of-the-art PB and PF methods. Ablation studies further verify the effectiveness of modules in RMGN. Code is available at https://github.com/jokerlc/RMGN-VITON.

Cite

CITATION STYLE

APA

Lin, C., Li, Z., Zhou, S., Hu, S., Zhang, J., Luo, L., … He, Y. (2022). RMGN: A Regional Mask Guided Network for Parser-free Virtual Try-on. In IJCAI International Joint Conference on Artificial Intelligence (pp. 1151–1158). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2022/161

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free