SReN: Shape regression network for comic storyboard extraction

Zheqi He; Yafeng Zhou; Yongtao Wang; Zhi Tang

Conference ProceedingsOPEN ACCESS

SReN: Shape regression network for comic storyboard extraction

31st AAAI Conference on Artificial Intelligence, AAAI 2017 (2017) 4937-4938

DOI: 10.1609/aaai.v31i1.11074

8Citations

17Readers

Abstract

The goal of storyboard extraction is to decompose the comic image into storyboards, which is the fundamental step of comic image understanding and producing digital comic documents suitable for mobile reading. Most of existing approaches are based on hand crafted low-level visual patters like edge segments and line segments, which do not capture high-level vision information. To overcome this drawback of the existing approaches, we propose a novel architecture based on deep convolutional neural network, named Shape Regression Network (SReN), to detect storyboards within comic images. Firstly, we use Fast R-CNN to generate rectangle bounding boxes as storyboard proposals. Then we train a deep neural network to predict quadrangles for these proposals. Unlike existing object detection methods which only output rectangle bounding boxes, SReN can produce more precise quadrangle bounding boxes. Experimental results on 7382 comic pages, demonstrate that SReN outperforms the state-of-the-art methods by more than 10% in terms of F1-score and page correction rate.

Cite

CITATION STYLE

APA

He, Z., Zhou, Y., Wang, Y., & Tang, Z. (2017). SReN: Shape regression network for comic storyboard extraction. In 31st AAAI Conference on Artificial Intelligence, AAAI 2017 (pp. 4937–4938). AAAI press. https://doi.org/10.1609/aaai.v31i1.11074

SReN: Shape regression network for comic storyboard extraction

Abstract

Cite

Register to see more suggestions