Multi-task model for comic book image analysis

Nhu Van Nguyen; Christophe Rigaud; Jean Christophe Burie

Conference Proceedings

Multi-task model for comic book image analysis

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11296 LNCS 637-649

DOI: 10.1007/978-3-030-05716-9_57

11Citations

14Readers

Get full text

Abstract

Comic book image analysis methods often propose multiple algorithms or models for multiple tasks like panels and characters detection, balloons segmentation and text recognition, etc. In this work, we aim to reduce the complexity for comic book image analysis by proposing one model which can learn multiple tasks called Comic MTL. In addition to the detection task and segmentation task, we integrate the relation analysis task for balloons and characters into the Comic MTL model. The experiments with our model are carried out on the eBDtheque dataset which contains the annotations for panels, balloons, characters and also the relations balloon-character. We show that the Comic MTL model can detect the association between balloons and their speakers (comic characters) and handle other tasks like panels, characters detection and balloons segmentation with promising results.

Author supplied keywords

Cite

CITATION STYLE

APA

Nguyen, N. V., Rigaud, C., & Burie, J. C. (2019). Multi-task model for comic book image analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11296 LNCS, pp. 637–649). Springer Verlag. https://doi.org/10.1007/978-3-030-05716-9_57

Multi-task model for comic book image analysis

Abstract

Author supplied keywords

Cite

Register to see more suggestions