PixelLink: Detecting scene text via instance segmentation

480Citations
Citations of this article
462Readers
Mendeley users who have this article in their library.

Abstract

Most state-of-the-art scene text detection algorithms are deep learning based methods that depend on bounding box regression and perform at least two kinds of predictions: text/non-text classification and location regression. Regression plays a key role in the acquisition of bounding boxes in these methods, but it is not indispensable because text/non-text prediction can also be considered as a kind of semantic segmentation that contains full location information in itself. However, text instances in scene images often lie very close to each other, making them very difficult to separate via semantic segmentation. Therefore, instance segmentation is needed to address this problem. In this paper, PixelLink, a novel scene text detection algorithm based on instance segmentation, is proposed. Text instances are first segmented out by linking pixels within the same instance together. Text bounding boxes are then extracted directly from the segmentation result without location regression. Experiments show that, compared with regression-based methods, PixelLink can achieve better or comparable performance on several benchmarks, while requiring many fewer training iterations and less training data.

References Powered by Scopus

SSD: Single shot multibox detector

25238Citations
N/AReaders
Get full text

Fully convolutional networks for semantic segmentation

24977Citations
N/AReaders
Get full text

Training region-based object detectors with online hard example mining

2260Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Character region awareness for text detection

803Citations
N/AReaders
Get full text

Shape robust text detection with progressive scale expansion network

610Citations
N/AReaders
Get full text

Efficient and accurate arbitrary-shaped text detection with pixel aggregation network

526Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Deng, D., Liu, H., Li, X., & Cai, D. (2018). PixelLink: Detecting scene text via instance segmentation. In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018 (pp. 6773–6780). AAAI press. https://doi.org/10.1609/aaai.v32i1.12269

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 159

72%

Researcher 45

20%

Lecturer / Post doc 12

5%

Professor / Associate Prof. 6

3%

Readers' Discipline

Tooltip

Computer Science 195

80%

Engineering 37

15%

Mathematics 8

3%

Agricultural and Biological Sciences 4

2%

Save time finding and organizing research with Mendeley

Sign up for free