Innovative Text Extraction Algorithm Based on TensorFlow

Shichen Zhai; Xiaogang Wang; Di Xiao; Zhiwen Li

Conference Proceedings

Advances in Intelligent Systems and Computing (2019) 834 529-536

DOI: 10.1007/978-981-13-5841-8_55

Abstract

Extracting business registration information from the Internet with image recognition algorithms is vital to e-commerce. However, this information is usually published as images, and existing image recognition systems are hindered by slow detection, low accuracy, and complex operation. We therefore propose an innovative text extraction algorithm based on TensorFlow (TEAT). We first use a web crawler to obtain the source data, and then extract the character information with TEAT, which builds on the recognition technology of the TensorFlow framework. TEAT extracts business registration information efficiently and effectively. Compared with an existing text extraction algorithm based on the Tess4j framework on the task of extracting information from Tmall shop business license pictures, TEAT achieves higher accuracy and efficiency.
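
The two-stage flow described in the abstract (crawl pages for license images, then run a recognizer over each image) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the names `find_image_urls`, `extract_text`, `fetch`, and `recognize` are hypothetical, the example URLs are placeholders, and the recognizer is left as a pluggable callable where a TensorFlow-based model (or Tess4j, for the comparison) would be supplied.

```python
from html.parser import HTMLParser
from typing import Callable, Dict, Iterable, List
from urllib.parse import urljoin


class ImageLinkParser(HTMLParser):
    """Collects the resolved src of every <img> tag in a page."""

    def __init__(self, base_url: str) -> None:
        super().__init__()
        self.base_url = base_url
        self.image_urls: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                # Resolve relative links against the page URL.
                self.image_urls.append(urljoin(self.base_url, src))


def find_image_urls(html: str, base_url: str) -> List[str]:
    """Crawler stage (assumed): list the image URLs found in one page."""
    parser = ImageLinkParser(base_url)
    parser.feed(html)
    return parser.image_urls


def extract_text(
    image_urls: Iterable[str],
    fetch: Callable[[str], bytes],
    recognize: Callable[[bytes], str],
) -> Dict[str, str]:
    """Recognition stage (assumed): map each image URL to its recognized text.

    `fetch` downloads the image bytes; `recognize` is a placeholder for
    whatever model performs the character recognition.
    """
    return {url: recognize(fetch(url)) for url in image_urls}
```

Keeping `fetch` and `recognize` as parameters means the same pipeline could be run with two different recognizers on the same images, which is the kind of side-by-side comparison the abstract reports.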

Citation (APA)

Zhai, S., Wang, X., Xiao, D., & Li, Z. (2019). Innovative Text Extraction Algorithm Based on TensorFlow. In Advances in Intelligent Systems and Computing (Vol. 834, pp. 529–536). Springer Verlag. https://doi.org/10.1007/978-981-13-5841-8_55
