Visual-Linguistic Methods for Receipt Field Recognition

Rinon Gal; Nimrod Morag; Roy Shilkrot

Conference Proceedings

Visual-Linguistic Methods for Receipt Field Recognition

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11362 LNCS 542-557

DOI: 10.1007/978-3-030-20890-5_35

1Citations

8Readers

Get full text

Abstract

Receipts are crucial for many businesses’ operation, where expenses are tracked meticulously. Receipt documents are often scanned into images, digitized and analyzed before the information is streamed into institutional financial applications. The precise extraction of expense data from receipt images is a difficult task owed to the high variance in fonts and layouts, the frailty of the print paper, unstructured scanning environments and an immeasurable amount of domains. We propose a method that combines visual and linguistic features for automatic information retrieval from receipt images using deep network architectures, which outperforms existing approaches. Our Skip-Rect Embedding (SRE) descriptor is demonstrated in two canonical applications for receipt information retrieval: field extraction and Optical Character Recognition (OCR) error enhancement.

Cite

CITATION STYLE

APA

Gal, R., Morag, N., & Shilkrot, R. (2019). Visual-Linguistic Methods for Receipt Field Recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11362 LNCS, pp. 542–557). Springer Verlag. https://doi.org/10.1007/978-3-030-20890-5_35

Visual-Linguistic Methods for Receipt Field Recognition

Abstract

Cite

Register to see more suggestions