RWTH OCR: A Large Vocabulary Optical Character Recognition System for Arabic Scripts

  • Dreuw P
  • Rybach D
  • Heigold G
  • et al.

Abstract

We present a novel large-vocabulary OCR system, which implements a confidence- and margin-based discriminative training approach for model adaptation of an HMM-based recognition system to handle multiple fonts, different handwriting styles, and their variations. Most current HMM approaches are HTK-based systems which are maximum-likelihood (ML) trained and which try to adapt their models to different writing styles using writer adaptive training, unsupervised clustering, or additional writer-specific data. Here, discriminative training based on the Maximum Mutual Information (MMI) and Minimum Phone Error (MPE) criteria is used instead. For model adaptation during decoding, unsupervised confidence-based discriminative training within a two-pass decoding process is proposed. Additionally, we use neural-network-based features extracted by a hierarchical multi-layer perceptron (MLP) network, either in a hybrid MLP/HMM approach or to discriminatively retrain a Gaussian HMM system in a tandem approach. The proposed framework and methods are evaluated for closed-vocabulary isolated handwritten word recognition on the IfN/ENIT Arabic handwriting database, where the word error rate is decreased by more than 50% relative to an ML-trained baseline system. Preliminary results for large-vocabulary Arabic machine-printed text recognition tasks are presented on a novel publicly available newspaper database.

Cite

Dreuw, P., Rybach, D., Heigold, G., & Ney, H. (2012). RWTH OCR: A Large Vocabulary Optical Character Recognition System for Arabic Scripts. In Guide to OCR for Arabic Scripts (pp. 215–254). Springer London. https://doi.org/10.1007/978-1-4471-4072-6_9
