To develop a generic method for document recognition, it is necessary to build a system with a generic approach for dealing with noise. Indeed, a lot of noise is present in an image and a recognizer needs to find the right information in the middle of noise to make a recognition. We describe in this paper the parser we develop in DMOS, a generic method for structured document recognition. This method use EPF, a grammatical language for describing documents. From an EPF description, a new recognition system is automatically build by compilation. DMOS had been successfully used for musical scores, mathematical formulae, table structure and old forms recognition (tested on 60,000 documents). To illustrate the dealing of noise and to show how it is easy to define a grammatical description in EPF, we present in this paper a real and complete grammar defined to detect tennis court in videos. Even if this application is not directly on document, tennis court offers a good illustration example and has the same kind of problems as those found in structured documents. © Springer-Verlag 2004.
CITATION STYLE
Coüasnon, B. (2004). Dealing with Noise in DMOS, a Generic Method for Structured Document Recognition: An Example on a Complete Grammar. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3088, 38–49. https://doi.org/10.1007/978-3-540-25977-0_4
Mendeley helps you to discover research relevant for your work.