We examine the nature of ground-truth: whether it is always well-defined for a given task, or only relative and approximate. In the conventional scenario, reference data is produced by recording the interpretation of each test document using a chosen data-entry platform. Looking a little more closely at this process, we study its constituents and their interrelations. We provide examples from the literature and from our own experiments where non-trivial problems with each of the components appear to preclude the possibility of real progress in evaluating automated graphics recognition systems, and propose possible solutions. More specifically, for documents with complex structure we recommend multi-valued, layered, weighted, functional ground-truth supported by model-guided reference data-entry systems and protocols. Mostly, however, we raise far more questions than we currently have answers for.
CITATION STYLE
Lopresti, D., Nagy, G., Blostein, D., & Kwon, Y.-B. (2002). Graphics Recognition Algorithms and Applications. (D. Blostein & Y.-B. Kwon, Eds.), Graphics Recognition Algorithms and Applications (Vol. 2390, pp. 46–67). Berlin, Heidelberg: Springer Berlin Heidelberg. Retrieved from http://www.springerlink.com/content/p4864636l02uh383/
Mendeley helps you to discover research relevant for your work.