The extraction of semantics of unstructured documents requires the recognition and classification of textual patterns, their variability, and their inter-relationships, i.e., the analysis of the linguistic structure of documents. Being the integral part of a larger real-life application, this linguistic analysis process must be robust, fast and adaptable. This creates a big challenge for the development of the necessary linguistic base components. In this drill-down, we present several dimensions of this challenge and show how they have been successfully tackled in Ordo. © Springer International Publishing Switzerland 2014.
CITATION STYLE
Neumann, G., Paaß, G., & van den Akker, D. (2014). Linguistics to Structure Unstructured Information. Cognitive Technologies, 39, 383–392. https://doi.org/10.1007/978-3-319-06755-1_29
Mendeley helps you to discover research relevant for your work.