Several text mining tasks, such as text categorization, document clustering, and information retrieval, operate at the document level and rely on the so-called bag-of-words model. Other tasks, such as document summarization, information extraction, and question answering, must operate at the sentence level to meet their specific requirements. Both groups of tasks are typically affected by data sparsity, but the problem is more pronounced for the latter group. Thus, while the tasks of the first group can be tackled by statistical and machine learning methods based on a bag-of-words approach alone, the tasks of the second group need natural language processing (NLP) at the sentence or paragraph level in order to produce more informative features. © 2007 Springer-Verlag London Limited.
Mustafaraj, E., Hoof, M., & Freisleben, B. (2007). Mining diagnostic text reports by learning to annotate knowledge roles. In Natural Language Processing and Text Mining (pp. 45–67). Springer London. https://doi.org/10.1007/978-1-84628-754-1_4