Topic detection of unrestricted texts: Approaches and evaluations

4Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Topic detection and tracking refers to automatic techniques for locating topically related cohesive paragraphs in a stream of text. Most documents are about more than one subject, but many Natural Language Processing (NLP) and Information Retrieval (IR) techniques implicitly assume documents have just one topic. Even in the presence of a single topic within a document, the document may address multiple subtopics and various aspects of the primary topic. Hence, dividing documents into topically coherent units and discovering their topic might have many uses. We describe new clues that account for the topic of grouping of contiguous portions of the text. Those clues are based on general lexical resources, which make them applicable to unrestricted texts, and can have many uses such as helping users find answers to general questions in an information search task, or in question/answering systems, or in text summarization. We devise an algorithm for identifying these clues, and we report on the performance of these clues, as well as the improvements suggested by our experiments.

Cite

CITATION STYLE

APA

Chali, Y. (2005). Topic detection of unrestricted texts: Approaches and evaluations. Applied Artificial Intelligence, 19(2), 119–135. https://doi.org/10.1080/08839510590887441

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free