Extracting significant time varying features from text

77Citations
Citations of this article
36Readers
Mendeley users who have this article in their library.

Abstract

We propose a simple statistical model for the frequency of occurrence of features in a stream of text. Adoption of this model allows us to use classical significance tests to filter the stream for interesting events. We tested the model by building a system and running it on a news corpus. By a subjective evaluation, the system worked remarkably well: almost all of the groups of identified tokens corresponded to news stories and were appropriately placed in time. A preliminary objective evaluation was also used to measure the quality of the system and it showed some of the weaknesses and the power of our approach.

Cited by Powered by Scopus

CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature

4833Citations
N/AReaders
Get full text

Bursty and Hierarchical Structure in Streams

1050Citations
N/AReaders
Get full text

Bursty and hierarchical structure in streams

874Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Swan, R., & Allan, J. (1999). Extracting significant time varying features from text. In International Conference on Information and Knowledge Management, Proceedings (pp. 38–45). ACM. https://doi.org/10.1145/319950.319956

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 21

72%

Professor / Associate Prof. 7

24%

Researcher 1

3%

Readers' Discipline

Tooltip

Computer Science 28

90%

Social Sciences 1

3%

Engineering 1

3%

Economics, Econometrics and Finance 1

3%

Save time finding and organizing research with Mendeley

Sign up for free