Pattern mining across domain-specific text collections

Lee Gillam; Khurshid Ahmad

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2005) 3587 LNAI 570-579

DOI: 10.1007/11510888_56

Abstract

This paper discusses a consistency in patterns of language use across domain-specific collections of text. We present a method for the automatic identification of domain-specific keywords (specialist terms) based on comparing language use in scientific domain-specific text collections with language use in texts intended for a more general audience. The method supports automatic production of collocational networks, and of networks of concepts (thesauri, or so-called ontologies). The method involves a novel combination of existing metrics from work in computational linguistics, which can enable extraction, or learning, of these kinds of networks. Creation of ontologies or thesauri is informed by international (ISO) standards in terminology science, and the resulting resource can be used to support a variety of work, including data-mining applications. © Springer-Verlag Berlin Heidelberg 2005.
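
The abstract does not name the metrics it combines, so the following is only a minimal sketch of the general idea of comparing word use in a domain collection against a general-language collection. It uses a smoothed relative-frequency ratio, which is one common choice and is an assumption here, not necessarily the authors' method. The function names (`tokenize`, `frequency_ratio`, `top_keywords`) and the `smoothing` parameter are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return its alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def frequency_ratio(domain_texts, general_texts, smoothing=1.0):
    """Score each domain word by how much more often it occurs in the
    domain collection than in the general collection.

    Assumption: a smoothed ratio of relative frequencies stands in for
    the unnamed metrics mentioned in the abstract.
    """
    domain = Counter(t for doc in domain_texts for t in tokenize(doc))
    general = Counter(t for doc in general_texts for t in tokenize(doc))
    n_domain = sum(domain.values())
    n_general = sum(general.values())

    scores = {}
    for word, count in domain.items():
        rel_domain = count / n_domain
        # Smoothing avoids division by zero for words unseen in general text.
        rel_general = (general[word] + smoothing) / (n_general + smoothing)
        scores[word] = rel_domain / rel_general
    return scores


def top_keywords(scores, k=5):
    """Return the k highest-scoring words, ties broken alphabetically."""
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:k]]
```

Under this sketch, words that are frequent in the domain collection but rare or absent in general text score highest, while common function words such as "the" score near or below 1. Candidate keywords ranked this way could then seed a collocation analysis.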

Cite

Gillam, L., & Ahmad, K. (2005). Pattern mining across domain-specific text collections. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3587 LNAI, pp. 570–579). Springer Verlag. https://doi.org/10.1007/11510888_56
