Term ranking adaptation to the domain: Genetic algorithm-based optimisation of the C-value

Thierry Hamon; Christopher Engström; Sergei Silvestrov

Journal Article

Term ranking adaptation to the domain: Genetic algorithm-based optimisation of the C-value

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8686 71-83

DOI: 10.1007/978-3-319-10888-9_8

3Citations

4Readers

Get full text

Abstract

Term extraction methods based on linguistic rules have been proposed to help the terminology building from corpora. As they face the difficulty of identifying the relevant terms among the noun phrases extracted, statistical measures have been proposed. However, the term selection results may depend on corpus and strong assumptions reflecting specific terminological practice. We tackle this problem by proposing a parametrised C-Value which optimally considers the length and the syntactic roles of the nested terms thanks to a genetic algorithm. We compare its impact on the ranking of terms extracted from three corpora. Results show average precision increased by 9% above the frequencybased ranking and by 12% above the C-Value-based ranking.

Author supplied keywords

Cite

CITATION STYLE

APA

Hamon, T., Engström, C., & Silvestrov, S. (2014). Term ranking adaptation to the domain: Genetic algorithm-based optimisation of the C-value. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8686, 71–83. https://doi.org/10.1007/978-3-319-10888-9_8

Term ranking adaptation to the domain: Genetic algorithm-based optimisation of the C-value

Abstract

Author supplied keywords

Cite

Register to see more suggestions