Part-of-speech tagging with evolutionary algorithms

Lourdes Araujo

Conference Proceedings

Part-of-speech tagging with evolutionary algorithms

Araujo L

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2002) 2276 230-239

DOI: 10.1007/3-540-45715-1_21

22Citations

8Readers

Get full text

Abstract

This paper presents a part-of-speech tagger based on a genetic algorithm which, after the “evolution” of a population of sequences of tags for the words in the text, selects the best individual as solution. The paper describes the main issues arising in the algorithm, such as the chromosome representation and the evaluation and design of genetic operators for crossover and mutation. A probabilistic model, based on the context of each word (the tags of the surrounding words) has been devised in order to define the fitness function. The model has been implemented and different issues have been investigated: size of the training corpus, effect of the context size, and parameters of the evolutionary algorithm, such as population size and crossover and mutation rates. The accuracy obtained with this method is comparable to that of other probabilistic approaches, but evolutionary algorithms are more efficient in obtaining the results.

Cite

CITATION STYLE

APA

Araujo, L. (2002). Part-of-speech tagging with evolutionary algorithms. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2276, pp. 230–239). Springer Verlag. https://doi.org/10.1007/3-540-45715-1_21

Part-of-speech tagging with evolutionary algorithms

Abstract

Cite

Register to see more suggestions