Per-node optimization of finite-state mechanisms for natural language processing

Alexander Troussov; Brian O'Donovan; Seppo Koskenniemi; Nikolay Glushnev

Journal Article

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2003) 2588 221-224

DOI: 10.1007/3-540-36456-0_22

2 Citations

12 Readers

Abstract

Finite-state processing is typically based on structures that allow for efficient indexing and sequential search. However, this "rigid" framework has several disadvantages when used in natural language processing, especially for non-alphabetical languages. The solution is to systematically introduce polymorphic programming techniques that are adapted to particular cases. In this paper we describe the structure of a morphological dictionary implemented with finite-state automata using variable or polymorphic node formats. Each node is assigned a format from a predefined set reflecting its utility in corpora processing as measured by a number of graph theoretic metrics and statistics. Experimental results demonstrate that this approach permits a 52% increase in the performance of dictionary look-up. © Springer-Verlag Berlin Heidelberg 2003.
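The idea of per-node formats can be illustrated with a minimal sketch. The abstract does not specify the paper's actual node formats or selection metrics, so everything below is an assumption made for illustration: two hypothetical formats (`SparseNode`, which binary-searches a sorted list of arc labels, and `DenseNode`, which uses direct table lookup), a plain trie instead of a minimized automaton, and out-degree with a hypothetical `DENSE_THRESHOLD` as a stand-in for the corpus-derived metrics the authors mention.

```python
from bisect import bisect_left


class SparseNode:
    """Hypothetical format for nodes with few arcs: sorted labels, binary search."""
    __slots__ = ("labels", "targets", "final")

    def __init__(self, arcs, final):
        items = sorted(arcs.items())
        self.labels = [ch for ch, _ in items]
        self.targets = [node for _, node in items]
        self.final = final

    def step(self, ch):
        i = bisect_left(self.labels, ch)
        if i < len(self.labels) and self.labels[i] == ch:
            return self.targets[i]
        return None


class DenseNode:
    """Hypothetical format for nodes with many arcs: direct table lookup.
    A dict stands in for an array indexed by character code."""
    __slots__ = ("table", "final")

    def __init__(self, arcs, final):
        self.table = dict(arcs)
        self.final = final

    def step(self, ch):
        return self.table.get(ch)


DENSE_THRESHOLD = 4  # assumed cut-off, not taken from the paper


def build(words, threshold=DENSE_THRESHOLD):
    # Step 1: build an ordinary trie out of nested dicts.
    root = {"arcs": {}, "final": False}
    for word in words:
        node = root
        for ch in word:
            node = node["arcs"].setdefault(ch, {"arcs": {}, "final": False})
        node["final"] = True

    # Step 2: convert bottom-up, choosing a format for each node individually.
    # Out-degree is used here as a simple proxy for a per-node utility metric.
    def freeze(raw):
        arcs = {ch: freeze(child) for ch, child in raw["arcs"].items()}
        node_cls = DenseNode if len(arcs) >= threshold else SparseNode
        return node_cls(arcs, raw["final"])

    return freeze(root)


def lookup(root, word):
    # The caller never needs to know which format a node uses.
    node = root
    for ch in word:
        node = node.step(ch)
        if node is None:
            return False
    return node.final
```

With the word list `["cat", "car", "cab", "can", "dog"]` and the threshold above, the root (two arcs) would be stored as a `SparseNode`, while the node reached by `ca` (four arcs) would be stored as a `DenseNode`; `lookup` behaves identically in both cases because each format exposes the same `step` method.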

Citation (APA)

Troussov, A., O’Donovan, B., Koskenniemi, S., & Glushnev, N. (2003). Per-node optimization of finite-state mechanisms for natural language processing. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2588, 221–224. https://doi.org/10.1007/3-540-36456-0_22
