A composition algorithm of compact finite-state super transducers for grapheme-to-phoneme conversion

Žiga Golob; Jerneja Žganec Gros; Vitomir Štruc; France Mihelič; Simon Dobrišek

Book Chapter

A composition algorithm of compact finite-state super transducers for grapheme-to-phoneme conversion

Springer Verlag, (2016), 375-382

DOI: 10.1007/978-3-319-45510-5_43

2Citations

1Readers

Get full text

Abstract

Minimal deterministic finite-state transducers (MDFSTs) are powerful models that can be used to represent pronunciation dictionaries in a compact form. Intuitively, we would assume that by increasing the size of the dictionary, the size of the MDFSTs would increase as well. However, as we show in the paper, this intuition does not hold for highly inflected languages. With such languages the size of the MDFSTs begins to decrease once the number of words in the represented dictionary reaches a certain threshold. Motivated by this observation, we have developed a new type of FST, called a finite-state super transducer (FSST), and show experimentally that the FSST is capable of representing pronunciation dictionaries with fewer states and transitions than MDFSTs. Furthermore, we show that (unlike MDFSTs) our FSSTs can also accept words that are not part of the represented dictionary. The phonetic transcriptions of these out-of-dictionary words may not always be correct, but the observed error rates are comparable to the error rates of the traditional methods for grapheme-to-phoneme conversion.

Author supplied keywords

Cite

CITATION STYLE

APA

Golob, Ž., Gros, J. Ž., Štruc, V., Mihelič, F., & Dobrišek, S. (2016). A composition algorithm of compact finite-state super transducers for grapheme-to-phoneme conversion. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9924 LNCS, pp. 375–382). Springer Verlag. https://doi.org/10.1007/978-3-319-45510-5_43

A composition algorithm of compact finite-state super transducers for grapheme-to-phoneme conversion

Abstract

Author supplied keywords

Cite

Register to see more suggestions