This paper describes the motivation and the method that we usedfor aligning digital spoken books, and the results obtained both at a word level and at a phone level. This alignment will allow specific access interfaces for persons with special needs, and also tools for easily detecting and indexing units (words, sentences, topics) in the spoken books. The tool was implemented in a Weighted Finite State Transducer framework, which provides an efficient way to combine different types of knowledge sources, such as alternative pronunciation rules. With this tool, a 2-hour long spoken book was alignedin a single step in much less than real time.
CITATION STYLE
Serralheiro, A., Caseiro, D., Meinedo, H., & Trancoso, I. (2002). Word alignment in digital talking books using WFSTs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2458, pp. 508–515). Springer Verlag. https://doi.org/10.1007/3-540-45747-x_37
Mendeley helps you to discover research relevant for your work.