Abstract
This paper presents an algorithm for learning the probabilities of optional phonological rules from corpora. The algorithm is based on using a speech recognition system to discover the surface pronunciations of words in speech corpora; using an automatic system obviates expensive phonetic labeling by hand. We describe the details of our algorithm and show the probabilities the system has learned for ten common phonological rules which model reductions and coarticulation effects. These probabilities were derived from a corpus of 7203 sentences of read speech from the Wall Street Journal, and are shown to be a reasonably close match to probabilities from phonetically hand-transcribed data (TIMIT). Finally, we analyze the probability differences between rule use in male versus female speech, and suggest that the differences are caused by differing average rates of speech.
Cite
CITATION STYLE
Tajchman, G., Jurafsky, D., & Fosler, E. (1995). Learning phonological rule probabilities from speech corpora with exploratory computational phonology. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1995-June, pp. 1–8). Association for Computational Linguistics (ACL). https://doi.org/10.3115/981658.981659
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.