Finite-state transducers (FSTs) can implement inter-dialectal transliteration very efficiently. We illustrate this on the Hindi and Urdu language pair. FSTs can also be used for translation between surface-close languages. We introduce a universal intermediate transcription (UIT) for the same pair, based on their common phonetic repository, designed so that it can be extended to other languages such as Arabic, Chinese, English, and French. We describe a transliteration model based on FSTs and UIT, and evaluate it on Hindi and Urdu corpora. © 2008. Licensed under the Creative Commons.
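To make the idea of transliterating through an intermediate transcription concrete, the following is a minimal sketch, not the paper's model. It chains two toy single-state transducers: Devanagari to a pivot symbol, then pivot to Urdu script. The class `FST`, the helper `single_state`, the function `hindi_to_urdu`, and the ASCII pivot symbols are all hypothetical names chosen for illustration; the pivot symbols are not the actual UIT encoding, and only six consonants are covered (vowel signs and context-dependent rules are ignored).

```python
from typing import Dict, FrozenSet, Tuple

# An arc maps (state, input symbol) to (next state, output symbol).
Arcs = Dict[Tuple[int, str], Tuple[int, str]]


class FST:
    """Toy deterministic finite-state transducer over single characters."""

    def __init__(self, arcs: Arcs, start: int = 0,
                 finals: FrozenSet[int] = frozenset({0})) -> None:
        self.arcs = arcs
        self.start = start
        self.finals = finals

    def transduce(self, text: str) -> str:
        state = self.start
        output = []
        for symbol in text:
            if (state, symbol) not in self.arcs:
                raise ValueError(f"no arc for {symbol!r} in state {state}")
            state, emitted = self.arcs[(state, symbol)]
            output.append(emitted)
        if state not in self.finals:
            raise ValueError("input ended in a non-final state")
        return "".join(output)


def single_state(mapping: Dict[str, str]) -> FST:
    """Build a one-state transducer from a character-to-character table."""
    return FST({(0, src): (0, dst) for src, dst in mapping.items()})


# Devanagari consonant -> hypothetical pivot symbol (NOT the real UIT).
HINDI_TO_PIVOT = {
    "\u0915": "k",  # Devanagari ka
    "\u0932": "l",  # Devanagari la
    "\u092e": "m",  # Devanagari ma
    "\u0928": "n",  # Devanagari na
    "\u092c": "b",  # Devanagari ba
    "\u0930": "r",  # Devanagari ra
}

# Hypothetical pivot symbol -> Urdu letter.
PIVOT_TO_URDU = {
    "k": "\u06a9",  # Urdu kaf
    "l": "\u0644",  # lam
    "m": "\u0645",  # mim
    "n": "\u0646",  # nun
    "b": "\u0628",  # be
    "r": "\u0631",  # re
}

hindi_fst = single_state(HINDI_TO_PIVOT)
urdu_fst = single_state(PIVOT_TO_URDU)


def hindi_to_urdu(text: str) -> str:
    """Chain the two transducers through the pivot transcription."""
    return urdu_fst.transduce(hindi_fst.transduce(text))
```

Routing both scripts through one pivot means that adding a further language would need only a mapping to and from the pivot rather than one transducer per language pair, which is the kind of extensibility the abstract attributes to UIT. A realistic system would need multi-state, context-sensitive rules that this sketch omits.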
Malik, M. G. A., Boitet, C., & Bhattacharyya, P. (2008). Hindi Urdu machine transliteration using finite-state transducers. In Coling 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference (Vol. 1, pp. 537–544). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1599081.1599149