Identifying the language of an e-text is complicated by the existence of a number of character sets for a single language. We present a language identification system that uses the Multivariate Analysis (MVA) for dimensionality reduction and classification. We compare its performance with existing schemes viz., the N-grams and compression. © Springer-Verlag Berlin Heidelberg 2005.
CITATION STYLE
Vinosh Babu, J., & Baskaran, S. (2005). Automatic language identification using Multivariate analysis. In Lecture Notes in Computer Science (Vol. 3406, pp. 789–792). Springer Verlag. https://doi.org/10.1007/978-3-540-30586-6_89
Mendeley helps you to discover research relevant for your work.