In a multi-language Information Retrieval setting, the knowledge about the language of a user query is important for further processing. Hence, we compare the performance of some typical approaches for language detection on very short, query-style texts. The results show that already for single words an accuracy of more than 80% can be achieved, for slightly longer texts we even observed accuracy values close to 100%. © 2010 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Gottron, T., & Lipka, N. (2010). A comparison of language identification approaches on short, query-style texts. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5993 LNCS, pp. 611–614). Springer Verlag. https://doi.org/10.1007/978-3-642-12275-0_59
Mendeley helps you to discover research relevant for your work.