Recently, Chinglish in Web Text is one of new language phenomena, and has brought some problems for automatic analysis of natural language processing. This paper builds a small-scale open Chinglish corpus for NLP, then analyzes the linguistic characteristics of Chinglish in Web Text from two aspects: vocabulary and grammar, as well as Chinese-English translation of phrases and sentences. The study can be helpful for natural language processing, such as machine translation, sentiment analysis and information extraction.
CITATION STYLE
Chen, B., Lyu, C., & Ji, Z. (2018). Study on Chinglish in Web Text for Natural Language Processing. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10709 LNAI, pp. 533–539). Springer Verlag. https://doi.org/10.1007/978-3-319-73573-3_48
Mendeley helps you to discover research relevant for your work.