Chinese learners of English often struggle to tell spoken English from written English, a distinction that matters for both learning and using the language. To alleviate this problem, we propose to automatically classify English sentences with bilingual information into these two categories. Building on text categorization techniques, we explore a variety of features, including words, statistics, and their combinations, and find that a classification accuracy of nearly 95% can be achieved in the open test using Chinese characters + sentence length + average syllable number, or other similar combinations. © Springer-Verlag Berlin Heidelberg 2013.
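As a rough illustration of the best-performing feature combination named above (Chinese characters + sentence length + average syllable number), the following minimal Python sketch builds a feature dictionary for one English sentence paired with its Chinese translation. The abstract does not give the exact feature definitions or the classifier, so everything here is an assumption: the function names `count_syllables` and `extract_features` are hypothetical, sentence length is taken as the number of English word tokens, syllables are estimated with a simple vowel-run heuristic, and Chinese characters are treated as a bag of characters from the translation.

```python
import re
from collections import Counter


def count_syllables(word):
    """Estimate syllables as the number of vowel runs (a crude heuristic,
    not the paper's method); every word counts as at least one syllable."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def extract_features(english, chinese):
    """Build a feature dict from an English sentence and its Chinese translation.

    Assumed feature definitions (not taken from the paper):
      - "zh=<char>": count of each Chinese character in the translation
      - "sentence_length": number of English word tokens
      - "avg_syllables": mean estimated syllables per English word
    """
    words = re.findall(r"[A-Za-z']+", english)
    features = Counter()

    # Bag of Chinese characters, restricted to the CJK Unified Ideographs block
    # so that punctuation and Latin letters are ignored.
    for ch in chinese:
        if "\u4e00" <= ch <= "\u9fff":
            features["zh=" + ch] += 1

    features["sentence_length"] = len(words)
    if words:
        total = sum(count_syllables(w) for w in words)
        features["avg_syllables"] = total / len(words)
    else:
        features["avg_syllables"] = 0.0
    return dict(features)


if __name__ == "__main__":
    print(extract_features("How are you?", "你好吗？"))
```

Feature dictionaries of this shape could then be passed to any standard text classifier; which classifier the authors used is not stated in the abstract.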
CITATION STYLE
Li, K., Xiong, Z., Zhang, Y., Liu, X., Zhou, M., & Zhang, G. (2013). The Spoken/Written Language classification of English sentences with bilingual information. In Communications in Computer and Information Science (Vol. 400, pp. 370–377). Springer Verlag. https://doi.org/10.1007/978-3-642-41644-6_35