By appearance of social media, people are coming to be able to transmit information easily on a personal level. However, because users of social media generally spend little time on describing information, low-quality texts are transmitted and it blocks the spread of information. On transmitted texts in social media, commas and linefeeds are inserted incorrectly, and it becomes a factor of low-quality texts. This paper proposes a method for automatically formatting Japanese texts in social media. Our method formats texts by inserting commas and line-feeds appropriately. In our method, the positions where commas and linefeeds should be inserted are decided based on machine learning using morphological information, dependency relation and clause boundary information. An experiment using Japanese spoken language texts has shown the effectiveness of our method. © 2011 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Murata, M., Ohno, T., & Matsubara, S. (2011). Automatic text formatting for social media based on linefeed and comma insertion. In Smart Innovation, Systems and Technologies (Vol. 11 SIST, pp. 285–294). https://doi.org/10.1007/978-3-642-22158-3_28
Mendeley helps you to discover research relevant for your work.