Improved identification of tweets that mention books: Selection of effective features

Shuntaro Yada; Kyo Kageura

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 10075 LNCS 150-156

DOI: 10.1007/978-3-319-49304-6_19

0 Citations

4 Readers

Abstract

In this paper, we assessed the effectiveness of different types of features for identifying tweets that mention books among tweets containing strings identical to full book titles. In previous work, bag-of-words features were taken from the context of individual tweets. While performance was reasonable, we identified room for improvement in feature extraction. We proposed additional types of features, such as words appearing in the profiles of tweet authors, POS tags of mentioned book titles, and bibliographic elements within tweets (e.g., authors and publishers). We conducted a grid search over all combinations of these feature sets and observed performance improvements suitable for practical applications.
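
The grid search over feature-set combinations mentioned in the abstract could be enumerated roughly as follows. This is a minimal sketch, not the authors' implementation: the feature-set identifiers are paraphrased from the abstract, the function names are hypothetical, and `evaluate` stands in for whatever classifier training and scoring procedure the paper used. Whether the original bag-of-words features were held fixed in every combination is not stated in the abstract, so the sketch simply enumerates every non-empty subset.

```python
from itertools import combinations

# Feature-set names paraphrased from the abstract; the identifiers are illustrative.
FEATURE_SETS = ("tweet_bow", "profile_words", "title_pos_tags", "biblio_elements")


def all_feature_combinations(feature_sets):
    """Yield every non-empty combination of the given feature sets."""
    for size in range(1, len(feature_sets) + 1):
        for combo in combinations(feature_sets, size):
            yield combo


def best_combination(feature_sets, evaluate):
    """Return the (combination, score) pair with the highest score.

    `evaluate` is a caller-supplied function (an assumption of this sketch)
    that would train and score a classifier using only the listed feature sets.
    """
    scored = ((combo, evaluate(combo)) for combo in all_feature_combinations(feature_sets))
    return max(scored, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Placeholder scorer: it only counts feature sets and measures nothing real.
    combo, score = best_combination(FEATURE_SETS, evaluate=len)
    print(combo, score)
```

With four feature sets there are 2^4 - 1 = 15 non-empty combinations, so exhaustive evaluation is cheap as long as each individual training run is.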

Cite

Yada, S., & Kageura, K. (2016). Improved identification of tweets that mention books: Selection of effective features. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10075 LNCS, pp. 150–156). Springer Verlag. https://doi.org/10.1007/978-3-319-49304-6_19
