Syntax and Themes: How Context Free Grammar Rules and Semantic Word Association Influence Book Success

0Citations
Citations of this article
41Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this paper, we attempt to improve upon the state-of-the-art in predicting a novel's success by modeling the lexical semantic relationships of its contents. We created the largest dataset used in such a project containing lexical data from 17,962 books from Project Gutenberg. We utilized domain specific feature reduction techniques to implement the most accurate models to date for predicting book success, with our best model achieving an average accuracy of 94.0%. By analyzing the model parameters, we extracted the successful semantic relationships from books of 12 different genres. We finally mapped those semantic relations to a set of themes, as defined in Roget's Thesaurus and discovered the themes that successful books of a given genre prioritize. At the end of the paper, we further showed that our model demonstrate similar performance for book success prediction even when Goodreads rating was used instead of download count to measure success.

Cite

CITATION STYLE

APA

Gorelick, H., Bijoy, B. S., Saba, S. J., Kar, S., Islam, M. S., & Amin, M. R. (2021). Syntax and Themes: How Context Free Grammar Rules and Semantic Word Association Influence Book Success. In International Conference Recent Advances in Natural Language Processing, RANLP (pp. 463–474). Incoma Ltd. https://doi.org/10.26615/978-954-452-072-4_053

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free