Abstract
Existing research on sentiment analysis has mainly used data curated from a limited set of geographical regions and demographics (e.g., USA, UK, China), owing to commercial interest and the availability of review data. Since users' attitudes and preferences can be affected by numerous sociocultural factors and demographic characteristics, annotated review datasets that represent diverse demographics are needed. In this work, we first construct a review dataset, BanglaRestaurant, that contains over 2300 customer reviews of a number of Bangladeshi restaurants. Then, we present a hybrid methodology that improves on the best-performing lexicon-based and machine learning (ML) based classifiers without using any labeled data. Finally, we investigate how the demography of users (i.e., geography and nativeness in English) affects the linguistic characteristics of reviews by contrasting two datasets, BanglaRestaurant and Yelp. The comparative results demonstrate the efficacy of the proposed hybrid approach. The data analysis reveals that demography plays an influential role in the linguistic aspects of reviews.
Cite
Sazzed, S. (2021). A Hybrid Approach of Opinion Mining and Comparative Linguistic Analysis of Restaurant Reviews. In International Conference Recent Advances in Natural Language Processing, RANLP (pp. 1281–1288). Incoma Ltd. https://doi.org/10.26615/978-954-452-072-4_144