Analysis of Amazon Product Reviews Using Big Data- Apache Pig Tool

  • Pal Singh A
  • Singh G
N/ACitations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

We live in the era of digital technologies where data is increasing day by day at a very high rate. The data is further popularly classified as ‘Big Data’ because of its velocity, veracity, variety and its huge volume. This data could be unstructured, semi-structured or structured as it is divergent in nature. In this work, we would assess various categories of Amazon Product Reviews, the large datasets that contain around 144 million reviews in total. The datasets consists of Product reviews collected from Amazon, each having various numbers of attributes of 11 different categories. The motive of this work is to find and compare the ratings of the products during the lifespan of the product reviews. Another goal of this work is to help Amazon regarding the listing of the products in their database. This work aims to relate user’s ratings and reviews to discover how beneficial and good a product is [6]. User ratings are collected and are analyzed based on different categories (datasets) which gives an insight as to which product performs good and what are the problems associated to a certain non-performing product.

Cite

CITATION STYLE

APA

Pal Singh, A., & Singh, G. (2019). Analysis of Amazon Product Reviews Using Big Data- Apache Pig Tool. International Journal of Information Engineering and Electronic Business, 11(1), 11–18. https://doi.org/10.5815/ijieeb.2019.01.02

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free