Data Cleaning Techniques for Large Data Sets

  • Bansal Y
  • et al.
N/ACitations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In today’s emerging era of data science where data plays a huge role for accurate decision making process it is very important to work on cleaned and irredundant data. As data is gathered from multiple sources it might contain anomalies, missing values etc. which needs to be removed this process is called data pre-processing. In this paper we perform data pre-processing on news popularity data set where extraction , transform and loading (ETL) is done .The outcome of the process is cleaned and refined news data set which can be used to do further analysis for knowledge discovery on popularity of news . Refined data give accurate predictions and can be better utilized in decision making process.

Cite

CITATION STYLE

APA

Bansal, Y., & Chopra*, A. (2020). Data Cleaning Techniques for Large Data Sets. International Journal of Recent Technology and Engineering (IJRTE), 8(6), 4453–4456. https://doi.org/10.35940/ijrte.e6938.038620

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free