Abstract
In today’s emerging era of data science where data plays a huge role for accurate decision making process it is very important to work on cleaned and irredundant data. As data is gathered from multiple sources it might contain anomalies, missing values etc. which needs to be removed this process is called data pre-processing. In this paper we perform data pre-processing on news popularity data set where extraction , transform and loading (ETL) is done .The outcome of the process is cleaned and refined news data set which can be used to do further analysis for knowledge discovery on popularity of news . Refined data give accurate predictions and can be better utilized in decision making process.
Cite
CITATION STYLE
Bansal, Y., & Chopra*, A. (2020). Data Cleaning Techniques for Large Data Sets. International Journal of Recent Technology and Engineering (IJRTE), 8(6), 4453–4456. https://doi.org/10.35940/ijrte.e6938.038620
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.