Feature selection for machine learning in big data

1Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We are in the information age there by collecting very huge volume of data from diverse sources in structured, unstructured and semi structured form ranging to petabytes to exabytes of data. Data is an asset as valuable knowledge and information is hidden in such massive volumes of data. Data analytics is required to have a deeper insights and identify fine grained patterns so as to make accurate predictions enabling the improvement of decision making. Extracting knowledge from data is done by data analytics, Machine learning forms the core of it. The increase in the dimensionality of data both in terms of number of tuples and also in terms of number of features poses several challenges to the machine learning algorithms. Preprocessing of data is done as a prior step to machine learning, so feature selection is done as a preprocessing step to have the dimensionality reduction of the data and thereby removing the irrelevant features and improving the efficiency and accuracy of a machine learning algorithm. In this paper we are studying various feature selection mechanisms and analyze them whether they can be adopted to sentiment analysis of big data.

Cite

CITATION STYLE

APA

Kalpana, K., Sunil Vijaya Kumar, G., & Madhavi, K. (2019). Feature selection for machine learning in big data. International Journal of Innovative Technology and Exploring Engineering, 8(6 Special Issue 4), 332–335. https://doi.org/10.35940/ijitee.F1067.0486S419

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free