Opportunities and challenges of feature selection methods for high dimensional data: A review

Abstract

Nowadays, organizations collect huge volumes of data without knowing whether it will be useful. The rapid development of the Internet helps organizations capture data in many different formats through the Internet of Things (IoT), social media, and other disparate sources. The dimensionality of datasets increases day by day at an extraordinary rate, resulting in large-scale, high-dimensional datasets. This paper reviews the opportunities and challenges of feature selection for processing high-dimensional data with reduced complexity and improved accuracy. In the modern big data world, feature selection plays a significant role in reducing dimensionality and overfitting in the learning process. Researchers have proposed many feature selection methods for obtaining the most relevant features, especially from big datasets, so that learning results remain accurate without degradation in performance. The paper discusses the importance of feature selection, basic feature selection approaches, centralized and distributed big data processing using Hadoop and Spark, and the challenges of feature selection, and it summarizes related research. The review concludes that big data analysis combined with feature selection improves learning accuracy.

Cite

Subbiah, S. S., & Chinnappan, J. (2021, February 1). Opportunities and challenges of feature selection methods for high dimensional data: A review. Ingenierie Des Systemes d’Information. International Information and Engineering Technology Association. https://doi.org/10.18280/isi.260107
