The Art of Feature Engineering: Essentials for Machine Learning

78Citations
Citations of this article
65Readers
Mendeley users who have this article in their library.
Get full text

Abstract

When working with a data set, machine learning engineers might train a model but find that the results are not as good as they need. To get better results, they can try to improve the model or collect more data, but there is another avenue: feature engineering. The feature engineering process can help improve results by modifying the data’s features to better capture the nature of the problem. This process is partly an art and partly a palette of tricks and recipes. This practical guide to feature engineering is an essential addition to any data scientist’s or machine learning engineer’s toolbox, providing new ideas on how to improve the performance of a machine learning solution. Beginning with the basic concepts and techniques of feature engineering, the text builds up to a unique cross-domain approach that spans data on graphs, texts, time series and images, with fully worked-out case studies. Key topics include binning, out-of-fold estimation, feature selection, dimensionality reduction and encoding variable-length data. The full source code for the case studies is available on a companion website as Python Jupyter notebooks.

References Powered by Scopus

Random forests

96757Citations
29772Readers

This article is free to access.

Histograms of oriented gradients for human detection

30675Citations
18112Readers
Get full text

A Computational Approach to Edge Detection

25262Citations
7194Readers
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Duboue, P. (2020). The Art of Feature Engineering: Essentials for Machine Learning. The Art of Feature Engineering: Essentials for Machine Learning (pp. 1–274). Cambridge University Press. https://doi.org/10.1017/9781108671682

Readers over time

‘20‘21‘22‘23‘24‘2505101520

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 8

42%

Lecturer / Post doc 4

21%

Researcher 4

21%

Professor / Associate Prof. 3

16%

Readers' Discipline

Tooltip

Computer Science 15

68%

Engineering 3

14%

Materials Science 2

9%

Economics, Econometrics and Finance 2

9%

Save time finding and organizing research with Mendeley

Sign up for free
0