A structured data preprocessing method based on hybrid encoding

Abstract

With the rapid development of the civil aviation industry, passenger throughput keeps increasing, and flight delays are becoming an increasingly serious problem. For flight delay prediction on large-scale data, deep learning methods can be applied to obtain high-precision predictions. Because data preprocessing is one of the most important steps in such a pipeline, this paper proposes a preprocessing method based on hybrid encoding. First, because weather has a large impact on flight delays, the flight data and meteorological data are fused on their associated primary key. Then, the fused data are encoded according to data type: Min-Max encoding is used for continuous features, and CatBoost encoding is used for discrete features. Finally, the preprocessed data set is fed into the deep convolutional neural network ResNet to verify the effect of the method. The experimental results show that the prediction accuracy for flight delay level can reach 94.02% on the structured data set after hybrid encoding.
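The fusion and encoding steps described in the abstract can be illustrated with a minimal pandas sketch. This is not the authors' implementation, which the abstract does not show. The column names (`airport`, `hour`, `carrier`, `wind_speed`, `delay_level`), the toy values, the composite key, and the smoothing weight `a` are all assumptions made for illustration. The categorical step is a simplified ordered target statistic in the style of CatBoost encoding: it uses the existing row order where CatBoost would use a random permutation, and takes the global target mean as the prior.

```python
import pandas as pd


def min_max_scale(col: pd.Series) -> pd.Series:
    """Rescale a continuous feature linearly to the range [0, 1]."""
    lo, hi = col.min(), col.max()
    if hi == lo:  # constant column: avoid division by zero
        return pd.Series(0.0, index=col.index)
    return (col - lo) / (hi - lo)


def ordered_target_encode(cat: pd.Series, target: pd.Series, a: float = 1.0) -> pd.Series:
    """CatBoost-style ordered target statistic (simplified, assumed form).

    Each row is encoded using only the targets of EARLIER rows that share
    its category, smoothed towards the global target mean with weight `a`.
    """
    prior = target.mean()
    grouped = target.groupby(cat)
    prev_sum = grouped.cumsum() - target  # target sum over earlier rows only
    prev_cnt = grouped.cumcount()         # number of earlier rows in the category
    return (prev_sum + prior * a) / (prev_cnt + a)


# Hypothetical flight and weather tables sharing the key (airport, hour).
flights = pd.DataFrame({
    "airport": ["PEK", "PEK", "CAN", "CAN"],
    "hour": [8, 9, 8, 9],
    "carrier": ["CA", "MU", "CA", "CA"],
    "delay_level": [0, 1, 1, 0],
})
weather = pd.DataFrame({
    "airport": ["PEK", "PEK", "CAN", "CAN"],
    "hour": [8, 9, 8, 9],
    "wind_speed": [2.0, 6.0, 4.0, 10.0],
})

# Step 1: fuse flight and meteorological data on the shared key.
fused = flights.merge(weather, on=["airport", "hour"], how="inner")

# Step 2: encode by data type (continuous vs. discrete).
fused["wind_speed"] = min_max_scale(fused["wind_speed"])
fused["carrier"] = ordered_target_encode(fused["carrier"], fused["delay_level"])

print(fused)
```

Because each row's encoding uses only earlier rows, a row's own label never leaks into its encoded feature, which is the main motivation for ordered target statistics over a plain per-category mean.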

Cite

Liu, C., Yang, L., & Qu, J. (2021). A structured data preprocessing method based on hybrid encoding. In Journal of Physics: Conference Series (Vol. 1738). IOP Publishing Ltd. https://doi.org/10.1088/1742-6596/1738/1/012060
