We consider the problem of training models to predict sequential processes. We use two econometric datasets to demonstrate how different losses and learning algorithms alter the predictive power for a variety of state-of-the-art models. We investigate how the choice of loss function impacts model training and find that no single algorithm or loss function results in optimal predictive performance. For small datasets, neural models prove especially sensitive to training parameters, including choice of loss function and pre-processing steps. We find that a recursively-applied artificial neural network trained under L1 loss performs best under many different metrics on a national retail sales dataset, whereas a differenced autoregressive model trained under L1 loss performs best under a variety of metrics on an e-commerce dataset. We note that different training metrics and processing steps result in appreciably different performance across all model classes and argue for an adaptive approach to model fitting.
CITATION STYLE
Burkhart, M. C., & Modarresi, K. (2019). Determining Adaptive Loss Functions and Algorithms for Predictive Models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11537 LNCS, pp. 595–607). Springer Verlag. https://doi.org/10.1007/978-3-030-22741-8_42
Mendeley helps you to discover research relevant for your work.