Extensive evaluation of machine learning models and data preprocessings for Raman modeling in bioprocessing

31Citations
Citations of this article
36Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Raman spectroscopy is a very promising tool for monitoring key analytes in mammalian cell culture fermentations in real time. However, major challenges are associated with this promising technology in aqueous bioprocessing matrixes such as a strong background fluorescence, which is typically addressed by computational preprocessing of the raw Raman spectra. In this study, we present an extensive combinatorial assessment of various machine learning algorithms with numerous popular preprocessing methods on the performance and robustness of Raman-based real-time predictions of key analytes. We show that preprocessing methods have a large influence on algorithmic performance. Furthermore, there is a large variance across the various combinations of preprocessing steps and tested machine learning regression algorithms. We demonstrate that neural networks and random forest regression show very good performance and robustness across different, bioprocess relevant analytes. They significantly outperform partial least squares regression, the most widely used regression algorithm in the field. Overall, this extensive study provides a sound basis for building robust, next-generation models for monitoring analytes in real time based on Raman spectroscopy.

Cite

CITATION STYLE

APA

Poth, M., Magill, G., Filgertshofer, A., Popp, O., & Großkopf, T. (2022). Extensive evaluation of machine learning models and data preprocessings for Raman modeling in bioprocessing. Journal of Raman Spectroscopy, 53(9), 1580–1591. https://doi.org/10.1002/jrs.6402

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free