Machine learning in the analysis of biomolecular simulations

Shreyas Kaptan; Ilpo Vattulainen

ArticleOPEN ACCESS

Machine learning in the analysis of biomolecular simulations

Advances in Physics: X

DOI: 10.1080/23746149.2021.2006080

36Citations

59Readers

Abstract

Machine learning has rapidly become a key method for the analysis and organization of large-scale data in all scientific disciplines. In life sciences, the use of machine learning techniques is a particularly appealing idea since the enormous capacity of computational infrastructures generates terabytes of data through millisecond simulations of atomistic and molecular-scale biomolecular systems. Due to this explosion of data, the automation, reproducibility, and objectivity provided by machine learning methods are highly desirable features in the analysis of complex systems. In this review, we focus on the use of machine learning in biomolecular simulations. We discuss the main categories of machine learning tasks, such as dimensionality reduction, clustering, regression, and classification used in the analysis of simulation data. We then introduce the most popular classes of techniques involved in these tasks for the purpose of enhanced sampling, coordinate discovery, and structure prediction. Whenever possible, we explain the scope and limitations of machine learning approaches, and we discuss examples of applications of these techniques.

Author supplied keywords

Cite

CITATION STYLE

APA

Kaptan, S., & Vattulainen, I. (2022). Machine learning in the analysis of biomolecular simulations. Advances in Physics: X. Taylor and Francis Ltd. https://doi.org/10.1080/23746149.2021.2006080

Machine learning in the analysis of biomolecular simulations

Abstract

Author supplied keywords

Cite

Register to see more suggestions