QMugs, quantum mechanical properties of drug-like molecules

76Citations
Citations of this article
98Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Machine learning approaches in drug discovery, as well as in other areas of the chemical sciences, benefit from curated datasets of physical molecular properties. However, there currently is a lack of data collections featuring large bioactive molecules alongside first-principle quantum chemical information. The open-access QMugs (Quantum-Mechanical Properties of Drug-like Molecules) dataset fills this void. The QMugs collection comprises quantum mechanical properties of more than 665 k biologically and pharmacologically relevant molecules extracted from the ChEMBL database, totaling ~2 M conformers. QMugs contains optimized molecular geometries and thermodynamic data obtained via the semi-empirical method GFN2-xTB. Atomic and molecular properties are provided on both the GFN2-xTB and on the density-functional levels of theory (DFT, ωB97X-D/def2-SVP). QMugs features molecules of significantly larger size than previously-reported collections and comprises their respective quantum mechanical wave functions, including DFT density and orbital matrices. This dataset is intended to facilitate the development of models that learn from molecular data on different levels of theory while also providing insight into the corresponding relationships between molecular structure and biological activity.

References Powered by Scopus

Balanced basis sets of split valence, triple zeta valence and quadruple zeta valence quality for H to Rn: Design and assessment of accuracy

22734Citations
N/AReaders
Get full text

Array programming with NumPy

14042Citations
N/AReaders
Get full text

Least Squares Quantization in PCM

11668Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Leveraging large language models for predictive chemistry

109Citations
N/AReaders
Get full text

Structure-based drug design with geometric deep learning

81Citations
N/AReaders
Get full text

SPICE, A Dataset of Drug-like Molecules and Peptides for Training Machine Learning Potentials

70Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Isert, C., Atz, K., Jiménez-Luna, J., & Schneider, G. (2022). QMugs, quantum mechanical properties of drug-like molecules. Scientific Data, 9(1). https://doi.org/10.1038/s41597-022-01390-7

Readers' Seniority

Tooltip

Researcher 25

43%

PhD / Post grad / Masters / Doc 24

41%

Professor / Associate Prof. 8

14%

Lecturer / Post doc 1

2%

Readers' Discipline

Tooltip

Chemistry 31

65%

Biochemistry, Genetics and Molecular Bi... 7

15%

Pharmacology, Toxicology and Pharmaceut... 5

10%

Computer Science 5

10%

Article Metrics

Tooltip
Mentions
News Mentions: 1
Social Media
Shares, Likes & Comments: 6

Save time finding and organizing research with Mendeley

Sign up for free