Augmenting a training dataset of the generative diffusion model for molecular docking with artificial binding pockets

5Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.

Abstract

This study introduces the PocketCFDM generative diffusion model, aimed at improving the prediction of small molecule poses in the protein binding pockets. The model utilizes a novel data augmentation technique, involving the creation of numerous artificial binding pockets that mimic the statistical patterns of non-bond interactions found in actual protein-ligand complexes. An algorithmic method was developed to assess and replicate these interaction patterns in the artificial binding pockets built around small molecule conformers. It is shown that the integration of artificial binding pockets into the training process significantly enhanced the model's performance. Notably, PocketCFDM surpassed DiffDock in terms of non-bond interaction and steric clash numbers, and the inference speed. Future developments and optimizations of the model are discussed. The inference code and final model weights of PocketCFDM are accessible publicly via the GitHub repository: https://github.com/vtarasv/pocket-cfdm.git.

Cite

CITATION STYLE

APA

Voitsitskyi, T., Bdzhola, V., Stratiichuk, R., Koleiev, I., Ostrovsky, Z., Vozniak, V., … Starosyla, S. (2024). Augmenting a training dataset of the generative diffusion model for molecular docking with artificial binding pockets. RSC Advances, 14(2), 1341–1353. https://doi.org/10.1039/d3ra08147h

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free