Abstract
Motivation: Molecular core structures and R-groups are essential concepts in drug development. Integration of these concepts with conventional graph pre-training approaches can promote deeper understanding in molecules. We propose MolPLA, a novel pre-training framework that employs masked graph contrastive learning in understanding the underlying decomposable parts in molecules that implicate their core structure and peripheral R-groups. Furthermore, we formulate an additional framework that grants MolPLA the ability to help chemists find replaceable R-groups in lead optimization scenarios. Results: Experimental results on molecular property prediction show that MolPLA exhibits predictability comparable to current state-of-the-art models. Qualitative analysis implicate that MolPLA is capable of distinguishing core and R-group sub-structures, identifying decomposable regions in molecules and contributing to lead optimization scenarios by rationally suggesting R-group replacements given various query core templates.
Cite
CITATION STYLE
Gim, M., Park, J., Park, S., Lee, S., Baek, S., Lee, J., … Kang, J. (2024). MolPLA: a molecular pretraining framework for learning cores, R-groups and their linker joints. Bioinformatics, 40, i369–i380. https://doi.org/10.1093/bioinformatics/btae256
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.