Novel machine learning models to predict endocrine disruption activity for high-throughput chemical screening

8Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.
Get full text

Abstract

An area of ongoing concern in toxicology and chemical risk assessment is endocrine disrupting chemicals (EDCs). However, thousands of legacy chemicals lack the toxicity testing required to assess their respective EDC potential, and this is where computational toxicology can play a crucial role. The US (United States) Environmental Protection Agency (EPA) has run two programs, the Collaborative Estrogen Receptor Activity Project (CERAPP) and the Collaborative Modeling Project for Receptor Activity (CoMPARA) which aim to predict estrogen and androgen activity, respectively. The US EPA solicited research groups from around the world to provide endocrine receptor activity Qualitative (or Quantitative) Structure Activity Relationship ([Q]SAR) models and then combined them to create consensus models for different toxicity endpoints. Random Forest (RF) models were developed to cover a broader range of substances with high predictive capabilities using large datasets from CERAPP and CoMPARA for estrogen and androgen activity, respectively. By utilizing simple descriptors from open-source software and large training datasets, RF models were created to expand the domain of applicability for predicting endocrine disrupting activity and help in the screening and prioritization of extensive chemical inventories. In addition, RFs were trained to conservatively predict the activity, meaning models are more likely to make false-positive predictions to minimize the number of False Negatives. This work presents twelve binary and multi-class RF models to predict binding, agonism, and antagonism for estrogen and androgen receptors. The RF models were found to have high predictive capabilities compared to other in silico modes, with some models reaching balanced accuracies of 93% while having coverage of 89%. These models are intended to be incorporated into evolving priority-setting workflows and integrated strategies to support the screening and selection of chemicals for further testing and assessment by identifying potential endocrine-disrupting substances.

References Powered by Scopus

Glide: A New Approach for Rapid, Accurate Docking and Scoring. 1. Method and Assessment of Docking Accuracy

7858Citations
N/AReaders
Get full text

Open Babel: An Open chemical toolbox

7388Citations
N/AReaders
Get full text

Endocrine-disrupting chemicals: An Endocrine Society scientific statement

3655Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Advances in QSAR through artificial intelligence and machine learning methods

7Citations
N/AReaders
Get full text

Screening of the Antagonistic Activity of Potential Bisphenol A Alternatives toward the Androgen Receptor Using Machine Learning and Molecular Dynamics Simulation

5Citations
N/AReaders
Get full text

Overcoming barriers to machine learning applications in toxicity prediction

5Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Collins, S. P., & Barton-Maclaren, T. S. (2022). Novel machine learning models to predict endocrine disruption activity for high-throughput chemical screening. Frontiers in Toxicology, 4. https://doi.org/10.3389/ftox.2022.981928

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 4

44%

Researcher 3

33%

Professor / Associate Prof. 2

22%

Readers' Discipline

Tooltip

Pharmacology, Toxicology and Pharmaceut... 3

50%

Chemistry 2

33%

Medicine and Dentistry 1

17%

Save time finding and organizing research with Mendeley

Sign up for free