Data mining and computational modeling of high-throughput screening datasets

Sean Ekins; Alex M. Clark; Krishna Dole; Kellan Gregory; Andrew M. Mcnutt; Anna Coulon Spektor; Charlie Weatherall; Nadia K. Litterman; Barry A. Bunin

Book Chapter

Data mining and computational modeling of high-throughput screening datasets

Humana Press Inc., (2018), 197-221

DOI: 10.1007/978-1-4939-7724-6_14

8Citations

22Readers

Get full text

Abstract

We are now seeing the benefit of investments made over the last decade in high-throughput screening (HTS) that is resulting in large structure activity datasets entering public and open databases such as ChEMBL and PubChem. The growth of academic HTS screening centers and the increasing move to academia for early stage drug discovery suggests a great need for the informatics tools and methods to mine such data and learn from it. Collaborative Drug Discovery, Inc. (CDD) has developed a number of tools for storing, mining, securely and selectively sharing, as well as learning from such HTS data. We present a new web based data mining and visualization module directly within the CDD Vault platform for high-throughput drug discovery data that makes use of a novel technology stack following modern reactive design principles. We also describe CDD Models within the CDD Vault platform that enables researchers to share models, share predictions from models, and create models from distributed, heterogeneous data. Our system is built on top of the Collaborative Drug Discovery Vault Activity and Registration data repository ecosystem which allows users to manipulate and visualize thousands of molecules in real time. This can be performed in any browser on any platform. In this chapter we present examples of its use with public datasets in CDD Vault. Such approaches can complement other cheminformatics tools, whether open source or commercial, in providing approaches for data mining and modeling of HTS data.

Author supplied keywords

Cite

CITATION STYLE

APA

Ekins, S., Clark, A. M., Dole, K., Gregory, K., Mcnutt, A. M., Spektor, A. C., … Bunin, B. A. (2018). Data mining and computational modeling of high-throughput screening datasets. In Methods in Molecular Biology (Vol. 1755, pp. 197–221). Humana Press Inc. https://doi.org/10.1007/978-1-4939-7724-6_14

Data mining and computational modeling of high-throughput screening datasets

Abstract

Author supplied keywords

Cite

Register to see more suggestions