Nonlinear dimensionality reduction for visualizing toxicity data: Distance-based versus topology-based approaches

8Citations
Citations of this article
23Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Over the years, a number of dimensionality reduction techniques have been proposed and used in chemoinformatics to perform nonlinear mappings. In this study, four representatives of nonlinear dimensionality reduction methods related to two different families were analyzed: distance-based approaches (Isomap and Diffusion Maps) and topology-based approaches (Generative Topographic Mapping (GTM) and Laplacian Eigenmaps). The considered methods were applied for the visualization of three toxicity datasets by using four sets of descriptors. Two methods, GTM and Diffusion Maps, were identified as the best approaches, which thus made it impossible to prioritize a single family of the considered dimensionality reduction methods. The intrinsic dimensionality assessment of data was performed by using the Maximum Likelihood Estimation. It was observed that descriptor sets with a higher intrinsic dimensionality contributed maps of lower quality. A new statistical coefficient, which combines two previously known ones, was proposed to automatically rank the maps. Instead of relying on one of the best methods, we propose to automatically generate maps with different parameter values for different descriptor sets. By following this procedure, the maps with the highest values of the introduced statistical coefficient can be automatically selected and used as a starting point for visual inspection by the user. On the map: Dimensionality reduction techniques are of great benefit during the early stages of drug discovery. A comparison between different types of chemography methods has been performed. Methods of intrinsic dimensionality were used for data analysis. Three measures have been applied for quantitative assessment of the quality of the obtained maps, and the efficiency of the different methods has been compared. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Cite

CITATION STYLE

APA

Kireeva, N. V., Ovchinnikova, S. I., Tetko, I. V., Asiri, A. M., Balakin, K. V., & Tsivadze, A. Y. (2014). Nonlinear dimensionality reduction for visualizing toxicity data: Distance-based versus topology-based approaches. ChemMedChem, 9(5), 1047–1059. https://doi.org/10.1002/cmdc.201400027

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free