One of the approaches in the Knowledge Discovery in Databases (KDD) domain is Predictive Toxicology (PT). Its aim is to discover and represent the relationships between the chemical structure of chemical compounds and biological and toxicological processes. The challenges in real toxicology problems are big amount of the chemical descriptors and imperfect data (means noisy, redundant, incomplete, and irrelevant). The main goals in knowledge discovery field are to detect these undesirable proprieties and to eliminate or correct them. This supposes noise reduction, data cleaning and feature selection because the performance of the applied Machine Learning algorithms is strongly related with the quality of the used data. In this paper, we present some of the issues that can be performed for preparing data before the knowledge discovery process begin. © 2008 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Cocu, A., Dumitriu, L., Craciun, M., & Segal, C. (2008). A hybrid approach for data preprocessing in the QSAR problem. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5177 LNAI, pp. 565–572). Springer Verlag. https://doi.org/10.1007/978-3-540-85563-7_72
Mendeley helps you to discover research relevant for your work.