Component selection to optimize distance function learning in complex scientific data sets

0Citations
Citations of this article
15Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Analyzing complex scientific data, e.g., graphs and images, often requires comparison of features: regions on graphs, visual aspects of images and related metadata, some features being relatively more important. The notion of similarity for comparison is typically distance between data objects which could be expressed as distance between features. We refer to distance based on each feature as a component. Weights of components representing relative importance of features could be learned using distance function learning algorithms. However, it is seldom known which components optimize learning, given criteria such as accuracy, efficiency and simplicity. This is the problem we address. We propose and theoretically compare four component selection approaches: Maximal Path Traversal, Minimal Path Traversal, Maximal Path Traversal with Pruning and Minimal Path Traversal with Pruning. Experimental evaluation is conducted using real data from Materials Science, Nanotechnology and Bioinformatics. A trademarked software tool is developed as a highlight of this work. © 2008 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Varde, A., Bique, S., Rundensteiner, E., Brown, D., Liang, J., Sisson, R., … Sayre, B. (2008). Component selection to optimize distance function learning in complex scientific data sets. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5181 LNCS, pp. 269–282). https://doi.org/10.1007/978-3-540-85654-2_27

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free