Choosing the metric: A simple model approach

Damien François; Vincent Wertz; Michel Verleysen

Journal Article

Choosing the metric: A simple model approach

Studies in Computational Intelligence (2011) 358 97-115

DOI: 10.1007/978-3-642-20980-2_3

10Citations

5Readers

Get full text

Abstract

One the earliest challenges a practitioner is faced with when using distance-based tools lies in the choice of the distance, for which there often is very few information to rely on. This chapter proposes to find a compromise between an a priori unoptimized choice (e.g. the Euclidean distance) and a fully-optimized, but computationally expensive, choice made by means of some resampling method. The compromise is found by choosing distance definition according to the results obtained with a very simple regression model-that is one which has few or no meta-parameters-and then use that distance in some other, more elaborate regression model. The rationale behind this heuristic is that the similarity measure which best reflects the notion of similarity with respect to the application should be the optimal one whatever model is used for classification or regression. This idea is tested against nine datasets and five prediction models. The results show that this approach is a reasonable compromise between the default choice and a fully-optimized choice of the metric. © 2011 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

François, D., Wertz, V., & Verleysen, M. (2011). Choosing the metric: A simple model approach. Studies in Computational Intelligence, 358, 97–115. https://doi.org/10.1007/978-3-642-20980-2_3

Choosing the metric: A simple model approach

Abstract

Cite

Register to see more suggestions