Choosing the metric: A simple model approach

10Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

One the earliest challenges a practitioner is faced with when using distance-based tools lies in the choice of the distance, for which there often is very few information to rely on. This chapter proposes to find a compromise between an a priori unoptimized choice (e.g. the Euclidean distance) and a fully-optimized, but computationally expensive, choice made by means of some resampling method. The compromise is found by choosing distance definition according to the results obtained with a very simple regression model-that is one which has few or no meta-parameters-and then use that distance in some other, more elaborate regression model. The rationale behind this heuristic is that the similarity measure which best reflects the notion of similarity with respect to the application should be the optimal one whatever model is used for classification or regression. This idea is tested against nine datasets and five prediction models. The results show that this approach is a reasonable compromise between the default choice and a fully-optimized choice of the metric. © 2011 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

François, D., Wertz, V., & Verleysen, M. (2011). Choosing the metric: A simple model approach. Studies in Computational Intelligence, 358, 97–115. https://doi.org/10.1007/978-3-642-20980-2_3

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free