A symmetrical model applied to interval-valued data containing outliers with heavy-tail distribution

1Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The aim of Symbolic Data Analysis (SDA) is to provide a set of techniques to summarize large data sets into smaller ones called symbolic data tables. This paper considers a kind of symbolic data called Interval-Valued Data (IVD) which stores data intrinsic variability and/or uncertainty from the original data set. Recent works have been proposed to fit the classic linear regression model to symbolic data. However, those works do not consider the presence of symbolic data outliers. Generally, most specialists treat outliers as errors and discard them. Nevertheless, a single interval-data outlier holds significant information which should not be discarded or ignored. This work introduces a prediction method for IVD based on the symmetrical linear regression (SLR) analysis whose response model is less susceptible to the IVD outliers. The model considers a symmetrical distribution for error which allows to the model possibility of applying regular statistical hypothesis tests. © 2009 Springer Berlin Heidelberg.

Cite

CITATION STYLE

APA

Domingues, M. A. O., De Souza, R. M. C. R., & Cysneiros, F. J. A. (2009). A symmetrical model applied to interval-valued data containing outliers with heavy-tail distribution. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5507 LNCS, pp. 19–26). https://doi.org/10.1007/978-3-642-03040-6_3

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free