The aim of Symbolic Data Analysis (SDA) is to provide a set of techniques to summarize large data sets into smaller ones called symbolic data tables. This paper considers a kind of symbolic data called Interval-Valued Data (IVD) which stores data intrinsic variability and/or uncertainty from the original data set. Recent works have been proposed to fit the classic linear regression model to symbolic data. However, those works do not consider the presence of symbolic data outliers. Generally, most specialists treat outliers as errors and discard them. Nevertheless, a single interval-data outlier holds significant information which should not be discarded or ignored. This work introduces a prediction method for IVD based on the symmetrical linear regression (SLR) analysis whose response model is less susceptible to the IVD outliers. The model considers a symmetrical distribution for error which allows to the model possibility of applying regular statistical hypothesis tests. © 2009 Springer Berlin Heidelberg.
CITATION STYLE
Domingues, M. A. O., De Souza, R. M. C. R., & Cysneiros, F. J. A. (2009). A symmetrical model applied to interval-valued data containing outliers with heavy-tail distribution. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5507 LNCS, pp. 19–26). https://doi.org/10.1007/978-3-642-03040-6_3
Mendeley helps you to discover research relevant for your work.