The volume and complexity of data collected by modern applications have grown significantly, leading to increasingly costly operations for both data manipulation and analysis. Sampling is a useful technique for reducing data to a more manageable volume. Uniform sampling has been widely used, but in datasets exhibiting skewed cluster distributions, biased sampling shows better results. This paper presents the Biased Box Sampling (BBS) algorithm, which aims to preserve the skewed tendency of the clusters in the original data. We also present experimental results obtained with the proposed BBS algorithm. © Springer-Verlag Berlin Heidelberg 2007.
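To illustrate the contrast between uniform and density-biased sampling drawn in the abstract, the following is a minimal sketch of a generic grid-based density-biased sampler. It is not the BBS algorithm, whose details the abstract does not give. The function names, the fixed-size grid, and the `exponent` parameter are all assumptions made for illustration: with `exponent=1.0` every point has the same inclusion probability (uniform sampling), while smaller values raise the inclusion probability of points in sparse cells so that small clusters are less likely to vanish from the sample.

```python
import random
from collections import defaultdict


def grid_cell(point, cell_size):
    """Map a point (tuple of floats) to the integer grid cell containing it."""
    return tuple(int(c // cell_size) for c in point)


def density_biased_sample(points, target_size, cell_size=1.0, exponent=0.5, seed=0):
    """Hypothetical grid-based density-biased sampler (illustrative, not BBS).

    A point in a cell holding n points is kept with probability
    target_size * n**(exponent - 1) / sum_j(n_j**exponent), clamped to 1.
    exponent = 1.0 reduces to uniform sampling; exponent = 0.0 aims for the
    same expected number of sampled points from every non-empty cell.
    """
    rng = random.Random(seed)

    # Group points by grid cell to estimate local density.
    cells = defaultdict(list)
    for p in points:
        cells[grid_cell(p, cell_size)].append(p)

    # Normalizing constant so the expected sample size is about target_size
    # (it can fall short when probabilities are clamped to 1).
    norm = sum(len(members) ** exponent for members in cells.values())

    sample = []
    for members in cells.values():
        n = len(members)
        prob = min(1.0, target_size * n ** (exponent - 1) / norm)
        sample.extend(p for p in members if rng.random() < prob)
    return sample
```

Under these assumptions, a dataset with one cell of 1000 points and one cell of 10 points, sampled with `exponent=0.0`, keeps every point of the small cell whenever the clamped probability reaches 1, whereas uniform sampling at the same target size would be expected to keep well under one of them.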
CITATION STYLE
Appel, A. P., Paterlini, A. A., De Sousa, E. P. M., Traina, A. J. M., & Traina, C. (2007). A density-biased sampling technique to improve cluster representativeness. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4702 LNAI, pp. 366–373). Springer Verlag. https://doi.org/10.1007/978-3-540-74976-9_35