We define the notion of coresets for probabilistic clustering problems and propose the first (k, ε)-coreset constructions for the probabilistic k-median problem in the metric and Euclidean case. The coresets are of size poly(ε-1, k, log(W/(wmin • pmin • δ))), where W is the expected total weight of the weighted probabilistic input points, wmin is the minimum weight of a probabilistic input point, pmin is the minimum realization probability, and δ is the error probability of the construction. We show how to maintain our coreset for Euclidean spaces in data streams. © Springer-Verlag 2013.
CITATION STYLE
Lammersen, C., Schmidt, M., & Sohler, C. (2013). Probabilistic k-median clustering in data streams. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7846 LNCS, pp. 70–81). https://doi.org/10.1007/978-3-642-38016-7_7
Mendeley helps you to discover research relevant for your work.