In this paper we study the problem of clustering entities that are described by two types of data: attribute data and relationship data. While attribute data describe the inherent characteristics of the entities, relationship data represent associations among them. Attribute data can be mapped to Euclidean space, whereas that is not always possible for relationship data. The relationship data are described by a graph whose vertices are the entities and whose edges denote a relationship between the pair of entities they connect. We study clustering problems under a model in which the relationship data impose the constraint of 'internal connectedness', which requires that any two entities in a cluster be connected by an internal path, that is, a path that passes only through entities of the same cluster. We study the k-median and k-means clustering problems under this model. We show that these problems are Ω(log n)-hard to approximate and give O(log n)-approximation algorithms for specific cases of these problems. © 2011 Springer-Verlag.
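The internal connectedness constraint can be illustrated with a minimal sketch. It is not the paper's algorithm: it only checks whether a given clustering is feasible under the constraint and evaluates a k-median-style cost. The function names, the adjacency-dictionary representation, and the choice of cluster centers among the entities themselves are assumptions made for illustration.

```python
from collections import deque
from math import dist


def is_internally_connected(cluster, adjacency):
    """Return True if every pair of entities in `cluster` is joined by a
    path that uses only entities of `cluster` (BFS restricted to the cluster).

    `adjacency` is an assumed representation: a dict mapping each entity
    to an iterable of its neighbours in the relationship graph.
    """
    cluster = set(cluster)
    if not cluster:
        return True
    start = next(iter(cluster))
    seen = {start}
    queue = deque([start])
    while queue:
        u = queue.popleft()
        for v in adjacency.get(u, ()):
            # Only follow edges that stay inside the cluster.
            if v in cluster and v not in seen:
                seen.add(v)
                queue.append(v)
    return seen == cluster


def k_median_cost(clusters, centers, points):
    """Sum of Euclidean distances from each entity to its cluster's center.

    Assumption: each center is one of the entities. Squaring each distance
    would give the corresponding k-means-style cost.
    """
    return sum(
        dist(points[v], points[c])
        for cluster, c in zip(clusters, centers)
        for v in cluster
    )


# Hypothetical toy instance: four entities on a line, related along a path a-b-c-d.
points = {"a": (0.0,), "b": (1.0,), "c": (2.0,), "d": (3.0,)}
adjacency = {"a": ["b"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c"]}

feasible = [{"a", "b"}, {"c", "d"}]
infeasible = [{"a", "c"}, {"b", "d"}]  # a and c are linked only through b

print(all(is_internally_connected(c, adjacency) for c in feasible))    # True
print(all(is_internally_connected(c, adjacency) for c in infeasible))  # False
print(k_median_cost(feasible, ["a", "c"], points))                     # 2.0
```

In this toy instance the clustering {a, c}, {b, d} is rejected even though nothing in the attribute data forbids it, because the only path between a and c passes through b, which lies in the other cluster.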
CITATION STYLE
Gupta, N., Pancholi, A., & Sabharwal, Y. (2011). Clustering with internal connectedness. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6552 LNCS, pp. 158–169). https://doi.org/10.1007/978-3-642-19094-0_17