CloudVista: Visual cluster exploration for extreme scale data in the cloud

Abstract

Efficient, high-quality clustering of extreme scale datasets with complex clustering structures remains one of the most challenging data analysis problems. An innovative use of the data cloud would provide a unique opportunity to address this challenge. In this paper, we propose the CloudVista framework to address (1) the problems caused by sampling in existing approaches and (2) the latency that cloud-side processing introduces into interactive cluster visualization. The CloudVista framework aims to explore the entire large dataset stored in the cloud with the help of the visual frame data structure and the previously developed VISTA visualization model. The latency of processing large data is addressed by the RandGen algorithm, which generates a series of related visual frames in the cloud without user intervention, and by a hierarchical exploration model supported by cloud-side subset processing. An experimental study shows that this framework is effective and efficient for visually exploring clustering structures in extreme scale datasets stored in the cloud. © 2011 Springer-Verlag Berlin Heidelberg.

Cite

Chen, K., Xu, H., Tian, F., & Guo, S. (2011). CloudVista: Visual cluster exploration for extreme scale data in the cloud. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6809 LNCS, pp. 332–350). https://doi.org/10.1007/978-3-642-22351-8_21
