A scalable framework for universal data generation in parallel

Ling Gu; Minqi Zhou; Qiangqiang Kang; Aoying Zhou

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 8904, pp. 64–81 (2015)

DOI: 10.1007/978-3-319-15350-6_5

Abstract

More and more companies, such as Amazon and Twitter, face the big data problem, which demands higher performance to manage tremendously large data sets. Data management systems with new architectures that take full advantage of computer hardware are emerging, with the aim of maximizing system performance and fulfilling customers’ current or even future requirements. Testing the performance of a new data management system and confirming its suitability therefore becomes a primary task for these companies. Hence, effectively generating a scaled data set of a desired volume at a desired velocity becomes an imperative problem, together with the goal of preserving as many characteristics of the real data set as possible (realism). In this paper, we propose PSUG to generate a realistic database of the required volume and at the required velocity in a scalable, parallel manner. Our extensive experimental studies confirm the efficiency and effectiveness of the proposed method.

Cite

Gu, L., Zhou, M., Kang, Q., & Zhou, A. (2015). A scalable framework for universal data generation in parallel. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8904, pp. 64–81). Springer Verlag. https://doi.org/10.1007/978-3-319-15350-6_5
