Speeding DBLP querying using hadoop and spark

2Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Big data is becoming bigger every day. Even for simple applications such as the Digital Bibliography & Library Project (DBLP) database, the data is becoming unmanageable using the conventional databases because of its size. Applying big data processing methods such as Hadoop and Spark is becoming more popular because of that. In this work, we investigate the use of Hadoop and Spark in the querying process of big data and we compare the performance of them in terms of their execution time. We use the DBLP database as a case study. Results show that Hadoop and Spark enhances the query execution time significantly when compared with conventional database management systems. We also found that Spark enhances the execution time over Hadoop.

Cite

CITATION STYLE

APA

Alsmirat, M., Jararweh, Y., & Al-Ayyoub, M. (2018). Speeding DBLP querying using hadoop and spark. In IOP Conference Series: Materials Science and Engineering (Vol. 459). Institute of Physics Publishing. https://doi.org/10.1088/1757-899X/459/1/012003

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free