Distributed Computing Technologies in Big Data Analytics

  • Dutta K
N/ACitations
Citations of this article
26Readers
Mendeley users who have this article in their library.
Get full text

Abstract

One of the fundamental technology used in Big Data Analytics is the distributed computing. The traditional distributed computing technology has been adapted to create a new class of distributed computing platform and software components that make the big data analytics easier to implement. In this chapter, we discuss few of these technologies. First, we discuss the distributed database technology and how this technology has been adapted to develop no-SQL database technologies. Following this, we discuss the distributed file system (HDFS) and distributed computing technology such as map-reduce and spark. We discuss how the distributed storage and distributed computing has impacted the machine learning platforms for big data. Next, we discuss the distributed search platform and how such search platform can be used for data analytics on textual documents. We also describe the distributed communication platform such as message queue and message processing software. The data visualization technology is also changing with the big data. So lastly we introduce readers to few newer data visualization platforms targeted for big data.

Cite

CITATION STYLE

APA

Dutta, K. (2017). Distributed Computing Technologies in Big Data Analytics (pp. 57–82). https://doi.org/10.1007/978-3-319-59834-5_4

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free