One of the fundamental technology used in Big Data Analytics is the distributed computing. The traditional distributed computing technology has been adapted to create a new class of distributed computing platform and software components that make the big data analytics easier to implement. In this chapter, we discuss few of these technologies. First, we discuss the distributed database technology and how this technology has been adapted to develop no-SQL database technologies. Following this, we discuss the distributed file system (HDFS) and distributed computing technology such as map-reduce and spark. We discuss how the distributed storage and distributed computing has impacted the machine learning platforms for big data. Next, we discuss the distributed search platform and how such search platform can be used for data analytics on textual documents. We also describe the distributed communication platform such as message queue and message processing software. The data visualization technology is also changing with the big data. So lastly we introduce readers to few newer data visualization platforms targeted for big data.
CITATION STYLE
Dutta, K. (2017). Distributed Computing Technologies in Big Data Analytics (pp. 57–82). https://doi.org/10.1007/978-3-319-59834-5_4
Mendeley helps you to discover research relevant for your work.