Research of efficiency of technologies for data processing in tasks of big data analysis

S. E. Teryoshkin; I. N. Yakovina

Conference ProceedingsOPEN ACCESS

Research of efficiency of technologies for data processing in tasks of big data analysis

Journal of Physics: Conference Series (2020) 1441(1)

DOI: 10.1088/1742-6596/1441/1/012049

0Citations

6Readers

Abstract

The paper compares the performance of technologies for big data processing applied for anomaly detection in a database of a billing system. Experiments have been conducted to evaluate the performance for processing big data of two technologies such as Hadoop MapReduce and Apache Spark. The datasets and configuration of the applications used in the experiments are described. The experiment results are presented as a description of the relation between calculation time, data size and imbalance in datasets. Based on the obtained results, a decision making system for choosing a technology for processing data depending on the characteristics of the datasets is proposed.

References Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Teryoshkin, S. E., & Yakovina, I. N. (2020). Research of efficiency of technologies for data processing in tasks of big data analysis. In Journal of Physics: Conference Series (Vol. 1441). Institute of Physics Publishing. https://doi.org/10.1088/1742-6596/1441/1/012049

Readers over time

Readers' Seniority

Professor / Associate Prof. 2

50%

Lecturer / Post doc 1

25%

PhD / Post grad / Masters / Doc 1

25%

Readers' Discipline

Engineering 3

60%

Business, Management and Accounting 1

20%

Computer Science 1

20%

Research of efficiency of technologies for data processing in tasks of big data analysis

Abstract

References Powered by Scopus

Best trade-off point method for efficient resource provisioning in spark

Register to see more suggestions

Cite

Readers over time

Readers' Seniority

Readers' Discipline