Research of efficiency of technologies for data processing in tasks of big data analysis

0Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

The paper compares the performance of technologies for big data processing applied for anomaly detection in a database of a billing system. Experiments have been conducted to evaluate the performance for processing big data of two technologies such as Hadoop MapReduce and Apache Spark. The datasets and configuration of the applications used in the experiments are described. The experiment results are presented as a description of the relation between calculation time, data size and imbalance in datasets. Based on the obtained results, a decision making system for choosing a technology for processing data depending on the characteristics of the datasets is proposed.

References Powered by Scopus

Best trade-off point method for efficient resource provisioning in spark

1Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Teryoshkin, S. E., & Yakovina, I. N. (2020). Research of efficiency of technologies for data processing in tasks of big data analysis. In Journal of Physics: Conference Series (Vol. 1441). Institute of Physics Publishing. https://doi.org/10.1088/1742-6596/1441/1/012049

Readers over time

‘20‘21‘2300.751.52.253

Readers' Seniority

Tooltip

Professor / Associate Prof. 2

50%

Lecturer / Post doc 1

25%

PhD / Post grad / Masters / Doc 1

25%

Readers' Discipline

Tooltip

Engineering 3

60%

Business, Management and Accounting 1

20%

Computer Science 1

20%

Save time finding and organizing research with Mendeley

Sign up for free
0