Implementing big data processing workflows using open source technologies

6Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In our implementation research, we apply workflow approach to the modeling and development of the Big Data processing pipeline using open source technologies. The data processing workflow is a set of interrelated steps which launch some particular jobs such as Spark job, shell job or Postgre SQL command. All workflow steps are chained to form integrated process and imitate the data load from staging storage area to the datamart storage area. The experimental workflow-based implementation of a data processing pipeline was performed that stages through different storage areas and uses actual industrial KPI dataset of some 30 millions records. Evaluation of implementation results provides proofs of the applicability of proposed workflow to other application domains and datasets which should satisfy the data format at input stage of the workflow.

References Powered by Scopus

332Citations
879Readers

This article is free to access.

11Citations
25Readers
9Citations
15Readers
Get full text

Cited by Powered by Scopus

Static code analysis tools: A systematic literature review

22Citations
61Readers
Get full text

Developing a data pipeline solution for big data processing

3Citations
26Readers
Get full text

EVALUATION OF STRATEGIES OVER STATIC CODE ANALYSIS TOOLS

2Citations
3Readers
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Suleykin, A., & Panfilov, P. (2019). Implementing big data processing workflows using open source technologies. In Annals of DAAAM and Proceedings of the International DAAAM Symposium (Vol. 30, pp. 394–404). Danube Adria Association for Automation and Manufacturing, DAAAM. https://doi.org/10.2507/30th.daaam.proceedings.054

Readers over time

‘20‘21‘22‘23‘24036912

Readers' Seniority

Tooltip

Lecturer / Post doc 2

40%

PhD / Post grad / Masters / Doc 2

40%

Professor / Associate Prof. 1

20%

Readers' Discipline

Tooltip

Computer Science 5

71%

Business, Management and Accounting 1

14%

Engineering 1

14%

Save time finding and organizing research with Mendeley

Sign up for free
0