Mutation Operators for Large Scale Data Processing Programs in Spark

João Batista de Souza Neto; Anamaria Martins Moreira; Genoveva Vargas-Solar; Martin Alejandro Musicante

Conference ProceedingsOPEN ACCESS

Mutation Operators for Large Scale Data Processing Programs in Spark

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 12127 LNCS 482-497

DOI: 10.1007/978-3-030-49435-3_30

3Citations

9Readers

Get full text

Abstract

This paper proposes a mutation testing approach for big data processing programs that follow a data flow model, such as those implemented on top of Apache Spark. Mutation testing is a fault-based technique that relies on fault simulation by modifying programs, to create faulty versions called mutants. Mutant creation is carried on by operators able to simulate specific and well identified faults. A testing process must be able to signal faults within mutants and thereby avoid having ill behaviours within a program. We propose a set of mutation operators designed for Spark programs characterized by a data flow and data processing operations. These operators model changes in the data flow and operations, to simulate faults that take into account Spark program characteristics. We performed manual experiments to evaluate the proposed mutation operators in terms of cost and effectiveness. Thereby, we show that mutation operators can contribute to the testing process, in the construction of reliable Spark programs.

Author supplied keywords

Cite

CITATION STYLE

APA

de Souza Neto, J. B., Martins Moreira, A., Vargas-Solar, G., & Musicante, M. A. (2020). Mutation Operators for Large Scale Data Processing Programs in Spark. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12127 LNCS, pp. 482–497). Springer. https://doi.org/10.1007/978-3-030-49435-3_30

Mutation Operators for Large Scale Data Processing Programs in Spark

Abstract

Author supplied keywords

Cite

Register to see more suggestions