Skip to Main content Skip to Navigation
New interface
Journal articles

TRANSMUT-SPARK: Transformation Mutation for Apache Spark - Long Version

Abstract : This paper proposes TRANSMUT-SPARK for automating mutation testing of Big Data processing code within Spark programs. Apache Spark is an engine for Big Data Analytics/Processing that hides the inherent complexity of parallel Big Data programming. Nonetheless, programmers must cleverly combine Spark built-in functions within programs and guide the engine to use the right data management strategies to exploit the computational resources required by Big Data processing and avoid substantial production losses. Many programming details in Spark data processing code are prone to false statements that must be correctly and automatically tested. This paper explores the application of mutation testing in Spark programs, a faultbased testing technique that relies on fault simulation to evaluate and design test sets. The paper introduces TRANSMUT-SPARK for testing Spark programs by automating the most laborious steps of the process and fully executing the mutation testing process. The paper describes how the TRANSMUT-SPARK automates the mutants generation, test execution, and adequacy analysis phases of mutation testing. It also discusses the results of experiments to validate the tool and argues its scope and limitations.
Document type :
Journal articles
Complete list of metadata
Contributor : Genoveva Vargas-Solar Connect in order to contact the contributor
Submitted on : Tuesday, January 4, 2022 - 12:53:00 PM
Last modification on : Friday, September 30, 2022 - 11:34:16 AM


Files produced by the author(s)



João Batista de Souza Neto, Anamaria Martins Moreira, Genoveva Vargas-Solar, Martin A Musicante. TRANSMUT-SPARK: Transformation Mutation for Apache Spark - Long Version. Journal of : Software Testing, Verification and Reliability, In press, ⟨10.1002/stvr.1809⟩. ⟨hal-03509951⟩



Record views


Files downloads