Skip to Main content Skip to Navigation
Conference papers

JTeC: A Large Collection of Java Test Classes for Test Code Analysis and Processing

Abstract : The recent push towards test automation and test-driven development continues to scale up the dimensions of test code that needs to be maintained, analysed, and processed side-by-side with production code. As a consequence, on the one side regression testing techniques, e.g., for test suite prioritization or test case selection, capable to handle such large-scale test suites become indispensable; on the other side, as test code exposes own characteristics, specific techniques for its analysis and refactoring are actively sought. We present JTeC, a large-scale dataset of test cases that researchers can use for benchmarking the above techniques or any other type of tool expressly targeting test code. JTeC collects more than 2.5M test classes belonging to 31K+ GitHub projects and summing up to more than 430 Million SLOCs of ready-to-use real-world test code.
Document type :
Conference papers
Complete list of metadata
Contributor : Emilio Cruciani Connect in order to contact the contributor
Submitted on : Friday, December 11, 2020 - 3:44:51 PM
Last modification on : Wednesday, January 20, 2021 - 11:59:04 AM
Long-term archiving on: : Friday, March 12, 2021 - 7:57:58 PM


Files produced by the author(s)




Federico Corò, Roberto Verdecchia, Emilio Cruciani, Breno Miranda, Antonia Bertolino. JTeC: A Large Collection of Java Test Classes for Test Code Analysis and Processing. MSR 2020 - 17th International Conference on Mining Software Repositories, Jun 2020, Seoul / Virtual, South Korea. pp.578-582, ⟨10.1145/3379597.3387484⟩. ⟨hal-03007190⟩



Record views


Files downloads