Luigi Libero Lucio Starace, Ph.D.

Assistant Professor @ Università degli Studi di Napoli Federico II, Italy.

E2EGit: A Dataset of End-to-End Web Tests in Open Source Projects

AuthorsSergio Di Meglio, Luigi Libero Lucio Starace, Valeria Pontillo, Ruben Opdebeek, Coen De Roover, and Sergio Di Martino.
conferenceMSR 2025 - 22nd International Conference on Mining Software Repositories.

Abstract

End-to-end (E2E) testing is a software validation approach that simulates realistic user scenarios throughout the entire workflow of an application. In the context of web applications, E2E testing involves two activities: Graphic User Interface (GUI) testing, which simulates user interactions with the web app’s GUI through web browsers, and performance testing, which evaluates system workload handling. Despite its recognized importance in delivering high-quality web applications, the availability of large-scale datasets featuring real-world E2E web tests remains limited, hindering research in the field.

To address this gap, we present E2EGit, a comprehensive dataset of non-trivial open-source web projects collected on GitHub that adopt E2E testing. By analyzing over 5,000 web repositories across popular programming languages (Java, JavaScript, TypeScript, and Python), we identified 472 repositories implementing 43,670 automated Web GUI tests with popular browser automation frameworks (Selenium, Playwright, Cypress, Puppeteer), and 84 repositories that featured 271 automated performance tests implemented leveraging the most popular open-source tools (JMeter, Locust). Among these, 13 repositories implemented both types of testing for a total of 786 Web GUI tests and 61 performance tests.

Dataset

The dataset is available on Zenodo. DOI