When Life Gives You Oranges: Detecting and Diagnosing Intermittent Job Failures at Mozilla
Thu 26 Aug 2021 07:20 - 07:30 - Testing—Flaky Tests Chair(s): Wei Yang
Continuous delivery of cloud systems requires constant running of jobs (build processes, tests, etc.). One issue that plagues this continuous integration (CI) process is intermittent failures: non-deterministic false alarms that result not from a bug in the software or job specification but from issues in the underlying infrastructure. At Mozilla, such intermittent failures are called oranges, a reference to the color of the build status indicator. Because intermittent failures disrupt CI and lead to failures, they erode developers' trust in the jobs. We present a novel approach that automatically classifies failing jobs to determine whether a job execution failure arises from an actual software bug or was caused by flakiness in the job (e.g., a test) or the underlying infrastructure. For this purpose, we train classification models on job telemetry data to diagnose, with high precision, failure patterns involving features such as runtime, CPU load, operating system version, or specific platform. In an evaluation on a set of Mozilla CI jobs, our approach achieves precision scores of 73% on average across all data sets, with some test suites achieving precision scores good enough for fully automated classification (i.e., precision scores of up to 100%), and recall scores of 82% on average (up to 94%).
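To make the reported metrics concrete, the following minimal Python sketch shows how precision and recall would be computed for a classifier that labels failing jobs as intermittent. It is an illustration only, not the authors' implementation: the `JobResult` type, the `precision_recall` helper, and the toy data are all hypothetical and do not come from the Mozilla data sets.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class JobResult:
    """One failing CI job (hypothetical record, for illustration only)."""
    actual_intermittent: bool      # ground truth: the failure was an "orange"
    predicted_intermittent: bool   # what the classifier said


def precision_recall(results):
    """Return (precision, recall) for the 'intermittent' class."""
    tp = sum(r.actual_intermittent and r.predicted_intermittent for r in results)
    fp = sum((not r.actual_intermittent) and r.predicted_intermittent for r in results)
    fn = sum(r.actual_intermittent and (not r.predicted_intermittent) for r in results)
    # Guard against empty denominators so the helper never divides by zero.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Toy data (made up): 4 real oranges, 4 real bugs.
toy_results = [
    JobResult(True, True), JobResult(True, True), JobResult(True, True),
    JobResult(True, False),    # an orange the classifier missed
    JobResult(False, True),    # a real bug wrongly labeled as an orange
    JobResult(False, False), JobResult(False, False), JobResult(False, False),
]

precision, recall = precision_recall(toy_results)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Under this reading, precision matters most for fully automated classification: every false positive is a genuine bug dismissed as an orange, so only a precision near 100% makes it safe to act on the label without human review.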
Wed 25 Aug (displayed time zone: Athens)
19:00 - 20:00 | Testing—Flaky Tests | Industry Papers / Research Papers | +12h | Chair(s): Wing Lam, University of Illinois at Urbana-Champaign
19:00 | 10m | Paper | Flaky Test Detection in Android via Event Order Exploration | Research Papers | Zhen Dong (National University of Singapore), Abhishek Tiwari (National University of Singapore), Xiao Liang Yu (National University of Singapore), Abhik Roychoudhury (National University of Singapore)
19:10 | 10m | Paper | Quantifying No-Fault-Found Test Failures to Prioritize Inspection of Flaky Tests at Ericsson | Industry Papers
19:20 | 10m | Paper | When Life Gives You Oranges: Detecting and Diagnosing Intermittent Job Failures at Mozilla | Industry Papers | Johannes Lampel (CISPA; Saarland University), Sascha Just (Microsoft), Sven Apel (Saarland University), Andreas Zeller (CISPA)
19:30 | 30m | Live Q&A | Q&A (Testing—Flaky Tests) | Research Papers
Thu 26 Aug (displayed time zone: Athens)
07:00 - 08:00 | Testing—Flaky Tests | Research Papers / Industry Papers | Chair(s): Wei Yang, University of Texas at Dallas
07:00 | 10m | Paper | Flaky Test Detection in Android via Event Order Exploration | Research Papers | Zhen Dong (National University of Singapore), Abhishek Tiwari (National University of Singapore), Xiao Liang Yu (National University of Singapore), Abhik Roychoudhury (National University of Singapore)
07:10 | 10m | Paper | Quantifying No-Fault-Found Test Failures to Prioritize Inspection of Flaky Tests at Ericsson | Industry Papers
07:20 | 10m | Paper | When Life Gives You Oranges: Detecting and Diagnosing Intermittent Job Failures at Mozilla | Industry Papers | Johannes Lampel (CISPA; Saarland University), Sascha Just (Microsoft), Sven Apel (Saarland University), Andreas Zeller (CISPA)
07:30 | 30m | Live Q&A | Q&A (Testing—Flaky Tests) | Research Papers