Onion: Identifying Incident-Indicating Logs for Cloud Systems
Sat 28 Aug 2021 05:00 - 05:10 - Architectures & Design—Cloud Computing 2 Chair(s): Yu Kang
In cloud systems, incidents affect the availability of services and require quick mitigation actions. Once an incident occurs, operators and developers often examine logs to perform fault diagnosis. However, the large volume of diverse logs and the overwhelming details in log data make the manual diagnosis process time-consuming and error-prone. In this paper, we propose Onion, an automatic solution for precisely and efficiently locating incident-indicating logs, which can provide useful clues for diagnosing the incidents. We first point out three criteria for localizing incident-indicating logs, i.e., Consistency, Impact, and Bilateral-Difference. Then we propose a novel agglomeration of logs, called log clique, based on which these criteria are satisfied. To obtain log cliques, we develop an incident-aware log representation and a progressive log clustering technique. Contrast analysis is then performed on the cliques to identify the incident-indicating logs. We have evaluated Onion using well-labeled log datasets. Onion achieves an average F1-score of 0.95 and can process millions of logs in only a few minutes, demonstrating its effectiveness and efficiency. Onion has also been successfully applied to the cloud system of Microsoft. Its practicability has been confirmed through the quantitative and qualitative analysis of the real incident cases.
Fri 27 AugDisplayed time zone: Athens change
17:00 - 18:00 | Architectures & Design—Cloud Computing 2Industry Papers / Research Papers +12h Chair(s): Luciano Baresi Politecnico di Milano, Yu Kang Microsoft Research, Beijing, China | ||
17:00 10mPaper | Onion: Identifying Incident-Indicating Logs for Cloud Systems Industry Papers Xu Zhang Microsoft Research, Yong Xu Microsoft Research, Si Qin Microsoft Research, Shilin He Microsoft Research, Bo Qiao Microsoft Research, Ze Li Microsoft Azure, Hongyu Zhang University of Newcastle, Xukun Li Microsoft Azure, Yingnong Dang Microsoft Azure, Qingwei Lin Microsoft Research, Murali Chintalapati Microsoft Azure, Saravanakumar Rajmohan Microsoft 365, Dongmei Zhang Microsoft Research DOI | ||
17:10 10mPaper | Mono2Micro: A Practical and Effective Tool for Decomposing Monolithic Java Applications to Microservices Industry Papers Anup K. Kalia IBM Research, Jin Xiao IBM Research, Rahul Krishna IBM Research, Saurabh Sinha IBM Research, Maja Vukovic IBM Research, Debasish Banerjee IBM DOI | ||
17:20 10mPaper | RAPID: Checking API Usage for the Cloud in the Cloud Industry Papers Michael Emmi Amazon Web Services, Liana Hadarean Amazon Web Services, Ranjit Jhala University of California at San Diego; Amazon Web Services, Lee Pike Amazon Web Services, Nico Rosner Amazon Web Services, Martin Schäf Amazon Web Services, Aritra Sengupta Amazon Web Services, Willem Visser Amazon Web Services DOI | ||
17:30 30mLive Q&A | Q&A (Architectures & Design—Cloud Computing 2) Research Papers |
Sat 28 AugDisplayed time zone: Athens change
05:00 - 06:00 | Architectures & Design—Cloud Computing 2Research Papers / Industry Papers Chair(s): Yu Kang Microsoft Research, Beijing, China | ||
05:00 10mPaper | Onion: Identifying Incident-Indicating Logs for Cloud Systems Industry Papers Xu Zhang Microsoft Research, Yong Xu Microsoft Research, Si Qin Microsoft Research, Shilin He Microsoft Research, Bo Qiao Microsoft Research, Ze Li Microsoft Azure, Hongyu Zhang University of Newcastle, Xukun Li Microsoft Azure, Yingnong Dang Microsoft Azure, Qingwei Lin Microsoft Research, Murali Chintalapati Microsoft Azure, Saravanakumar Rajmohan Microsoft 365, Dongmei Zhang Microsoft Research DOI | ||
05:10 10mPaper | Mono2Micro: A Practical and Effective Tool for Decomposing Monolithic Java Applications to Microservices Industry Papers Anup K. Kalia IBM Research, Jin Xiao IBM Research, Rahul Krishna IBM Research, Saurabh Sinha IBM Research, Maja Vukovic IBM Research, Debasish Banerjee IBM DOI | ||
05:20 10mPaper | RAPID: Checking API Usage for the Cloud in the Cloud Industry Papers Michael Emmi Amazon Web Services, Liana Hadarean Amazon Web Services, Ranjit Jhala University of California at San Diego; Amazon Web Services, Lee Pike Amazon Web Services, Nico Rosner Amazon Web Services, Martin Schäf Amazon Web Services, Aritra Sengupta Amazon Web Services, Willem Visser Amazon Web Services DOI | ||
05:30 30mLive Q&A | Q&A (Architectures & Design—Cloud Computing 2) Research Papers |