Benchmarking Automated GUI Testing for Android against Real-World Bugs
Fri 27 Aug 2021 00:10 - 00:20 - Human Aspects—HCI and Mobile Chair(s): Gustavo Pinto
To ensure the reliability of Android apps, there has been tremendous, continuous progress
in improving automated GUI testing over the past decade.
Specifically, dozens of testing techniques and tools
have been developed and demonstrated to be effective
in detecting crash bugs and to outperform their respective prior work
in the number of detected crashes.
However, an overarching question, "How effectively and thoroughly
can these tools find crash bugs in practice?", has not been
well explored; answering it requires a ground-truth benchmark with real-world bugs.
Since prior studies focus on tool comparisons with respect to some selected apps, they cannot
provide direct, in-depth answers to this question.
To complement existing work and tackle the above question,
this paper offers the first ground-truth empirical evaluation of
automated GUI testing for Android.
To this end, we devote substantial manual effort to
setting up the Themis benchmark set, which includes (1) a carefully constructed dataset
with 52 real, reproducible crash bugs (taking
two person-months for its collection and validation), and (2)
a unified, extensible infrastructure with six
recent state-of-the-art testing tools.
The whole evaluation has taken over 10,920 CPU hours.
We find a considerable gap in these tools' ability to find
the collected real bugs: 18 of the 52 bugs cannot be detected by any tool.
Our systematic analysis further identifies five major common
challenges that these tools face, and reveals additional findings
such as factors affecting these tools in bug finding and opportunities
for tool improvements.
Overall, this work offers new concrete insights, most of which
were previously unknown or unstated and are difficult to obtain.
Our study offers a new perspective, complementary to prior
studies, for understanding
and analyzing the effectiveness of existing testing tools,
as well as a benchmark for future research on this topic.
The Themis benchmark is publicly available at
https://github.com/the-themis-benchmarks/home.
Thu 26 Aug (displayed time zone: Athens)
12:00 - 13:00 | Human Aspects—HCI and Mobile | Research Papers / Industry Papers | +12h | Chair(s): Jürgen Cito (TU Vienna; Facebook)
12:00 10m Paper | Data-Driven Accessibility Repair Revisited: On the Effectiveness of Generating Labels for Icons in Android Apps | Research Papers | Forough Mehralian (University of California at Irvine), Navid Salehnamadi (University of California at Irvine), Sam Malek (University of California at Irvine)
12:10 10m Paper | Benchmarking Automated GUI Testing for Android against Real-World Bugs | Research Papers
12:20 10m Paper | An Empirical Study of GUI Widget Detection for Industrial Mobile Games | Industry Papers | Jiaming Ye (Kyushu University), Ke Chen (Fuxi AI Lab of Netease), Xiaofei Xie (Kyushu University), Lei Ma (University of Alberta), Ruochen Huang (University of Alberta), Yingfeng Chen (Fuxi AI Lab of Netease), Yinxing Xue (University of Science and Technology of China), Jianjun Zhao (Kyushu University)
12:30 30m Live Q&A | Q&A (Human Aspects—HCI and Mobile) | Research Papers
Fri 27 Aug (displayed time zone: Athens)
00:00 - 01:00 | Human Aspects—HCI and Mobile | Research Papers / Industry Papers | Chair(s): Gustavo Pinto (Federal University of Pará (UFPA) and Zup Innovation)
00:00 10m Paper | Data-Driven Accessibility Repair Revisited: On the Effectiveness of Generating Labels for Icons in Android Apps | Research Papers | Forough Mehralian (University of California at Irvine), Navid Salehnamadi (University of California at Irvine), Sam Malek (University of California at Irvine)
00:10 10m Paper | Benchmarking Automated GUI Testing for Android against Real-World Bugs | Research Papers
00:20 10m Paper | An Empirical Study of GUI Widget Detection for Industrial Mobile Games | Industry Papers | Jiaming Ye (Kyushu University), Ke Chen (Fuxi AI Lab of Netease), Xiaofei Xie (Kyushu University), Lei Ma (University of Alberta), Ruochen Huang (University of Alberta), Yingfeng Chen (Fuxi AI Lab of Netease), Yinxing Xue (University of Science and Technology of China), Jianjun Zhao (Kyushu University)
00:30 30m Live Q&A | Q&A (Human Aspects—HCI and Mobile) | Research Papers