3.8 Proceedings Paper

Prioritize Crowdsourced Test Reports via Deep Screenshot Understanding

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/ICSE43902.2021.00090

Keywords

Crowdsourced testing; Mobile App Testing; Deep Screenshot Understanding

Funding

  1. National Key R&D Program of China [2018AAA0102302]
  2. National Natural Science Foundation of China [61802171, 61772014, 61690201]
  3. Fundamental Research Funds for the Central Universities [14380021]
  4. National Undergraduate Training Program for Innovation and Entrepreneurship [202010284073Z]

Ask authors/readers for more resources

Crowdsourced testing is increasingly popular in mobile app testing, but it poses a challenge for developers to manage the large number of test reports. This study introduces a novel approach called DEEPPRIOR that prioritizes crowdsourced test reports based on deep screenshot understanding. By representing test reports using a new feature called DEEPFEATURE, which includes widget details and bug context, the technique has shown promising results in outperforming existing methods with less overhead.
Crowdsourced testing is increasingly dominant in mobile application (app) testing, hut it is a great burden for app developers to inspect the incredible number of test reports. Many researches have been proposed to deal with test reports based only on texts or additionally simple image features. however, in mobile app testing, texts contained in test reports are condensed and the information is inadequate. Many screenshots its are included as complements that contain much richer information beyond texts. This trend motivates us to prioritize crowdsourced test reports based on a deep screenshot understanding. In this paper, we present a novel crowdsourced test report prioritization approach, namely DEEPPRIOR. We first represent the crowdsourced test reports with a novelly introduced feature, namely DEEPFEATURE, that includes all the widgets along with their tests, coordinates, types, and even intents based on the deep analysis of the app screenshots, and the textual descriptions in the crowdsourced test reports. DEEPFEATURE includes the Rug Feature, which directly describes the bugs. and the Context Feature, which depicts the thorough context of the hug. The similarity of the DEEPFEATURE is used to represent the test reports' similarity and prioritize the crowdsourced test reports. We formally define the similarity as DEEPFEATURE. We also conduct an empirical experiment to evaluate the effectiveness of the proposed technique with a large dataset group. The results show that DEEPPRIOR is promising. and it outperforms the state-of-the-art approach with less than half the overhead.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available