☆ 4.7 Article

Embracing limited and imperfect training datasets: opportunities and challenges in plant disease recognition using deep learning

FRONTIERS IN PLANT SCIENCE (2023)

Journal

FRONTIERS IN PLANT SCIENCE

Volume 14, Issue -, Pages -

Publisher

FRONTIERS MEDIA SA

DOI: 10.3389/fpls.2023.1225409

Keywords

plant disease recognition; AI in Agriculture; deep learning in agriculture; smart agriculture; precision agriculture

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Recent advancements in deep learning have improved plant disease recognition, but the scarcity of high-quality training datasets hampers their practical application. This paper argues for embracing poor datasets and defines and categorizes the challenges associated with using them. It provides an overview of existing studies and approaches, enhances understanding of these challenges, and contributes to the deployment of deep learning in real-world applications.

Recent advancements in deep learning have brought significant improvements to plant disease recognition. However, achieving satisfactory performance often requires high-quality training datasets, which are challenging and expensive to collect. Consequently, the practical application of current deep learning-based methods in real-world scenarios is hindered by the scarcity of high-quality datasets. In this paper, we argue that embracing poor datasets is viable and aims to explicitly define the challenges associated with using these datasets. To delve into this topic, we analyze the characteristics of high-quality datasets, namely, large-scale images and desired annotation, and contrast them with the limited and imperfect nature of poor datasets. Challenges arise when the training datasets deviate from these characteristics. To provide a comprehensive understanding, we propose a novel and informative taxonomy that categorizes these challenges. Furthermore, we offer a brief overview of existing studies and approaches that address these challenges. We point out that our paper sheds light on the importance of embracing poor datasets, enhances the understanding of the associated challenges, and contributes to the ambitious objective of deploying deep learning in real-world applications. To facilitate the progress, we finally describe several outstanding questions and point out potential future directions. Although our primary focus is on plant disease recognition, we emphasize that the principles of embracing and analyzing poor datasets are applicable to a wider range of domains, including agriculture. Our project is public available at https://github.com/xml94/EmbracingLimitedImperfectTrainingDatasets.

Embracing limited and imperfect training datasets: opportunities and challenges in plant disease recognition using deep learning

Journal

FRONTIERS IN PLANT SCIENCE

Publisher

FRONTIERS MEDIA SA

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Embracing limited and imperfect training datasets: opportunities and challenges in plant disease recognition using deep learning

Journal

FRONTIERS IN PLANT SCIENCE

Publisher

FRONTIERS MEDIA SA

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper