4.6 Article

DCGAN-Based Data Augmentation for Tomato Leaf Disease Identification

Journal

IEEE ACCESS
Volume 8, Issue -, Pages 98716-98728

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2020.2997001

Keywords

Diseases; Training; Generative adversarial networks; Gallium nitride; Data models; Image recognition; Neural networks; Tomato leaf disease; data augmentation; generative adversarial networks; generalization; recognition accuracy

Funding

  1. Public Welfare Industry (Agriculture) Research Projects Level-2 [201503116-04-06]
  2. Postdoctoral Foundation of Heilongjiang Province [LBHZ15020]
  3. Harbin Applied Technology Research and Development Program [2017RAQXJ096]
  4. National Key Application Research and Development Program in China [2018YFD0300105-2]

Ask authors/readers for more resources

Tomato leaf disease seriously affects the yield of tomato. It is extremely vital for agricultural economy to identify agricultural diseases. The traditional data augmentation methods, such as rotation, flip and translation, are severely limited, which cannot achieve good generalization results. To improve the recognition accuracy of tomato leaf diseases, a new method of data augmentation by generative adversarial networks (GANs) is proposed for leaf disease recognition in this work. Generated images augmented by deep convolutional generative adversarial networks (DCGAN) and original images as the input of GoogLeNet, this model can achieve a top-1 average identification accuracy of 94.33%. By adjusting the hyper-parameters, modifying the architecture of the convolutional neural networks, and selecting different generative adversarial networks, an improved model for training and testing 5 classes of tomato leaf images was obtained. Meanwhile, images generated by DCGAN not only enlarge the size of the data set, but also have the characteristics of diversity, which makes the model have a good generalization effect. We have also visually confirmed that the images generated by DCGAN have much better quality and are more convincing through the t-Distributed Stochastic Neighbor Embedding (t-SNE) and Visual Turing Test. Experiments with tomato leaf disease identification show that DCGAN can generate data that approximate to real images, which can be used to (1) provide a larger data set for the training of large neural networks, and improve the performance of the recognition model through highly discriminating image generation technology; (2) reduce the cost of data collection; (3) enhance the diversity of data and the generalization ability of the recognition models.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available