4.2 Article

Leaf-based species classification of hybrid cherry tomato plants by using hyperspectral imaging

Journal

JOURNAL OF NEAR INFRARED SPECTROSCOPY
Volume 31, Issue 1, Pages 41-51

Publisher

SAGE PUBLICATIONS LTD
DOI: 10.1177/09670335221148593

Keywords

Cherry tomato plants; hyperspectral imaging; mesophyll; near infrared spectral analysis; classification model

Ask authors/readers for more resources

This study developed a method using near infrared hyperspectral imaging (NIR-HSI) technology combined with machine learning to classify and identify different species of hybrid cherry tomato plants. The results showed that the linear discriminant analysis (LDA) model achieved the highest classification accuracy among the three machine learning algorithms, and the standard normal variate (SNV) preprocessing method had a greater improvement in model accuracy compared to other methods. Furthermore, using the mesophyll region or the whole leaf as regions of interest (ROI) for extracting reflectance spectra improved the classification accuracy of the leaf model. The LDA model combined with the SNV preprocessing method achieved high accuracy in both the training and test sets.
Approaches based on near infrared hyperspectral imaging (NIR-HSI) technology combined with machine learning have been developed to classify the leaves of hybrid cherry tomatoes and then identify the species of hybrid cherry tomato plants. The near infrared (NIR) hyperspectral images of 400 cherry tomato leaves (100 per species) were collected in the wavelength range of 900-1700 nm. Machine learning algorithms such as linear discriminant analysis (LDA), random forest (RF), and support vector machine (SVM) were employed to construct leaf classification models with the hyperspectral data preprocessed by Savitzky-Golay (SG) smoothing filter, first derivative (first Der) and standard normal variate (SNV). Principle of Component Analysis (PCA) was also used to reduce the data dimension and extract spectral features. It is revealed that the LDA model reaches the highest classification accuracy among the three machine learning algorithms and SNV can lead to higher improvement in model accuracy than other preprocessing methods of SG smoothing and first Der. Analysis based on PCA spectral feature extraction demonstrates that differences occur in internal material content in the leaves of cherry tomato plants with different species, which renders the models being able to distinguish between the species. Another important work was performed to reveal the different effects of the mesophyll and vein regions (VR) on the accuracy of the leaf classification model. It is demonstrated that the classification accuracy is improved by a value of 0.033 or 0.042 when mesophyll substitutes vein or whole leaf as regions of interest (ROI) to extract reflectance spectra for modeling. As a result, the accuracy of the training and test set respectively reached a high value of 0.998 and 0.973 for the LDA classification model combined with the SNV preprocessing method. The results propose that the use of mesophyll region (MR) as ROI can improve the performance of the leaf classification model, which provides a new strategy for efficient and non-destructive classification of different hybrid cherry tomato plants.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available