4.6 Article

An impact analysis of pre-processing techniques in spectroscopy data to classify insect-damaged in soybean plants with machine and deep learning methods

期刊

INFRARED PHYSICS & TECHNOLOGY
卷 123, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.infrared.2022.104203

关键词

Field spectroscopy; Remote sensing; Precision agriculture; Artificial intelligence

资金

  1. Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior (CAPES) [001]

向作者/读者索取更多资源

Spectroscopy is crucial for understanding phenomena in various fields of study, particularly in vegetation analysis. This study investigates the impact of pre-processing techniques on machine learning algorithms' performance in modeling insect damage in plants. The Extra-Tree algorithm performed the best when using first-order derivative data, and the near-infrared spectral region was identified as the most important. This research provides insights for future studies in applying pre-processing techniques and machine learning methods to spectral data.
Spectroscopy is essential to understand a series of phenomena in multiple fields of study. In remote sensing, vegetation analysis is one of the most prominent fields to explore, aiming to improve a specific task. As a task, modeling insect damage in the plants is essential to establish the correct management of agricultural farmlands. Hyperspectral data, which can be acquired with field spectroscopy at plant or leaf level, is a non-direct, rapid, and trustworthy approach to indicate its health. However, the spectral redundancy inherent is a challenge for the information extraction process, making the pre-processing phase an essential part of the analysis. Currently, artificial intelligence techniques, mostly based on machine and deep learning methods, are a standard application in data processing, being pre-processing techniques an essential part of it. But few studies aimed to measure the impact of such processes in vegetation monitoring, specifically with insect damage and spectral data. Here, we provide an analysis of the impact of pre-processing techniques on machine learning algorithms' performance over said classification task. For this, we used a field spectroradiometer that operates within the 350-1,000 nm and 1,000-2,500 nm ranges. The dataset was composed of multiple spectral measurements that took place on different days in a controlled environment with soybean plants. As pre-processing techniques, methods like baseline removal, smoothing, first and second-order derivatives, standard normal variate (SNV), multiplicative scatter correction (MSC), and principal components analysis (PCA) were investigated. Several machine learning algorithms and one deep learning method were applied to model the datasets. The impact of the pre-processing techniques was measured within validation metrics relate to its accuracy. Our results indicated that the Extra-Tree (ExT) algorithm was better, mainly when firstorder derivative data were extracted from the dataset (accuracy equal to 93.68%). A ranking approach indicated that the most contributive spectral region situates at the near-infrared, between 784 and 911 nm. Our investigation also demonstrates that a deep neural network (DNN) did not return a satisfactory result over raw reflectance data. However, when considering a combination of PCA over the 2nd derivative data, it achieved similar results to the ExT algorithm (accuracy of 91.95%). The implications of such, alongside the ranking approach, are discussed in this paper. We hope that the information presented here serves as a framework for future research when applying preprocessing techniques alongside the machine and deep learning methods over spectral data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据