4.7 Article

Feature discovery in NIR spectroscopy based Rocha pear classification

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 177, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2021.114949

关键词

Feature extraction; Feature selection; Data analysis; Classification; Machine learning

资金

  1. FCT - Fundacao para a Ciencia e a Tecnologia, Portugal, through the CEOT strategic projects [UID/Multi/00631/2013, UID/Multi/00631/2019]
  2. Calibrafruta, Lda.
  3. contract of Programa Ciencia 2008, FCT
  4. IEFP extracurricular traineeship
  5. Ideias em Caixa
  6. national funds through FCT - Foundation for Science and Technology, I.P., through IDMEC, under LAETA [UIDB/50022/2020]
  7. Fundação para a Ciência e a Tecnologia [UID/Multi/00631/2019, UID/Multi/00631/2013] Funding Source: FCT

向作者/读者索取更多资源

This study explores the use of feature engineering for preprocessing in fruit classification, as well as the division and selection of wavelength domain spectra. These methods can improve classification accuracy and reduce over-training. Experimental results show that the proposed method outperforms traditional approaches in accuracy and can identify features with physical chemistry significance.
Non-invasive techniques for automatic fruit classification are gaining importance in the global agro-industry as they allow for optimizing harvesting, storage, management, and distribution decisions. Visible, near infra-red (NIR) diffuse reflectance spectroscopy is one of the most employed techniques in such fruit classification. Typically, after the acquisition of a fruit reflectance spectrum the wavelength domain signal is preprocessed and a classifier is designed. Up to now, little or no work considered the problem of feature generation and selection of the reflectance spectrum. This work aims at filling this gap, by exploiting a feature engineering phase before the classifier. The usual approach where the classifier is fed directly with the reflectances measured at each wavelength is contrasted with the proposed division of the spectra into bands and their characterization in wavelength, frequency, and wavelength-frequency domains. Feature selection is also applied for optimizing efficiency, predictive accuracy, and for mitigating over-training. A total of 3050 Rocha pear samples from different origins and harvest years are considered. Statistical tests of hypotheses on classification results of soluble solids content - a predictor of both fruit sweetness and ripeness - show that the proposed preliminary phase of feature engineering outperforms the usual direct approach both in terms of accuracy and in the number of necessary features. Moreover, the method allows for the identification of features that are physical chemistry meaningful.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据