4.7 Article

Machine learning for pattern and waveform recognitions in terahertz image data

期刊

SCIENTIFIC REPORTS
卷 11, 期 1, 页码 -

出版社

NATURE PORTFOLIO
DOI: 10.1038/s41598-020-80761-9

关键词

-

资金

  1. Cooperative Research Program of Research Center for Development of Far-Infrared Region, University of Fukui [H30FIRDM022B]
  2. Council for Science, Technology and Innovation, Cross-ministerial Strategic Innovation Promotion Program(SIP), Materials Integration for Revolutionary Design System of Structural Materials(Funding agency: JST)

向作者/读者索取更多资源

Several machine learning techniques were tested for automated pattern and waveform recognition of terahertz time-domain spectroscopy datasets, with random forest algorithm showing good performance. Linear correlation and additional cross-validation criteria can be used to evaluate classifier quality, requiring a standardized image pre-processing procedure for different rust staging datasets. Random forest is practically the best choice for waveform recognition in terms of classification accuracy and timing.
Several machine learning (ML) techniques were tested for the feasibility of performing automated pattern and waveform recognitions of terahertz time-domain spectroscopy datasets. Out of all the ML techniques under test, it was observed that random forest statistical algorithm works well with the THz datasets in both the frequency and time domains. With such ML algorithm, a classifier can be created with less than 1% out-of-bag error for segmentation of rusted and non-rusted sample regions of the image datasets in frequency domain. The degree of linear correlation between the rusted area percentage and the image spatial resolution with terahertz frequency can be used as an additional cross-validation criteria for the evaluation of classifier quality. However, for different rust staging measured datasets, a standardized procedure of image pre-processing is necessary to create/apply a single classifier and its usage is only limited to 1 +/- 0.2 THz. Moreover, random forest is practically the best choice among the several popular ML techniques under test for waveform recognition of time-domain data in terms of classification accuracy and timing. Our results demonstrate the usefulness of random forest and several other machine learning algorithms for terahertz hyperspectral pattern recognition.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据