4.7 Article

An ensemble machine learning method for microplastics identification with FTIR spectrum

Journal

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.jece.2022.108130

Keywords

Microplastics identification; Machine learning; FTIR; Deep learning; Data pre-processing

Funding

  1. Technological University of the Shannon (TUS) , Ireland [SFI 16/RC/3918]
  2. Technological University of the Shannon (TUS) , Ireland
  3. Science Foundation Ireland (SFI) [SFI 16/RC/3918]
  4. European Regional Development Fund

Ask authors/readers for more resources

Research has shown that microplastic pollution has been found in many coasts around the world, and Machine Learning methods like Support Vector Machines, KNearest Neighbours, and Random Forests can be used for identification and classification of microplastics using ATR-FTIR data. However, existing FTIR datasets are imbalanced and some microplastics may be contaminated, affecting the performance of ML classification algorithms. Therefore, an ensemble learning algorithm has been proposed to improve microplastic identification methods.
Microplastics (MPs) (size < 5 mm) marine pollution have been investigated and monitored by many researchers and found in many coasts around the world. These toxic chemicals make their way into human diet through food chain when aquatic organisms ingest MPs. Attenuated Total Reflection Fourier transform infrared spectroscopy (ATR-FTIR) is a very effective method to detect MPs. To provide the automatic detecting method for MPs, Numerous studies have proposed Machine Learning (ML) based methods, such as Support Vector Machines, KNearest Neighbours, and Random Forests, for identification and classification of MPs through using the ATRFTIR data. The evaluations of these ML based methods primarily focus on the average scores across all types of MPs. However, the existing FTIR datasets are normally imbalanced. Furthermore, some MPs contain the identical functional group, and some MPs may be fouled or contaminated, which will reduce the quality of FTIR data samples (e.g. lacking of peaks or creating noises). These factors will interfere the ML classification algorithms and cause the algorithms to perform differently while identifying different MPs. Hence, this work proposes an ensemble learning algorithm to exploit the advantage of different ML algorithms based on a systematic evaluation of the existing ML based MP identification approaches. A neural network is employed to fuse the outputs of chosen ML algorithms to improve the overall metrics. The evaluation results show that the proposed algorithm outperforms existing single ML based approaches.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available