4.3 Article

Unsupervised Machine Learning Via Transfer Learning and k-Means Clustering to Classify Materials Image Data

Journal

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s40192-021-00205-8

Keywords

Computer vision; Transfer learning; Image classification; Convolutional neural network; Unsupervised machine learning

Funding

  1. National Science Foundation [CMMI-1826218]
  2. Air Force Research Laboratory [FA8650-19-2-5209]

Ask authors/readers for more resources

Unsupervised machine learning is used to classify images in a microstructural dataset with high accuracy without the need for labeled training data. By extracting features and conducting clustering on the Northeastern University Steel Surface Defects Database, the approach outperforms previous studies in terms of both performance and utility. The results of a sensitivity analysis provide insights for applying unsupervised machine learning techniques to materials science problems of interest.
Unsupervised machine learning offers significant opportunities for extracting knowledge from unlabeled datasets and for achieving maximum machine learning performance. This paper demonstrates how to construct, use, and evaluate a high-performance unsupervised machine learning system for classifying images in a popular microstructural dataset. The Northeastern University Steel Surface Defects Database includes micrographs of six different defects observed on hot-rolled steel in a format that is convenient for training and evaluating models for image classification. We use the VGG16 convolutional neural network pre-trained on the ImageNet dataset of natural images to extract feature representations for each micrograph. After applying principal component analysis to extract signal from the feature descriptors, we use k-means clustering to classify the images without needing labeled training data. The approach achieves 99.4% +/- 0.16% accuracy, and the resulting model can be used to classify new images without retraining. This approach demonstrates an improvement in both performance and utility compared to a previous study. A sensitivity analysis is conducted to better understand the influence of each step on the classification performance. The results provide insight toward applying unsupervised machine learning techniques to problems of interest in materials science.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available