4.8 Article

Data augmentation in microscopic images for material data mining

Journal

NPJ COMPUTATIONAL MATERIALS
Volume 6, Issue 1

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41524-020-00392-6

Funding

  1. National Key Research and Development Program of China [2016YFB0700500]
  2. National Science Foundation of China [51574027, 61572075, 6170203, 61873299]
  3. Finance Science and Technology Project of Hainan Province [ZDYF2019009]
  4. Fundamental Research Funds for the University of Science and Technology Beijing [FRF-BD-19-012A, FRF-TP-19-043A2]
  5. USTB MatCom of Beijing Advanced Innovation Center for Materials Genome Engineering

Recent progress in material data mining has been driven by high-capacity models trained on large datasets. However, collecting experimental data (real data) is extremely costly owing to the human effort and expertise required. Here, we develop a novel transfer learning strategy to address the problem of small or insufficient datasets. The strategy fuses real and simulated data to augment the training data in a data mining procedure. For the specific task of grain instance segmentation in images, it generates synthetic data by fusing images obtained by simulating the physical mechanism of grain formation with the image style information of real images. The results show that a model trained with the synthetic data and only 35% of the real data achieves segmentation performance competitive with that of a model trained on all of the real data. Because the time required to perform grain simulation and to generate synthetic data is almost negligible compared with the effort of obtaining real data, the proposed strategy can exploit the strong predictive power of deep learning without significantly increasing the experimental burden of training data preparation.
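
The data-mixing step described in the abstract (all synthetic samples plus a fraction of the real samples) could be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the function name `build_training_set`, the parameter `real_fraction`, the random subsampling, and the placeholder sample identifiers are all assumptions introduced here, and the simulation and style-fusion steps that produce the synthetic images are not shown.

```python
import random


def build_training_set(real_samples, synthetic_samples, real_fraction=0.35, seed=0):
    """Combine every synthetic sample with a random subset of the real samples.

    Hypothetical helper: the abstract reports using 35% of the real data,
    but does not say how that subset was chosen; random sampling is assumed.
    """
    if not 0.0 <= real_fraction <= 1.0:
        raise ValueError("real_fraction must be between 0 and 1")
    rng = random.Random(seed)  # seeded for a reproducible subset
    n_real = round(len(real_samples) * real_fraction)
    real_subset = rng.sample(list(real_samples), n_real)
    mixed = real_subset + list(synthetic_samples)
    rng.shuffle(mixed)  # interleave real and synthetic samples
    return mixed


# Placeholder identifiers standing in for (image, label) pairs.
real = [f"real_{i}" for i in range(100)]
synthetic = [f"synthetic_{i}" for i in range(200)]
train = build_training_set(real, synthetic)
```

In practice each entry would be an image paired with its segmentation label, and the mixed list would be passed to whatever training loop the segmentation model uses.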
