4.5 Article

Automatic Waveform Quality Control for Surface Waves Using Machine Learning

Journal

SEISMOLOGICAL RESEARCH LETTERS
Volume 93, Issue 3, Pages 1683-1694

Publisher

SEISMOLOGICAL SOC AMER
DOI: 10.1785/0220210302

Keywords

-

Funding

  1. U.S. Department of Energy (DOE), Office of Fossil Energy, Carbon Storage Program through the Science-informed Machine Learning for Accelerating Real-Time Decisions in Subsurface Applications (SMART) Initiative
  2. Office of Nuclear Detonation Detection within the National Nuclear Security Administration [NA-222]
  3. U.S. DOE [DE-AC05-00OR22725]

Ask authors/readers for more resources

This article presents an automated approach using machine learning algorithms for processing quality control screening of surface-wave seismograms. By training logistic regression, support vector machines, K-nearest neighbors, random forests, and artificial neural networks with nearly 400,000 human-labeled waveforms, the authors developed two best-performing models, specifically the ANN and RF models. Testing results showed that these models agreed with labels from human analysts for seismic events in geographic regions not used for training, while requiring only 0.4% of the time. These automated processing methods can effectively reduce outliers in surface-wave-related measurements.
Surface-wave seismograms are widely used by researchers to study Earth's interior and earthquakes. To extract information reliably and robustly from a suite of surface wave-forms, the signals require quality control screening to reduce artifacts from signal com-plexity and noise. This process has usually been completed by human experts labeling each waveform visually, which is time consuming and tedious for large data sets. We explore automated approaches to improve the efficiency of waveform quality control processing by investigating logistic regression, support vector machines, K-nearest neighbors, random forests (RF), and artificial neural networks (ANN) algorithms. To speed up signal quality assessment, we trained these five machine learning (ML) meth-ods using nearly 400,000 human-labeled waveforms. The ANN and RF models outper-formed other algorithms and achieved a test accuracy of 92%. We evaluated these two best-performing models using seismic events from geographic regions not used for training. The results show that the two trained models agree with labels from human analysts but required only 0.4% of the time. Although the original (human) quality assignments assessed general waveform signal-to-noise, the ANN or RF labels can help facilitate detailed waveform analysis. Our investigations demonstrate the capability of the automated processing using these two ML models to reduce outliers in surface-wave-related measurements without human quality control screening.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available