4.7 Article

A Little Data Goes a Long Way: Automating Seismic Phase Arrival Picking at Nabro Volcano With Transfer Learning

Journal

Publisher

AMER GEOPHYSICAL UNION
DOI: 10.1029/2021JB021910

Keywords

volcano seismology; machine learning; transfer learning; phase arrival detection; earthquake detection; volcano monitoring

Funding

  1. NERC [NE/R017956/1, NE/R018006/1, NE/J012297/1] Funding Source: UKRI

Ask authors/readers for more resources

In this study, a deep learning model for automating phase arrival detection at Nabro volcano is developed using transfer learning with limited training data. The new model shows improved classification accuracy and arrival time residuals compared to existing models, and is more efficient in processing seismic data from multiple stations.
Supervised deep learning models have become a popular choice for seismic phase arrival detection. However, they do not always perform well on out-of-distribution data and require large training sets to aid generalization and prevent overfitting. This can present issues when using these models in new monitoring settings. In this work, we develop a deep learning model for automating phase arrival detection at Nabro volcano using a limited amount of training data (2,498 event waveforms recorded over 35 days) through a process known as transfer learning. We use the feature extraction layers of an existing, extensively trained seismic phase picking model to form the base of a new all-convolutional model, which we call U-GPD. We demonstrate that transfer learning reduces overfitting and model error relative to training the same model from scratch, particularly for small training sets (e.g., 500 waveforms). The new U-GPD model achieves greater classification accuracy and smaller arrival time residuals than off-the-shelf applications of two existing, extensively-trained baseline models for a test set of 800 event and noise waveforms from Nabro volcano. When applied to 14 months of continuous Nabro data, the new U-GPD model detects 31,387 events with at least four P-wave arrivals and one S-wave arrival, which is more than the original base model (26,808 events) and our existing manual catalog (2,926 events), with smaller location errors. The new model is also more efficient when applied as a sliding window, processing 14 months of data from seven stations in less than 4 h on a single graphics processing unit.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available