☆ 4.7 Article

A Machine Learning-Based Approach to Clinopyroxene Thermobarometry: Model Optimization and Distribution for Use in Earth Sciences

JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH (2022)

Journal

JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH

Volume 127, Issue 4, Pages -

Publisher

AMER GEOPHYSICAL UNION

DOI: 10.1029/2021JB022904

Keywords

machine learning random forest; clinopyroxene thermobarometry; model optimization

Funding

Swiss National Science Foundation [200021_184632]
European Research Council (ERC) under the European Union [677493 -FEVER]
Universita degli Studi di Perugia ENGAGE [FRB-2019]
Universite de Geneve
Swiss National Science Foundation (SNF) [200021_184632] Funding Source: Swiss National Science Foundation (SNF)

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This study presents a methodological assessment of random forest thermobarometry for clinopyroxene. The researchers found that changing the hyperparameters had little impact on the overall model performance. However, the selection method for the final value from the distribution of trees greatly affected the accuracy. The study also provides two scripts for users to apply the methodology to natural data sets and estimate uncertainties for each analysis.

Thermobarometry is a fundamental tool to quantitatively interrogate magma plumbing systems and broaden our appreciation of volcanic processes. Developments in random forest-based machine learning lend themselves to a data-driven approach to clinopyroxene thermobarometry, allowing users to access large experimental data sets that can be tailored to individual applications in Earth Sciences. We present a methodological assessment of random forest thermobarometry using the R freeware package extraTrees. We investigate the model performance, the effect of hyperparameter tuning, and assess different methods for calculating uncertainties. Deviating from the default hyperparameters used in the extraTrees package results in little difference in overall model performance (<0.2 kbar and <3 degrees C difference in standard error estimate, SEE). However, accuracy is greatly affected by how the final value from the distribution of trees in the random forest is selected (mean, median, or mode). Using the mean value leads to higher residuals between experimental and predicted P and T, whereas using median values produces smaller residuals. Additionally, this work provides two scripts for users to apply the methodology to natural data sets. The first script permits modification and filtering of the model calibration data set. The second script contains premade models, where users can rapidly input their data to recover PT estimates (SEE clinopyroxene-only model: 3.2 kbar, 72.5 degrees C and liquid-clinopyroxene model: 2.7 kbar, 44.9 degrees C). Additionally, the scripts allow the user to estimate the uncertainty for each analysis, which in some cases is significantly smaller than the reported SEE. These scripts are open source and can be accessed at .

A Machine Learning-Based Approach to Clinopyroxene Thermobarometry: Model Optimization and Distribution for Use in Earth Sciences

Journal

JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH

Publisher

AMER GEOPHYSICAL UNION

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

A Machine Learning-Based Approach to Clinopyroxene Thermobarometry: Model Optimization and Distribution for Use in Earth Sciences

Journal

JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH

Publisher

AMER GEOPHYSICAL UNION

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper