Article

GSPy: A new toolbox and data standard for Geophysical Datasets

Journal

FRONTIERS IN EARTH SCIENCE
Volume 10

Publisher

FRONTIERS MEDIA SA
DOI: 10.3389/feart.2022.907614

Keywords

data standards; NetCDF; open-source software; geophysics; airborne geophysics

Funding

  1. USGS Water Availability and Use Science Program
  2. USGS Mineral Resources Program

The paper proposes a new geophysical data standard, the GS convention, that builds on the well-established NetCDF file format and the CF metadata convention. The accompanying open-source Python package, GSPy, provides methods and workflows for building GS-standardized NetCDF files, importing and exporting data, preparing input files, and visualizing data and models.
The diversity of geophysical methods and datatypes, as well as the isolated nature of various specialties (e.g., electromagnetic, seismic, potential fields), leads to a profusion of separate data file formats and documentation conventions. This can hinder cooperation and reduce the impact of datasets that researchers have invested heavily to collect and prepare. An open, portable, and well-supported community data standard could greatly improve the interoperability, transferability, and long-term archival of geophysical data. Airborne geophysical methods particularly need an open and accessible data standard, and they exemplify the complexity that is common in geophysical datasets, where critical auxiliary information on the survey and system parameters is required to fully utilize and understand the data. Here, we propose a new Geophysical Standard, termed the GS convention, that leverages the well-established and widely used NetCDF file format and builds on the Climate and Forecast (CF) metadata convention. We also present an accompanying open-source Python package, GSPy, to provide methods and workflows for building the GS-standardized NetCDF files, importing and exporting between common data formats, preparing input files for geophysical inversion software, and visualizing data and inverted models. By using the NetCDF format, handled through the Xarray Python package, and following the CF conventions, we standardize how metadata is recorded and directly stored with the data, from general survey and system information down to specific variable attributes. Utilizing the hierarchical nature of NetCDF, GS-formatted files are organized with a root Survey group that contains global metadata about the geophysical survey. Data are then organized into subgroups beneath Survey and are categorized as Tabular or Raster depending on the geometry and point of origin for the data.
Lastly, the standard ensures consistency in constructing and tracking coordinate reference systems, which is vital for accurate portability and analysis. Development and adoption of a NetCDF-based data standard for geophysical surveys can greatly improve how these complex datasets are shared and utilized, making the data more accessible to a broader science community. The architecture of GSPy can be easily transferred to additional geophysical datatypes and methods in future releases.
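The group layout described in the abstract can be pictured with a small plain-Python model. This is only an illustrative sketch and not the GSPy API: the `Group` and `Variable` classes, the `missing_metadata` helper, the group names `flight_lines` and `gridded_map`, the `type` attribute, and all values are hypothetical. The only conventions borrowed from CF are the `Conventions`, `long_name`, `units`, `grid_mapping`, and `grid_mapping_name` attribute names.

```python
from dataclasses import dataclass, field


@dataclass
class Variable:
    """A data variable whose metadata travels with its values."""
    values: list
    attrs: dict


@dataclass
class Group:
    """A NetCDF-like group: attributes, variables, and child groups."""
    name: str
    attrs: dict = field(default_factory=dict)
    variables: dict = field(default_factory=dict)
    children: dict = field(default_factory=dict)

    def add(self, child):
        self.children[child.name] = child
        return child

    def walk(self, prefix=""):
        """Yield (path, group) for this group and every descendant."""
        path = f"{prefix}/{self.name}"
        yield path, self
        for child in self.children.values():
            yield from child.walk(path)


def missing_metadata(root, required=("long_name", "units")):
    """List (path, variable, attribute) for each absent required attribute."""
    problems = []
    for path, group in root.walk():
        for name, var in group.variables.items():
            if "grid_mapping_name" in var.attrs:
                continue  # CRS container variable, not a data variable
            for key in required:
                if key not in var.attrs:
                    problems.append((path, name, key))
    return problems


# Root group holds survey-wide metadata (hypothetical values).
survey = Group("survey", attrs={"title": "Hypothetical airborne survey",
                                "Conventions": "CF-1.8"})

# A tabular subgroup: point data along flight lines, with a CRS variable.
lines = survey.add(Group("flight_lines", attrs={"type": "tabular"}))
lines.variables["spatial_ref"] = Variable(
    [], {"grid_mapping_name": "latitude_longitude"})
lines.variables["tmi"] = Variable(
    [51234.1, 51236.8],
    {"long_name": "total magnetic intensity", "units": "nT",
     "grid_mapping": "spatial_ref"})

# A raster subgroup: gridded data; "units" is left out on purpose.
grid = survey.add(Group("gridded_map", attrs={"type": "raster"}))
grid.variables["tmi_grid"] = Variable(
    [[51230.0, 51240.0]], {"long_name": "gridded total magnetic intensity"})

layout = [path for path, _ in survey.walk()]
problems = missing_metadata(survey)
print(layout)
print(problems)
```

The check flags the raster variable that lacks `units`, which is the kind of omission a metadata standard is meant to prevent. With Xarray, each subgroup would typically be an `xarray.Dataset` written into the same file through the `group` argument of `Dataset.to_netcdf`; how GSPy itself names and nests these groups is not specified in the abstract.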
