4.6 Article

The Materials Data Facility: Data Services to Advance Materials Science Research

Journal

JOM
Volume 68, Issue 8, Pages 2045-2052

Publisher

SPRINGER
DOI: 10.1007/s11837-016-2001-3

Keywords

Materials; data publication; data management; data preservation; software as a service

Funding

  1. University of Chicago (UC)
  2. Argonne National Laboratory (ANL)
  3. National Center for Supercomputing Applications (UIUC)
  4. NIST
  5. DOE [DE-AC02-06CH11357]
  6. NIST as part of the CHiMAD project - U.S. Department of Commerce, National Institute of Standards and Technology [70NANB14H012]

Ask authors/readers for more resources

With increasingly strict data management requirements from funding agencies and institutions, expanding focus on the challenges of research replicability, and growing data sizes and heterogeneity, new data needs are emerging in the materials community. The materials data facility (MDF) operates two cloud-hosted services, data publication and data discovery, with features to promote open data sharing, self-service data publication and curation, and encourage data reuse, layered with powerful data discovery tools. The data publication service simplifies the process of copying data to a secure storage location, assigning data a citable persistent identifier, and recording custom (e.g., material, technique, or instrument specific) and automatically-extracted metadata in a registry while the data discovery service will provide advanced search capabilities (e.g., faceting, free text range querying, and full text search) against the registered data and metadata. The MDF services empower individual researchers, research projects, and institutions to (I) publish research datasets, regardless of size, from local storage, institutional data stores, or cloud storage, without involvement of third-party publishers; (II) build, share, and enforce extensible domain-specific custom metadata schemas; (III) interact with published data and metadata via representational state transfer (REST) application program interfaces (APIs) to facilitate automation, analysis, and feedback; and (IV) access a data discovery model that allows researchers to search, interrogate, and eventually build on existing published data. We describe MDF's design, current status, and future plans.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available