Article

Developing a standardized but extendable framework to increase the findability of infectious disease datasets

Journal

SCIENTIFIC DATA
Volume 10, Issue 1

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41597-023-01968-9

Biomedical datasets are facing challenges in FAIRness due to their increasing size and storage in multiple repositories. A consortium of infectious disease researchers aimed to improve FAIRness and created a reusable metadata schema based on Schema.org to catalog datasets and computational tools. This approach enabled data discovery, increased dataset reusability, and accelerated research progress.
Biomedical datasets are increasing in size, are stored in many repositories, and face challenges in FAIRness (findability, accessibility, interoperability, reusability). As a Consortium of infectious disease researchers from 15 Centers, we aim to adopt open science practices to promote transparency, encourage reproducibility, and accelerate research advances through data reuse. To improve the FAIRness of our datasets and computational tools, we evaluated metadata standards across established biomedical data repositories. The vast majority do not adhere to a single standard, such as Schema.org, which is widely adopted by generalist repositories. Consequently, datasets in these repositories are not findable in aggregation projects like Google Dataset Search. We addressed this gap by creating a reusable metadata schema based on Schema.org and cataloguing nearly 400 datasets and computational tools that we collected. The approach is easily reusable to create schemas that are interoperable with community standards but customized to a particular context. Our approach enabled data discovery, increased the reusability of datasets from a large research consortium, and accelerated research. Lastly, we discuss ongoing challenges with FAIRness beyond discoverability.
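
To make the idea of a Schema.org-based dataset record concrete, the following is a minimal sketch in Python. It is not the Consortium's actual schema. The property names (`name`, `description`, `identifier`, `url`, `keywords`, `measurementTechnique`) are standard Schema.org `Dataset` properties, while the helper functions, the example values, and the `example.org` URLs are hypothetical. The check on `name` and `description` assumes the two properties that Google's dataset structured-data guidelines list as required.

```python
import json


def build_dataset_record(name, description, identifier, url, keywords,
                         measurement_technique=None):
    """Build a Schema.org Dataset record as a JSON-LD-style dictionary.

    Hypothetical helper for illustration; the keys are standard
    Schema.org Dataset properties.
    """
    record = {
        "@context": "https://schema.org/",
        "@type": "Dataset",
        "name": name,
        "description": description,
        "identifier": identifier,
        "url": url,
        "keywords": list(keywords),
    }
    # Optional property: only emitted when a value is supplied.
    if measurement_technique:
        record["measurementTechnique"] = measurement_technique
    return record


def missing_required(record, required=("name", "description")):
    """Return the required properties that are absent or empty."""
    return [key for key in required if not record.get(key)]


if __name__ == "__main__":
    # All values below are invented placeholders, not real datasets.
    example = build_dataset_record(
        name="Example serology dataset",
        description="Hypothetical antibody measurements from an example cohort.",
        identifier="example-dataset-001",
        url="https://example.org/datasets/example-dataset-001",
        keywords=["infectious disease", "serology"],
        measurement_technique="ELISA",
    )
    print("Missing required properties:", missing_required(example))
    # Serialized JSON-LD that could be embedded in a dataset landing page.
    print(json.dumps(example, indent=2))
```

Embedding the serialized record in a dataset's landing page is the usual way such metadata is exposed to aggregators; a context-specific schema can add or constrain properties while keeping these shared keys.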
