Article

Best practices for building and curating databases for comparative analyses

Journal

JOURNAL OF EXPERIMENTAL BIOLOGY
Volume 225, Issue -, Pages -

Publisher

COMPANY BIOLOGISTS LTD
DOI: 10.1242/jeb.243295

Keywords

Biomechanics; Morphology; Open science; Phylogenetics; Physiology; Systematic review

Funding

  1. Natural Sciences and Engineering Research Council of Canada (NSERC) postdoctoral fellowship
  2. Andalusian Government

This article introduces the building of databases for comparative analyses and highlights the importance and challenges of database construction. Its key tips are to strategize the literature search carefully, structure databases for multiple uses, establish version control, and make databases accessible. The authors also argue that curating databases broader than a single research question can increase efficiency, and they suggest establishing community curation of databases.
Comparative analyses have a long history of macro-ecological and evolutionary approaches to understand structure, function, mechanism and constraint. As the pace of science accelerates, there is ever-increasing access to diverse types of data and open access databases that are enabling and inspiring new research. Whether conducting a species-level trait-based analysis or a formal meta-analysis of study effect sizes, comparative approaches share a common reliance on reliable, carefully curated databases. Unlike many scientific endeavors, building a database is a process that many researchers undertake infrequently and in which we are not formally trained. This Commentary provides an introduction to building databases for comparative analyses and highlights challenges and solutions that the authors of this Commentary have faced in their own experiences. We focus on four major tips: (1) carefully strategizing the literature search; (2) structuring databases for multiple use; (3) establishing version control within (and beyond) your study; and (4) the importance of making databases accessible. We highlight how one's approach to these tasks often depends on the goal of the study and the nature of the data. Finally, we assert that the curation of single-question databases has several disadvantages: it limits the possibility of using databases for multiple purposes and decreases efficiency due to independent researchers repeatedly sifting through large volumes of raw information. We argue that curating databases that are broader than one research question can provide a large return on investment, and that research fields could increase efficiency if community curation of databases was established.
