4.5 Article

Best practices for building and curating databases for comparative analyses

Journal

JOURNAL OF EXPERIMENTAL BIOLOGY
Volume 225, Issue -, Pages -

Publisher

COMPANY BIOLOGISTS LTD
DOI: 10.1242/jeb.243295

Keywords

Biomechanics; Morphology; Open science; Phylogenetics; Physiology; Systematic review

Categories

Funding

  1. Natural Sciences Engineering Research Council of Canada (NSERC) postdoctoral fellowship
  2. Andalusian Government

Ask authors/readers for more resources

This article provides an introduction to building databases for comparative analyses and highlights the importance and challenges of database construction. The key tips include carefully strategizing the literature search, structuring databases for multiple use, establishing version control, and emphasizing the importance of making databases accessible. Furthermore, the authors argue that curating databases with broader scopes can increase efficiency and suggest the establishment of community curation for databases.
Comparative analyses have a long history of macro-ecological and evolutionary approaches to understand structure, function, mechanism and constraint. As the pace of science accelerates, there is ever-increasing access to diverse types of data and open access databases that are enabling and inspiring new research. Whether conducting a species-level trait-based analysis or a formal meta-analysis of study effect sizes, comparative approaches share a common reliance on reliable, carefully curated databases. Unlike many scientific endeavors, building a database is a process that many researchers undertake infrequently and in which we are not formally trained. This Commentary provides an introduction to building databases for comparative analyses and highlights challenges and solutions that the authors of this Commentary have faced in their own experiences. We focus on four major tips: (1) carefully strategizing the literature search; (2) structuring databases for multiple use; (3) establishing version control within (and beyond) your study; and (4) the importance of making databases accessible. We highlight how one's approach to these tasks often depends on the goal of the study and the nature of the data. Finally, we assert that the curation of single-question databases has several disadvantages: it limits the possibility of using databases for multiple purposes and decreases efficiency due to independent researchers repeatedly sifting through large volumes of raw information. We argue that curating databases that are broader than one research question can provide a large return on investment, and that research fields could increase efficiency if community curation of databases was established.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available