4.8 Article

HumanMetagenomeDB: a public repository of curated and standardized metadata for human metagenomes

Journal

NUCLEIC ACIDS RESEARCH
Volume 49, Issue D1, Pages D743-D750

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkaa1031

Keywords

-

Funding

  1. Fundacao de Amparo a Pesquisa do Estado de Sao Paulo (FAPESP) [2019/03396-9]
  2. Helmholtz Association (Germany) through the Young Investigator Group [VH-NG1248]
  3. FAPESP [2013/07375-0]
  4. Helmholtz Young Investigator Grant [VH-NG1248]
  5. Fundacao de Amparo a Pesquisa do Estado de Sao Paulo (FAPESP) [19/03396-9] Funding Source: FAPESP

Ask authors/readers for more resources

Metagenomics has become a standard strategy for understanding the functional potential of microbial communities, with an increasing number of metagenomes in public repositories. However, challenges arise from misannotated metadata in these repositories. HumanMetagenomeDB aims to simplify the identification and utilization of public human metagenomes, containing metadata from nearly 70,000 metagenomes across multiple countries, diagnoses, and age ranges.
Metagenomics became a standard strategy to comprehend the functional potential of microbial communities, including the human microbiome. Currently, the number of metagenomes in public repositories is increasing exponentially. The Sequence Read Archive (SRA) and the MG-RAST are the two main repositories for metagenomic data. These databases allow scientists to reanalyze samples and explore new hypotheses. However, mining samples from them can be a limiting factor, since the metadata available in these repositories is often misannotated, misleading, and decentralized, creating an overly complex environment for sample reanalysis. The main goal of the HumanMetagenomeDB is to simplify the identification and use of public human metagenomes of interest. Human-MetagenomeDB version 1.0 contains metadata of 69 822 metagenomes. We standardized 203 attributes, based on standardized ontologies, describing host characteristics (e.g. sex, age and body mass index), diagnosis information (e.g. cancer, Crohn's disease and Parkinson), location (e.g. country, longitude and latitude), sampling site (e.g. gut, lung and skin) and sequencing attributes (e.g. sequencing platform, average length and sequence quality). Further, HumanMetagenomeDB version 1.0 metagenomes encompass 58 countries, 9 main sample sites (i.e. body parts), 58 diagnoses and multiple ages, ranging from just born to 91 years old.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available