4.6 Article

Defining the Rhizobium leguminosarum Species Complex

Journal

GENES
Volume 12, Issue 1, Pages -

Publisher

MDPI
DOI: 10.3390/genes12010111

Keywords

Rhizobium; species complex; bacterial taxonomy; core genes; housekeeping genes; average nucleotide identity; speciation; genospecies

Funding

  1. Innovation Fund Denmark [4105-00007A]
  2. European Community FP7 `Legumes for the Agriculture of Tomorrow' (LEGATO) [FP7-613551]
  3. Agence Nationale de la Recherche (GrasP)
  4. Rural and Environment Science and Analytical Services (RESAS, Scotland)
  5. 'Transition paths to sustainable legume based systems in Europe' (TRUE, EC Horizon 2020) [727973]
  6. RSF [17-76-30016, 19-16-00081, VA2I/463AC06, CLU-2018-04]
  7. SERB-DST, India [YSS/2015/000149]
  8. University of Otago
  9. CSIR, India [BSC0117]
  10. CSIR-HRDG, India [21(1023)/16/EMR-II]
  11. BBSRC [BB/L024209/1]
  12. Biotechnology and Biological Sciences Research Council [BB/L024209/1] Funding Source: researchfish
  13. Russian Science Foundation [19-16-00081] Funding Source: Russian Science Foundation

Ask authors/readers for more resources

The study identified 18 distinct genospecies within the Rhizobium leguminosarum complex (Rlc), each separated by a distinct gap in ANI values around 96%. The 16S ribosomal RNA sequence in the Rlc is diverse but not able to distinguish between genospecies. Single genes are not reliable guides to the true phylogeny of bacterial strains within the Rlc.
Bacteria currently included in Rhizobium leguminosarum are too diverse to be considered a single species, so we can refer to this as a species complex (the Rlc). We have found 429 publicly available genome sequences that fall within the Rlc and these show that the Rlc is a distinct entity, well separated from other species in the genus. Its sister taxon is R. anhuiense. We constructed a phylogeny based on concatenated sequences of 120 universal (core) genes, and calculated pairwise average nucleotide identity (ANI) between all genomes. From these analyses, we concluded that the Rlc includes 18 distinct genospecies, plus 7 unique strains that are not placed in these genospecies. Each genospecies is separated by a distinct gap in ANI values, usually at approximately 96% ANI, implying that it is a 'natural' unit. Five of the genospecies include the type strains of named species: R. laguerreae, R. sophorae, R. ruizarguesonis, R. indicum and R. leguminosarum itself. The 16S ribosomal RNA sequence is remarkably diverse within the Rlc, but does not distinguish the genospecies. Partial sequences of housekeeping genes, which have frequently been used to characterize isolate collections, can mostly be assigned unambiguously to a genospecies, but alleles within a genospecies do not always form a clade, so single genes are not a reliable guide to the true phylogeny of the strains. We conclude that access to a large number of genome sequences is a powerful tool for characterizing the diversity of bacteria, and that taxonomic conclusions should be based on all available genome sequences, not just those of type strains.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available