4.6 Article

Machine Learning Prediction of Nitrification From Ammonia- and Nitrite-Oxidizer Community Structure

Journal

FRONTIERS IN MICROBIOLOGY
Volume 13, Issue -, Pages -

Publisher

FRONTIERS MEDIA SA
DOI: 10.3389/fmicb.2022.899565

Keywords

nitrifiers; machine learning; nitrification; microbiome; ammonia-oxidizers

Categories

Funding

  1. USDA AFRI grant program [2014-67019-21628]
  2. Iowa Agriculture and Home Economics Experiment Station
  3. Plant Sciences Institute at Iowa State University

Ask authors/readers for more resources

This study applied machine learning to analyze 16S rRNA sequence and nitrification potential data, aiming to accurately model nitrification and understand the role of specific taxa. The results showed that the important predictors of nitrification varied in different cropping systems and the inclusion of amoA gene abundance data did not substantially improve model performance. Additionally, several clades of nitrifiers important for nitrification in different cropping systems were identified.
Accurately modeling nitrification and understanding the role specific ammonia- or nitrite-oxidizing taxa play in it are of great interest and importance to microbial ecologists. In this study, we applied machine learning to 16S rRNA sequence and nitrification potential data from an experiment examining interactions between cropping systems and rhizosphere on microbial community assembly and nitrogen cycling processes. Given the high dimensionality of microbiome datasets, we only included nitrifers since only a few taxa are capable of ammonia and nitrite oxidation. We compared the performance of linear and nonlinear algorithms with and without qPCR measures of bacterial and archaea ammonia monooxygenase subunit A (amoA) gene abundance. Our feature selection process facilitated the identification of taxons that are most predictive of nitrification and to compare habitats. We found that Nitrosomonas and Nitrospirae were more frequently identified as important predictors of nitrification in conventional systems, whereas Thaumarchaeota were more important predictors in diversified systems. Our results suggest that model performance was not substantively improved by incorporating additional time-consuming and expensive qPCR data on amoA gene abundance. We also identified several clades of nitrifiers important for nitrification in different cropping systems, though we were unable to detect system- or rhizosphere-specific patterns in OTU-level biomarkers for nitrification. Finally, our results highlight the inherent risk of combining data from disparate habitats with the goal of increasing sample size to avoid overfitting models. This study represents a step toward developing machine learning approaches for microbiome research to identify nitrifier ecotypes that may be important for distinguishing ecotypes with defining roles in different habitats.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available