4.8 Article

A biochemically-interpretable machine learning classifier for microbial GWAS

Journal

NATURE COMMUNICATIONS
Volume 11, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41467-020-16310-9

Keywords

-

Funding

  1. NIAID grant [AI124316]
  2. NIGMS [GM102098]
  3. Novo Nordisk Foundation [NNF10CC1016517]

Ask authors/readers for more resources

Current machine learning classifiers have successfully been applied to whole-genome sequencing data to identify genetic determinants of antimicrobial resistance (AMR), but they lack causal interpretation. Here we present a metabolic model-based machine learning classifier, named Metabolic Allele Classifier (MAC), that uses flux balance analysis to estimate the biochemical effects of alleles. We apply the MAC to a dataset of 1595 drug-tested Mycobacterium tuberculosis strains and show that MACs predict AMR phenotypes with accuracy on par with mechanism-agnostic machine learning models (isoniazid AUC=0.93) while enabling a biochemical interpretation of the genotype-phenotype map. Interpretation of MACs for three antibiotics (pyrazinamide, para-aminosalicylic acid, and isoniazid) recapitulates known AMR mechanisms and suggest a biochemical basis for how the identified alleles cause AMR. Extending flux balance analysis to identify accurate sequence classifiers thus contributes mechanistic insights to GWAS, a field thus far dominated by mechanism-agnostic results. Current machine learning classifiers have been applied to whole-genome sequencing data to identify determinants of antimicrobial resistance, but they lack interpretability. Here the authors present a metabolic machine learning classifier that uses flux balance analysis to estimate the biochemical effects of alleles.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available