4.7 Article

NNAlign_MA; MHC Peptidome Deconvolution for Accurate MHC Binding Motif Characterization and Improved T-cell Epitope Predictions

期刊

MOLECULAR & CELLULAR PROTEOMICS
卷 18, 期 12, 页码 2459-2477

出版社

AMER SOC BIOCHEMISTRY MOLECULAR BIOLOGY INC
DOI: 10.1074/mcp.TIR119.001658

关键词

Bioinformatics; bioinformatics software; immunology; mass spectrometry; algorithms; antigen presentation; immunoinformatics; immunopeptidomics; machine learning

资金

  1. National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services [HHSN272201200010C]
  2. Bill and Melinda Gates Foundation [OPP1078791]
  3. Science and Technology Council of Investigation (CONICET-Argentina)
  4. Bill and Melinda Gates Foundation [OPP1078791] Funding Source: Bill and Melinda Gates Foundation
  5. BBSRC [BBS/E/D/20002174] Funding Source: UKRI

向作者/读者索取更多资源

The set of peptides presented on a cell's surface by MHC molecules is poly-specific (it contains multiple sequence motifs matching the quantity of MHC molecules expressed). NNAlign_MA can exploit this type of data, by means of: (1) clustering peptides into individual specificities; (2) automatic annotation of clusters to an MHC molecule; and (3) training of a prediction model covering all MHCs present in the training set. NNAlign_MA expands MHC allelic coverage, thus improving T-cell epitope predictions. The set of peptides presented on a cell's surface by MHC molecules is known as the immunopeptidome. Current mass spectrometry technologies allow for identification of large peptidomes, and studies have proven these data to be a rich source of information for learning the rules of MHC-mediated antigen presentation. Immunopeptidomes are usually poly-specific, containing multiple sequence motifs matching the MHC molecules expressed in the system under investigation. Motif deconvolution -the process of associating each ligand to its presenting MHC molecule(s)- is therefore a critical and challenging step in the analysis of MS-eluted MHC ligand data. Here, we describe NNAlign_MA, a computational method designed to address this challenge and fully benefit from large, poly-specific data sets of MS-eluted ligands. NNAlign_MA simultaneously performs the tasks of (1) clustering peptides into individual specificities; (2) automatic annotation of each cluster to an MHC molecule; and (3) training of a prediction model covering all MHCs present in the training set. NNAlign_MA was benchmarked on large and diverse data sets, covering class I and class II data. In all cases, the method was demonstrated to outperform state-of-the-art methods, effectively expanding the coverage of alleles for which accurate predictions can be made, resulting in improved identification of both eluted ligands and T-cell epitopes. Given its high flexibility and ease of use, we expect NNAlign_MA to serve as an effective tool to increase our understanding of the rules of MHC antigen presentation and guide the development of novel T-cell-based therapeutics.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据