☆ 4.7 Article

Current challenges for unseen-epitope TCR interaction prediction and a new perspective derived from image classification

BRIEFINGS IN BIOINFORMATICS (2021)

期刊

BRIEFINGS IN BIOINFORMATICS

卷 22, 期 4, 页码 -

出版社

OXFORD UNIV PRESS

DOI: 10.1093/bib/bbaa318

关键词

T-cell epitope prediction; T-cell receptor; epitope specificity; immunoinformatics; convolutional neural network; deep learning

类别

Biochemical Research Methods Mathematical & Computational Biology

资金

Research Foundation Flanders (FWO) [1141217N, 1S48819N, 1861219N]
University of Antwerp
Flemish Government
Research Foundation - Flanders (FWO)
Flemish Government - department EWI

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The prediction of T-cell receptor (TCR) epitope recognition has advanced in recent years, but evaluating all possible TCR-epitope pairs is still challenging due to sequence diversity and limited training data. This work provides an overview of the current state of this problem, introduces a novel feature representation approach called ImRex, and highlights challenges specific to TCR-epitope data. Results show that appropriate feature engineering methods and rigorous benchmark standards are necessary for creating and validating TCR-epitope predictive models.

The prediction of epitope recognition by T-cell receptors (TCRs) has seen many advancements in recent years, with several methods now available that can predict recognition for a specific set of epitopes. However, the generic case of evaluating all possible TCR-epitope pairs remains challenging, mainly due to the high diversity of the interacting sequences and the limited amount of currently available training data. In this work, we provide an overview of the current state of this unsolved problem. First, we examine appropriate validation strategies to accurately assess the generalization performance of generic TCR-epitope recognition models when applied to both seen and unseen epitopes. In addition, we present a novel feature representation approach, which we call ImRex (interaction map recognition). This approach is based on the pairwise combination of physicochemical properties of the individual amino acids in the CDR3 and epitope sequences, which provides a convolutional neural network with the combined representation of both sequences. Lastly, we highlight various challenges that are specific to TCR-epitope data and that can adversely affect model performance. These include the issue of selecting negative data, the imbalanced epitope distribution of curated TCR-epitope datasets and the potential exchangeability of TCR alpha and beta chains. Our results indicate that while extrapolation to unseen epitopes remains a difficult challenge, ImRex makes this feasible for a subset of epitopes that are not too dissimilar from the training data. We show that appropriate feature engineering methods and rigorous benchmark standards are required to create and validate TCR-epitope predictive models.

Current challenges for unseen-epitope TCR interaction prediction and a new perspective derived from image classification

期刊

BRIEFINGS IN BIOINFORMATICS

出版社

OXFORD UNIV PRESS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Current challenges for unseen-epitope TCR interaction prediction and a new perspective derived from image classification

期刊

BRIEFINGS IN BIOINFORMATICS

出版社

OXFORD UNIV PRESS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文