4.6 Article

Clustering based approach for population level identification of condition-associated T-cell receptor β-chain CDR3 sequences

Journal

BMC BIOINFORMATICS
Volume 22, Issue 1, Pages -

Publisher

BMC
DOI: 10.1186/s12859-021-04087-7

Keywords

TCR differential abudance analysis; Celiac disease associated TCR clonotypes; Immune repertoire analysis; TCR repertoire analysis; Immuno-informatics; Antigen-specific TCR identification; Computational antigen-specificity identification; TCR clustering

Funding

  1. Academy of Finland
  2. European Commission (Marie Curie Excellence Grant)
  3. Sigrid Juselius Foundation
  4. Competitive State Research Financing of the Expert Area of Tampere University Hospital
  5. Tekes-the Finnish Funding Agency for Technology and Innovation

Ask authors/readers for more resources

The method proposed in this study allows for direct population level comparison of RepSeq samples using shared immune repertoire sub-units among individuals. It successfully identifies condition-associated CDR3 sequences and serves as a viable unit for immune repertoire comparison in unrelated individuals. This approach provides a promising strategy for identifying disease-relevant CDR3s and understanding immune response patterns.
BackgroundDeep immune receptor sequencing, RepSeq, provides unprecedented opportunities for identifying and studying condition-associated T-cell clonotypes, represented by T-cell receptor (TCR) CDR3 sequences. However, due to the immense diversity of the immune repertoire, identification of condition relevant TCR CDR3s from total repertoires has mostly been limited to either public CDR3 sequences or to comparisons of CDR3 frequencies observed in a single individual. A methodology for the identification of condition-associated TCR CDR3s by direct population level comparison of RepSeq samples is currently lacking.ResultsWe present a method for direct population level comparison of RepSeq samples using immune repertoire sub-units (or sub-repertoires) that are shared across individuals. The method first performs unsupervised clustering of CDR3s within each sample. It then finds matching clusters across samples, called immune sub-repertoires, and performs statistical differential abundance testing at the level of the identified sub-repertoires. It finally ranks CDR3s in differentially abundant sub-repertoires for relevance to the condition. We applied the method on total TCR CDR3 beta RepSeq datasets of celiac disease patients, as well as on public datasets of yellow fever vaccination. The method successfully identified celiac disease associated CDR3 beta sequences, as evidenced by considerable agreement of TRBV-gene and positional amino acid usage patterns in the detected CDR3 beta sequences with previously known CDR3 beta s specific to gluten in celiac disease. It also successfully recovered significantly high numbers of previously known CDR3 beta sequences relevant to each condition than would be expected by chance.ConclusionWe conclude that immune sub-repertoires of similar immuno-genomic features shared across unrelated individuals can serve as viable units of immune repertoire comparison, serving as proxy for identification of condition-associated CDR3s.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available