☆ 4.7 Article

JIND: joint integration and discrimination for automated single-cell annotation

BIOINFORMATICS (2022)

期刊

BIOINFORMATICS

卷 38, 期 9, 页码 2488-2495

出版社

OXFORD UNIV PRESS

DOI: 10.1093/bioinformatics/btac140

关键词

类别

Biochemical Research Methods Biotechnology & Applied Microbiology Computer Science, Interdisciplinary Applications Mathematical & Computational Biology Statistics & Probability

资金

Ramon y Cajal Grant from the Spanish Government
Basque Government
H2020 Marie S. Curie IF Action, European Commission [898356]
National Science Foundation [CCF-2046991]
Marie Curie Actions (MSCA) [898356] Funding Source: Marie Curie Actions (MSCA)

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The article introduces a neural-network-based framework JIND for automated cell-type identification. JIND performs integration in a suitably chosen space and employs a novel asymmetric alignment to address batch effects. Results demonstrate that the joint approach of integration and classification in JIND outperforms existing pipelines in accuracy and can identify cells that cannot be reliably classified.

Motivation An important step in the transcriptomic analysis of individual cells involves manually determining the cellular identities. To ease this labor-intensive annotation of cell-types, there has been a growing interest in automated cell annotation, which can be achieved by training classification algorithms on previously annotated datasets. Existing pipelines employ dataset integration methods to remove potential batch effects between source (annotated) and target (unannotated) datasets. However, the integration and classification steps are usually independent of each other and performed by different tools. We propose JIND (joint integration and discrimination for automated single-cell annotation), a neural-network-based framework for automated cell-type identification that performs integration in a space suitably chosen to facilitate cell classification. To account for batch effects, JIND performs a novel asymmetric alignment in which unseen cells are mapped onto the previously learned latent space, avoiding the need of retraining the classification model for new datasets. JIND also learns cell-type-specific confidence thresholds to identify cells that cannot be reliably classified. Results We show on several batched datasets that the joint approach to integration and classification of JIND outperforms in accuracy existing pipelines, and a smaller fraction of cells is rejected as unlabeled as a result of the cell-specific confidence thresholds. Moreover, we investigate cells misclassified by JIND and provide evidence suggesting that they could be due to outliers in the annotated datasets or errors in the original approach used for annotation of the target batch.

JIND: joint integration and discrimination for automated single-cell annotation

期刊

BIOINFORMATICS

出版社

OXFORD UNIV PRESS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

JIND: joint integration and discrimination for automated single-cell annotation

期刊

BIOINFORMATICS

出版社

OXFORD UNIV PRESS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文