4.6 Article Proceedings Paper

Cell type discovery and representation in the era of high-content single cell phenotyping

Journal

BMC BIOINFORMATICS
Volume 18, Issue -, Pages -

Publisher

BMC
DOI: 10.1186/s12859-017-1977-1

Keywords

Cell ontology; Single cell transcriptomics; Cell phenotype; Peripheral blood mononuclear cells; Neuron; Next generation sequencing; Cytometry; Open biomedical ontologies; Marker genes

Funding

  1. Allen Institute for Brain Science
  2. JCVI Innovation Fund
  3. U.S. National Institutes of Health [R21-AI122100, U19-AI118626]
  4. California Institute for Regenerative Medicine [GC1R-06673-B]

Ask authors/readers for more resources

Background: A fundamental characteristic of multicellular organisms is the specialization of functional cell types through the process of differentiation. These specialized cell types not only characterize the normal functioning of different organs and tissues, they can also be used as cellular biomarkers of a variety of different disease states and therapeutic/vaccine responses. In order to serve as a reference for cell type representation, the Cell Ontology has been developed to provide a standard nomenclature of defined cell types for comparative analysis and biomarker discovery. Historically, these cell types have been defined based on unique cellular shapes and structures, anatomic locations, and marker protein expression. However, we are now experiencing a revolution in cellular characterization resulting from the application of new high-throughput, high-content cytometry and sequencing technologies. The resulting explosion in the number of distinct cell types being identified is challenging the current paradigm for cell type definition in the Cell Ontology. Results: In this paper, we provide examples of state-of-the-art cellular biomarker characterization using highcontent cytometry and single cell RNA sequencing, and present strategies for standardized cell type representations based on the data outputs from these cutting-edge technologies, including context annotations in the form of standardized experiment metadata about the specimen source analyzed and marker genes that serve as the most useful features in machine learning-based cell type classification models. We also propose a statistical strategy for comparing new experiment data to these standardized cell type representations. Conclusion: The advent of high-throughput/high-content single cell technologies is leading to an explosion in the number of distinct cell types being identified. It will be critical for the bioinformatics community to develop and adopt data standard conventions that will be compatible with these new technologies and support the data representation needs of the research community. The proposals enumerated here will serve as a useful starting point to address these challenges.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available