4.5 Article

Cell morphology-based machine learning models for human cell state classification

Journal

NPJ SYSTEMS BIOLOGY AND APPLICATIONS
Volume 7, Issue 1, Pages -

Publisher

NATURE RESEARCH
DOI: 10.1038/s41540-021-00180-y

Keywords

-

Funding

  1. US National Science Foundation (NSF) CAREER grant [1351354]
  2. NSF [1361355, 2029121]
  3. Cecil H. and Ida Green Endowment
  4. University of Texas at Dallas
  5. Directorate For Engineering
  6. Div Of Chem, Bioeng, Env, & Transp Sys [1351354] Funding Source: National Science Foundation
  7. Division Of Mathematical Sciences
  8. Direct For Mathematical & Physical Scien [1361355] Funding Source: National Science Foundation
  9. Div Of Molecular and Cellular Bioscience
  10. Direct For Biological Sciences [2029121] Funding Source: National Science Foundation

Ask authors/readers for more resources

In this study, machine learning architectures were employed to classify healthy and apoptotic cells using only forward and side scatter flow cytometry information. By training the models, automated and reliable classification of healthy and apoptotic cells based on size and granularity information was achieved.
Herein, we implement and access machine learning architectures to ascertain models that differentiate healthy from apoptotic cells using exclusively forward (FSC) and side (SSC) scatter flow cytometry information. To generate training data, colorectal cancer HCT116 cells were subjected to miR-34a treatment and then classified using a conventional Annexin V/propidium iodide (PI)-staining assay. The apoptotic cells were defined as Annexin V-positive cells, which include early and late apoptotic cells, necrotic cells, as well as other dying or dead cells. In addition to fluorescent signal, we collected cell size and granularity information from the FSC and SSC parameters. Both parameters are subdivided into area, height, and width, thus providing a total of six numerical features that informed and trained our models. A collection of logistical regression, random forest, k-nearest neighbor, multilayer perceptron, and support vector machine was trained and tested for classification performance in predicting cell states using only the six aforementioned numerical features. Out of 1046 candidate models, a multilayer perceptron was chosen with 0.91 live precision, 0.93 live recall, 0.92 live f value and 0.97 live area under the ROC curve when applied on standardized data. We discuss and highlight differences in classifier performance and compare the results to the standard practice of forward and side scatter gating, typically performed to select cells based on size and/or complexity. We demonstrate that our model, a ready-to-use module for any flow cytometry-based analysis, can provide automated, reliable, and stain-free classification of healthy and apoptotic cells using exclusively size and granularity information.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available