4.6 Article

A Computational Approach to Identification of Candidate Biomarkers in High-Dimensional Molecular Data

Journal

DIAGNOSTICS
Volume 12, Issue 8, Pages -

Publisher

MDPI
DOI: 10.3390/diagnostics12081997

Keywords

biomarker; feature selection; big data analysis; RNA-Seq; prostate adenocarcinoma

Funding

  1. SEAMO AHSC Innovation Fund
  2. NSERC [468240]

Ask authors/readers for more resources

This study introduces a novel molecular feature selection tool for identifying potential biomarkers in '-omics' data. Genes identified with this tool showed superior performance in identifying prostate adenocarcinoma subtypes with a significantly smaller set of genes. Additionally, a user-friendly graphical interface for the tool is available for free download, providing a valuable asset for biomarker discovery studies.
Complex high-dimensional datasets that are challenging to analyze are frequently produced through '-omics' profiling. Typically, these datasets contain more genomic features than samples, limiting the use of multivariable statistical and machine learning-based approaches to analysis. Therefore, effective alternative approaches are urgently needed to identify features-of-interest in '-omics' data. In this study, we present the molecular feature selection tool, a novel, ensemble-based, feature selection application for identifying candidate biomarkers in '-omics' data. As proof-of-principle, we applied the molecular feature selection tool to identify a small set of immune-related genes as potential biomarkers of three prostate adenocarcinoma subtypes. Furthermore, we tested the selected genes in a model to classify the three subtypes and compared the results to models built using all genes and all differentially expressed genes. Genes identified with the molecular feature selection tool performed better than the other models in this study in all comparison metrics: accuracy, precision, recall, and F1-score using a significantly smaller set of genes. In addition, we developed a simple graphical user interface for the molecular feature selection tool, which is available for free download. This user-friendly interface is a valuable tool for the identification of potential biomarkers in gene expression datasets and is an asset for biomarker discovery studies.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available