4.7 Article

Identifying Gene Signatures for Cancer Drug Repositioning Based on Sample Clustering

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2020.3019781

Keywords

Drugs; Cancer; Databases; Clustering algorithms; Gene expression; Perturbation methods; Drug repositioning; sample clustering; gene signature; gene expression; drug perturbation

Funding

  1. Natural Science and Engineering Research Council of Canada (NSERC), China Scholarship Council (CSC)
  2. National Natural Science Foundation of China [61772552, 61428209]

Ask authors/readers for more resources

Drug repositioning is crucial for drug discovery, and this study proposes a new method, GS4CDRSC, based on sample clustering to identify gene signatures for cancer drug repositioning, which outperforms existing methods in predicting known drugs for specific cancers.
Drug repositioning is an important approach for drug discovery. Computational drug repositioning approaches typically use a gene signature to represent a particular disease and connect the gene signature with drug perturbation profiles. Although disease samples, especially from cancer, may be heterogeneous, most existing methods consider them as a homogeneous set to identify differentially expressed genes (DEGs)for further determining a gene signature. As a result, some genes that should be in a gene signature may be averaged off. In this study, we propose a new framework to identify gene signatures for cancer drug repositioning based on sample clustering (GS4CDRSC). GS4CDRSC first groups samples into several clusters based on their gene expression profiles. Second, an existing method is applied to the samples in each cluster for generating a list of DEGs. Then a weighting approach is used to identify an intergrated gene signature from all the lists of DEGs. The integrated gene signature is used to connect with drug perturbation profiles in the Connectivity Map (CMap)database to generate a list of drug candidates. GS4CDRSC has been tested with several cancer datasets and existing methods. The computational results show that GS4CDRSC outperforms those methods without the sample clustering and weighting approaches in terms of both number and rate of predicted known drugs for specific cancers.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available