4.7 Article

PAN: Personalized Annotation-Based Networks for the Prediction of Breast Cancer Relapse

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2021.3076422

Keywords

Annotations; Gene expression; Databases; Breast cancer; Ontologies; Genomics; Sociology; Personalized medicine; annotation-based networks; gene expression; breast cancer

The manuscript introduces a novel method called PAN that transforms gene expression data into personalized networks efficiently. PAN classifiers show superior performance in predicting cancer relapse compared to gene features alone and outperform other graph-based classifiers. The study demonstrates the practical advantages of graph-based classification for high-dimensional genomic data.
The classification of clinical samples based on gene expression data is an important part of precision medicine. In this manuscript, we show how transforming gene expression data into a set of personalized (sample-specific) networks allows us to harness existing graph-based methods to improve classifier performance. Existing approaches to personalized gene networks are limited in that they depend on the other samples in the data and must be recomputed whenever a new sample is introduced. Here, we propose a novel method, called Personalized Annotation-based Networks (PAN), that avoids this limitation by using curated annotation databases to transform gene expression data into a graph. Unlike competing methods, a PAN is calculated for each sample independently of the population, which makes it a more efficient way to obtain single-sample networks. Using three breast cancer datasets as a case study, we show that PAN classifiers not only predict cancer relapse better than gene features alone, but also outperform PPI (protein-protein interaction) and population-level graph-based classifiers. This work demonstrates the practical advantages of graph-based classification for high-dimensional genomic data, while offering a new approach to constructing sample-specific networks.
Supplementary information: PAN and the baselines are implemented in Python. Source code and data are available at https://github.com/thinng/PAN.
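The abstract does not spell out how a PAN is constructed, so the following is only a minimal sketch of the general idea: one sample's expression values plus a gene-to-annotation mapping yield a graph without reference to any other sample. Everything here is an assumption made for illustration: the function name `build_sample_network`, the toy annotation terms, the use of mean expression as a node weight, and the rule that two terms are linked when they share a measured gene. None of it is taken from the authors' implementation, which is available at the repository linked above.

```python
from itertools import combinations


def build_sample_network(expression, annotations):
    """Hypothetical sketch: build an annotation-level graph from ONE sample.

    expression:  dict mapping gene -> expression value for a single sample
    annotations: dict mapping annotation term -> set of genes
                 (a toy stand-in for a curated annotation database)
    Returns (nodes, edges), both plain dicts.
    """
    # Node weight (assumed rule): mean expression of the term's measured genes.
    nodes = {}
    for term, genes in annotations.items():
        measured = [expression[g] for g in genes if g in expression]
        if measured:
            nodes[term] = sum(measured) / len(measured)

    # Edge (assumed rule): link two terms that share at least one measured
    # gene; the weight is the mean expression of the shared genes.
    edges = {}
    for a, b in combinations(sorted(nodes), 2):
        shared = [g for g in annotations[a] & annotations[b] if g in expression]
        if shared:
            edges[(a, b)] = sum(expression[g] for g in shared) / len(shared)
    return nodes, edges


# Toy inputs (invented for illustration, not real annotation data).
annotations = {
    "term_A": {"g1", "g2"},
    "term_B": {"g2", "g3"},
    "term_C": {"g4"},
}
sample = {"g1": 2.0, "g2": 4.0, "g3": 6.0, "g4": 1.0}

nodes, edges = build_sample_network(sample, annotations)
print(nodes)
print(edges)
```

Because the function reads only the single sample and the fixed annotation mapping, adding a new sample to a dataset would not require recomputing the networks of existing samples, which is the property the abstract emphasizes.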
