4.7 Article

Adversarial generation of gene expression data

Journal

BIOINFORMATICS
Volume 38, Issue 3, Pages 730-737

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btab035

Keywords

-

Funding

  1. 'la Caixa' Foundation [100010434, LCF/BQ/EU19/11710059]

Ask authors/readers for more resources

The study developed a method based on conditional generative adversarial networks to generate realistic transcriptomics data for Escherichia coli and humans. Results showed that the approach performed better in preserving gene expression properties compared to existing simulators, maintaining tissue- and cancer-specific attributes, and exhibiting real gene clusters and ontologies at different scales.
Motivation: High-throughput gene expression can be used to address a wide range of fundamental biological problems, but datasets of an appropriate size are often unavailable. Moreover, existing transcriptomics simulators have been criticized because they fail to emulate key properties of gene expression data. In this article, we develop a method based on a conditional generative adversarial network to generate realistic transcriptomics data for Escherichia coli and humans. We assess the performance of our approach across several tissues and cancer-types. Results: We show that our model preserves several gene expression properties significantly better than widely used simulators, such as SynTReN or GeneNetWeaver. The synthetic data preserve tissue- and cancer-specific properties of transcriptomics data. Moreover, it exhibits real gene clusters and ontologies both at local and global scales, suggesting that the model learns to approximate the gene expression manifold in a biologically meaningful way.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available