4.7 Article

Identifying multi-layer gene regulatory modules from multi-dimensional genomic data

Journal

BIOINFORMATICS
Volume 28, Issue 19, Pages 2458-2466

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/bts476

Keywords

-

Funding

  1. National Institutes of Health [R01GM074163]
  2. National Science Foundation [0747475]
  3. Alfred P. Sloan Fellowship
  4. National Natural Science Foundation of China [11001256]
  5. 'Special Presidential Prize'-Scientific Research Foundation of the CAS
  6. Special Foundation of President of AMSS at CAS for 'Chen Jing-Run' Future Star Program
  7. Foundation for Members of Youth Innovation Promotion Association, CAS
  8. Direct For Biological Sciences
  9. Div Of Biological Infrastructure [0747475] Funding Source: National Science Foundation

Ask authors/readers for more resources

Motivation: Eukaryotic gene expression (GE) is subjected to precisely coordinated multi-layer controls, across the levels of epigenetic, transcriptional and post-transcriptional regulations. Recently, the emerging multi-dimensional genomic dataset has provided unprecedented opportunities to study the cross-layer regulatory interplay. In these datasets, the same set of samples is profiled on several layers of genomic activities, e. g. copy number variation (CNV), DNA methylation (DM), GE and microRNA expression (ME). However, suitable analysis methods for such data are currently sparse. Results: In this article, we introduced a sparse Multi-Block Partial Least Squares (sMBPLS) regression method to identify multi-dimensional regulatory modules from this new type of data. A multi-dimensional regulatory module contains sets of regulatory factors from different layers that are likely to jointly contribute to a local 'gene expression factory'. We demonstrated the performance of our method on the simulated data as well as on The Cancer Genomic Atlas Ovarian Cancer datasets including the CNV, DM, ME and GE data measured on 230 samples. We showed that majority of identified modules have significant functional and transcriptional enrichment, higher than that observed in modules identified using only a single type of genomic data. Our network analysis of the modules revealed that the CNV, DM and microRNA can have coupled impact on expression of important oncogenes and tumor suppressor genes.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available