Journal
ANNALS OF APPLIED STATISTICS
Volume 4, Issue 2, Pages 663-686Publisher
INST MATHEMATICAL STATISTICS
DOI: 10.1214/10-AOAS350
Keywords
Transcription regulation networks; L-1 penalty; E. coli; sparse network
Categories
Funding
- NSF [DMS-07-05312, DMS-09-06784, DMS-07-05532, DMS-07-48389]
- NIH/NIGMS [GM053275-14]
- Direct For Mathematical & Physical Scien [0748389] Funding Source: National Science Foundation
- Direct For Mathematical & Physical Scien
- Division Of Mathematical Sciences [0906784] Funding Source: National Science Foundation
- Division Of Mathematical Sciences [0748389] Funding Source: National Science Foundation
Ask authors/readers for more resources
In many organisms the expression levels of each gene are controlled by the activation levels of known Transcription Factors (TF). A problem of considerable interest is that of estimating the Transcription Regulation Networks (TRN) relating the TFs and genes. While the expression levels of genes can be observed, the activation levels of the corresponding TFs are usually unknown, greatly increasing the difficulty of the problem. Based on previous experimental work, it is often the case that partial information about the TRN is available. For example, certain TFs may be known to regulate a given gene or in other cases a connection may be predicted with a certain probability. In general, the biology of the problem indicates there will be very few connections between TFs and genes. Several methods have been proposed for estimating TRNs. However, they all suffer from problems such as unrealistic assumptions about prior knowledge of the network structure or computational limitations. We propose a new approach that can directly utilize prior information about the network structure in conjunction with observed gene expression data to estimate the TRN. Our approach uses L-1 penalties on the network to ensure a sparse structure. This has the advantage of being cornputationally efficient as well as making many fewer assumptions about the network structure. We use our methodology to construct the TRN for E. coli and show that the estimate is biologically sensible and compares favorably with previous estimates.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available