4.7 Article

Computational Modeling of Gene-Specific Transcriptional Repression, Activation and Chromatin Interactions in Leukemogenesis by LASSO-Regularized Logistic Regression

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2021.3078128

Keywords

Proteins; Logistics; Machine learning; Receptor (biochemistry); Regulation; Training data; Regression analysis; Logistic regression; LASSO; machine learning; regulated transcription; repression; activation; AML1-ETO

Funding

  1. NIH [R01HL093195, R21CA178513]
  2. Saint Louis University
  3. Siteman Investment Program Award (Siteman Cancer Center at Barnes-Jewish and Washington University)

Ask authors/readers for more resources

Many physiological and pathological pathways rely on gene-specific transcriptional regulation, and the use of LASSO-regularized logistic regression successfully predicted gene-specific transcriptional events and identified rate-limiting factors underlying gene activation and repression differences.
Many physiological and pathological pathways are dependent on gene-specific on/off regulation of transcription. Some genes are repressed, while others are activated. Although many previous studies have analyzed the mechanisms of gene-specific repression and activation, these studies are mainly based on the use of candidate genes, which are either repressed or activated, without simultaneously comparing and contrasting both groups of genes. There is also insufficient consideration of gene locations. Here we describe an integrated machine learning approach, using LASSO-regularized logistic regression, to model gene-specific repression and activation and the underlying contribution of chromatin interactions. LASSO-regularized logistic regression accurately predicted gene-specific transcriptional events and robustly detected the rate-limiting factors that underlie the differences of gene activation and repression. An example was provided by the leukemogenic transcription factor AML1-ETO, which is responsible for 10-15 percent of all acute myeloid leukemia cases. The analysis of AML1-ETO has also revealed novel networks of chromatin interactions and uncovered an unexpected role for E-proteins in AML1-ETO-p300 interactions and a role for the pre-existing gene state in governing the transcriptional response. Our results show that logistic regression-based probabilistic modeling is a promising tool to decipher mechanisms that integrate gene regulation and chromatin interactions in regulated transcription.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available