4.8 Article

Modeling multi-species RNA modification through multi-task curriculum learning

Journal

NUCLEIC ACIDS RESEARCH
Volume 49, Issue 7, Pages 3719-3734

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkab124

Keywords

-

Funding

  1. National Natural Science Foundation of China [61872216, 81630103, 61772197, 31900862]
  2. Turing AI Institute of Nanjing
  3. National Key Research and Development Program of China [2018YFC0910404]
  4. ZhongguancunHaihua Institute for Frontier Information Technology

Ask authors/readers for more resources

N-6-methyladenosine (m(6)A) is the most prevalent modification in eukaryotic mRNAs, regulating various biological processes. The computational framework MASS, based on multi-task curriculum learning, outperforms existing methods in capturing m(6)A features across multiple species. MASS also helps elucidate m(6)A similarities and differences among species and their relationships with gene regulation properties.
N-6-methyladenosine (m(6)A) is the most pervasive modification in eukaryotic mRNAs. Numerous biological processes are regulated by this critical post-transcriptional mark, such as gene expression, RNA stability, RNA structure and translation. Recently, various experimental techniques and computational methods have been developed to characterize the transcriptome-wide landscapes of m(6)A modification for understanding its underlying mechanisms and functions in mRNA regulation. However, the experimental techniques are generally costly and time-consuming, while the existing computational models are usually designed only for m(6)A site prediction in a single-species and have significant limitations in accuracy, interpretability and generalizability. Here, we propose a highly interpretable computational framework, called MASS, based on a multi-task curriculum learning strategy to capture m(6)A features across multiple species simultaneously. Extensive computational experiments demonstrate the superior performances of MASS when compared to the state-of-the-art prediction methods. Furthermore, the contextual sequence features of m(6)A captured by MASS can be explained by the known critical binding motifs of the related RNA-binding proteins, which also help elucidate the similarity and difference among m(6)A features across species. In addition, based on the predicted m(6)A profiles, we further delineate the relationships between m(6)A and various properties of gene regulation, including gene expression, RNA stability, translation, RNA structure and histone modification. In summary, MASS may serve as a useful tool for characterizing m(6)A modification and studying its regulatory code.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available