QuantifyPoly(A): reshaping alternative polyadenylation landscapes of eukaryotes with weighted density peak clustering

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 22, Issue 6

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbab268

Keywords

density peak clustering; microheterogeneity; 3' end sequencing; polyadenylation

Funding

  1. National Natural Science Foundation of China [61802323, 32000448, 61871463]
  2. Natural Science Foundation of Fujian Province of China [2020J01047]
  3. Fundamental Research Funds for the Central Universities in China (Xiamen University) [20720200116]

Alternative polyadenylation, the dynamic choice of different polyadenylation sites in a gene, plays important roles in various biological processes. The QuantifyPoly(A) method accurately quantifies genome-wide polyadenylation choices and reshapes polyadenylation profiles into novel clusters, revealing dynamic usage across biological samples and species specificity.
The dynamic choice among different polyadenylation sites in a gene is referred to as alternative polyadenylation, which functions in many important biological processes. Large-scale messenger RNA 3' end sequencing has revealed that cleavage sites for polyadenylation exhibit microheterogeneity. To date, the conventional determination of polyadenylation site clusters has been subjective and arbitrary, leading to inaccurate annotations. Here, we present a weighted density peak clustering method, QuantifyPoly(A), to accurately quantify genome-wide polyadenylation choices. When QuantifyPoly(A) is applied to published 3' end sequencing datasets from both animals and plants, their polyadenylation profiles are reshaped into myriad novel polyadenylation site clusters. Most of these novel clusters show significantly dynamic usage across different biological samples or are associated with binding sites of trans-acting factors. Upstream sequences of these clusters are enriched with the polyadenylation signals UGUA, UAAA and/or AAUAAA in a species-dependent manner. Polyadenylation site clusters also exhibit species specificity, with plant clusters generally showing higher microheterogeneity than animal clusters. QuantifyPoly(A) is broadly applicable to any type of 3' end sequencing data and any species for accurate quantification and construction of the complex and dynamic polyadenylation landscape, and it enables decoding, at much higher resolution, of alternative polyadenylation events invisible to conventional methods.
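
To make the idea of weighted density peak clustering concrete, the following is a minimal Python sketch of the general technique applied to cleavage-site positions along one gene. It is not the QuantifyPoly(A) implementation: the function name `weighted_density_peaks`, the Gaussian kernel, the parameters `d_c` and `min_delta`, their default values, and the toy data are all assumptions made for illustration. Each site's local density is the sum of read counts at nearby sites, damped by distance; a site becomes a cluster center when it lies far from every site of higher density, and every other site joins the cluster of its nearest higher-density neighbor.

```python
import math

def weighted_density_peaks(positions, counts, d_c=12.0, min_delta=24.0):
    """Cluster 1D cleavage-site positions (hypothetical helper).

    positions: genomic coordinates of cleavage sites
    counts:    read counts supporting each site (used as weights)
    d_c:       kernel width in nucleotides (assumed value)
    min_delta: minimum distance to a denser site for a new center (assumed value)
    Returns (labels, centers), where centers holds indices into positions.
    """
    n = len(positions)
    # Weighted local density: every site contributes its read count,
    # damped by a Gaussian kernel of the distance to site i.
    rho = [
        sum(counts[j] * math.exp(-((positions[i] - positions[j]) / d_c) ** 2)
            for j in range(n))
        for i in range(n)
    ]
    # Visit sites from densest to sparsest (ties broken by index).
    order = sorted(range(n), key=lambda i: (-rho[i], i))
    delta = [0.0] * n   # distance to the nearest site of higher density
    parent = [-1] * n   # index of that nearest denser site
    for rank, i in enumerate(order):
        if rank == 0:
            delta[i] = float("inf")  # the global density peak has no denser site
            continue
        j = min(order[:rank], key=lambda k: abs(positions[i] - positions[k]))
        delta[i] = abs(positions[i] - positions[j])
        parent[i] = j
    # Sites far from any denser site start a new cluster; the rest inherit
    # the label of their nearest denser neighbor, which is already labeled.
    labels = [-1] * n
    centers = []
    for i in order:
        if delta[i] >= min_delta:
            labels[i] = len(centers)
            centers.append(i)
        else:
            labels[i] = labels[parent[i]]
    return labels, centers

# Toy data: two groups of microheterogeneous cleavage sites.
positions = [100, 102, 105, 108, 300, 303, 306]
counts = [5, 40, 12, 3, 2, 25, 4]
labels, centers = weighted_density_peaks(positions, counts)
print(labels)                           # cluster label per site
print([positions[c] for c in centers])  # representative site per cluster
```

In this toy input the sites near 102 and near 303 fall into two clusters, each represented by its most strongly supported position. The sketch selects centers by the distance criterion alone, which is a simplification; density peak clustering as usually described also requires a center to have high density, and the published method may choose its kernel and thresholds differently.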
