4.7 Article

PolyAtailor: measuring poly(A) tail length from short-read and long-read sequencing data

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 23, Issue 4, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbac271

Keywords

polyadenylation tail; alternative polyadenylation; next generation sequencing; third-generation sequencing; RNA processing; software

Funding

  1. [61871463toX]

Ask authors/readers for more resources

Poly(A) tails play an essential role in regulating gene expression, and there is currently a lack of easy-to-use tools for measuring poly(A) tails in different sequencing protocols. In this study, we developed PolyAtailor, a unified and efficient framework for identifying and analyzing poly(A) tails, and compared its performance with other methods.
The poly(A) tail is a dynamic addition to the eukaryotic mRNA and the change in its length plays an essential role in regulating gene expression through affecting nuclear export, mRNA stability and translation. Only recently high-throughput sequencing strategies began to emerge for transcriptome-wide profiling of poly(A) tail length in diverse developmental stages and organisms. However, there is currently no easy-to-use and universal tool for measuring poly(A) tails in sequencing data from different sequencing protocols. Here we established PolyAtailor, a unified and efficient framework, for identifying and analyzing poly(A) tails from PacBio-based long reads or next generation short reads. PolyAtailor provides two core functions for measuring poly(A) tails, namely Tail_map and Tail_scan, which can be used for profiling tails with or without using a reference genome. Particularly, PolyAtailor can identify all potential tails in a read, providing users with detailed information such as tail position, tail length, tail sequence and tail type. Moreover, PolyAtailor integrates rich functions for poly(A) tail and poly(A) site analyses, such as differential poly(A) length analysis, poly(A) site identification and annotation, and statistics and visualization of base composition in tails. We compared PolyAtailor with three latest methods, FLAMAnalysis, FLEPSeq and PAIsoSeqAnalysis, using data from three sequencing protocols in HeLa samples and Arabidopsis. Results show that PolyAtailor is effective in measuring poly(A) tail length and detecting significance of differential poly(A) length, which achieves much higher sensitivity and accuracy than competing methods. PolyAtailor is available at https://github.com/BMILAB/PolyAtailor.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available