4.6 Article

State of art fusion-finder algorithms are suitable to detect transcription-induced chimeras in normal tissues?

期刊

BMC BIOINFORMATICS
卷 14, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/1471-2105-14-S7-S2

关键词

-

资金

  1. Italian Association for Cancer Research
  2. Epigenomics Flagship Project EPIGEN, MIUR-CNR
  3. Italian Ministero dell'Universita e della Ricerca
  4. University of Torino
  5. European Seventh framework program, Health.2012.1.2-1, NGS-PTL [306242]
  6. King Abdulaziz University of Saudi Arabia [10-15-1432/HICI]
  7. FP7 EU Health Project Next Generation Sequencing Platform for Targeted Personalized Therapy of Leukemia (NGS - PTL) [306242]
  8. Regione Piemonte

向作者/读者索取更多资源

Background: RNA-seq has the potential to discover genes created by chromosomal rearrangements. Fusion genes, also known as chimeras, are formed by the breakage and re-joining of two different chromosomes. It is known that chimeras have been implicated in the development of cancer. Few publications in the past showed the presence of fusion events also in normal tissue, but with very limited overlaps between their results. More recently, two fusion genes in normal tissues were detected using both RNA-seq and protein data. Due to heterogeneous results in identifying chimeras in normal tissue, we decided to evaluate the efficacy of state of the art fusion finders in detecting chimeras in RNA-seq data from normal tissues. Results: We compared the performance of six fusion-finder tools: FusionHunter, FusionMap, FusionFinder, MapSplice, deFuse and TopHat-fusion. To evaluate the sensitivity we used a synthetic dataset of fusion-products, called positive dataset; in these experiments FusionMap, FusionFinder, MapSplice, and TopHat-fusion are able to detect more than 78% of fusion genes. All tools were error prone with high variability among the tools, identifying some fusion genes not present in the synthetic dataset. To better investigate the false discovery chimera detection rate, synthetic datasets free of fusion-products, called negative datasets, were used. The negative datasets have different read lengths and quality scores, which allow detecting dependency of the tools on both these features. FusionMap, FusionFinder, mapSplice, deFuse and TopHat-fusion were error-prone. Only FusionHunter results were free of false positive. FusionMap gave the best compromise in terms of specificity in the negative dataset and of sensitivity in the positive dataset. Conclusions: We have observed a dependency of the tools on read length, quality score and on the number of reads supporting each chimera. Thus, it is important to carefully select the software on the basis of the structure of the RNA-seq data under analysis. Furthermore, the sensitivity of chimera detection tools does not seem to be sufficient to provide results consistent with those obtained in normal tissues on the basis of fusion events extracted from published data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据