4.7 Article

OUTRIDER: A Statistical Method for Detecting Aberrantly Expressed Genes in RNA Sequencing Data

期刊

AMERICAN JOURNAL OF HUMAN GENETICS
卷 103, 期 6, 页码 907-917

出版社

CELL PRESS
DOI: 10.1016/j.ajhg.2018.10.025

关键词

-

资金

  1. German Bundesministerium fur Bildung und Forschung (BMBF) through the German Network for Mitochondrial Disorders [01GM1113C]
  2. E-Rare project GENOMIT [01GM1207]
  3. Graduate School of Quantitative Biosciences Munich
  4. Katholischer Akademischer Auslander-Dienst
  5. EU Horizon2020 Collaborative Research Project SOUND [633974]
  6. Common Fund of the Office of the Director of the National Institutes of Health
  7. National Cancer Institute, National Human Genome Research Institute, National Heart, Lung, and Blood Institute, National Institute on Drug Abuse, National Institute of Mental Health, and National Institute of Neurological Disorders and Stroke
  8. National Institute of Neurological Disorders and Stroke
  9. Juniorverbund in der Systemmedizin mitOmics'' [FKZ 01ZX1405A]

向作者/读者索取更多资源

RNA sequencing (RNA-seq) is gaining popularity as a complementary assay to genome sequencing for precisely identifying the molecular causes of rare disorders. A powerful approach is to identify aberrant gene expression levels as potential pathogenic events. However, existing methods for detecting aberrant read counts in RNA-seq data either lack assessments of statistical significance, so that establishing cutoffs is arbitrary, or rely on subjective manual corrections for confounders. Here, we describe OUTRIDER (Outlier in RNA-Seq Finder), an algorithm developed to address these issues. The algorithm uses an autoencoder to model read-count expectations according to the gene covariation resulting from technical, environmental, or common genetic variations. Given these expectations, the RNA-seq read counts are assumed to follow a negative binomial distribution with a gene-specific dispersion. Outliers are then identified as read counts that significantly deviate from this distribution. The model is automatically fitted to achieve the best recall of artificially corrupted data. Precision-recall analyses using simulated outlier read counts demonstrated the importance of controlling for covariation and significance-based thresholds. OUTRIDER is open source and includes functions for filtering out genes not expressed in a dataset, for identifying outlier samples with too many aberrantly expressed genes, and for detecting aberrant gene expression on the basis of false-discovery-rate-adjusted p values. Overall, OUTRIDER provides an end-to-end solution for identifying aberrantly expressed genes and is suitable for use by rare-disease diagnostic platforms.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据