4.7 Article

A robust model for read count data in exome sequencing experiments and implications for copy number variant calling

Journal

BIOINFORMATICS
Volume 28, Issue 21, Pages 2747-2754

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/bts526

Keywords

-

Funding

  1. Wellcome Trust Senior Research Fellow in Basic Biomedical Science [095198/Z/10/Z]
  2. Wellcome Trust grant [088838/Z/09/Z]
  3. Royal Society Research grant [RG090638]
  4. EU FP7 grant [261441]
  5. ERC Starting grant [260477]
  6. NIHR Cambridge Biomedical Research Centre
  7. MRC research grant [G1001158]
  8. NIHR Moorfields Biomedical Research Council grant
  9. Wellcome Trust [095198/Z/10/Z, 088838/Z/09/Z] Funding Source: Wellcome Trust
  10. MRC [G1001158, MC_PC_09003, G0701897, MC_G0901330] Funding Source: UKRI
  11. European Research Council (ERC) [260477] Funding Source: European Research Council (ERC)
  12. Great Ormond Street Hospital Childrens Charity [V1259, V1223] Funding Source: researchfish
  13. Medical Research Council [G1001158, G0701897, MC_PC_09003, MC_G0901330] Funding Source: researchfish
  14. National Institute for Health Research [NF-SI-0611-10001] Funding Source: researchfish
  15. Parkinson's UK [G-1107] Funding Source: researchfish

Ask authors/readers for more resources

Motivation: Exome sequencing has proven to be an effective tool to discover the genetic basis of Mendelian disorders. It is well established that copy number variants (CNVs) contribute to the etiology of these disorders. However, calling CNVs from exome sequence data is challenging. A typical read depth strategy consists of using another sample (or a combination of samples) as a reference to control for the variability at the capture and sequencing steps. However, technical variability between samples complicates the analysis and can create spurious CNV calls. Results: Here, we introduce ExomeDepth, a new CNV calling algorithm designed to control for this technical variability. ExomeDepth uses a robust model for the read count data and uses this model to build an optimized reference set in order to maximize the power to detect CNVs. As a result, ExomeDepth is effective across a wider range of exome datasets than the previously existing tools, even for small (e.g. one to two exons) and heterozygous deletions. We used this new approach to analyse exome data from 24 patients with primary immunodeficiencies. Depending on data quality and the exact target region, we find between 170 and 250 exonic CNV calls per sample. Our analysis identified two novel causative deletions in the genes GATA2 and DOCK8.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available