4.7 Article

Soft windowing application to improve analysis of high-throughput phenotyping data

Journal

BIOINFORMATICS
Volume 36, Issue 5, Pages 1492-1500

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btz744

Keywords

-

Funding

  1. NIH [UM1 HG006370, UM1 OD023221, UM1OD023222, UM1 HG006348, U42 OD011174, U54 HG005348]
  2. Genome Canada [OGI-051, 137]
  3. Ontario Genomics [OGI-051, 137]
  4. Management Expenses Grant for RIKEN BioResource Research Center, MEXT
  5. Korea Mouse Phenotyping Project of the Ministry of Science, ICT and Future Planning through the National Research Foundation [2017M3A9D5A01052447]
  6. French National Centre for Scientific Research (CNRS)
  7. French National Institute of Health and Medical Research (INSERM)
  8. University of Strasbourg
  9. Centre Europeen de Recherche en Biomedecine
  10. French state funds through the 'Agence Nationale de la Recherche' under the frame programme Investissements d'Avenir labelled [ANR-10-IDEX-0002-02, ANR-10-LABX-0030-INRT, ANR-10-INBS-07 PHENOMIN]
  11. German Federal Ministry of Education and Research: Infrafrontier [01KX1012]
  12. German Center for Diabetes Research (DZD), EU Horizon2020: IPAD-MD [653961, FP7-HEALTH-F4-2010-261492]
  13. MRC [MC_U142684171, MC_U142684172] Funding Source: UKRI

Ask authors/readers for more resources

Motivation: High-throughput phenomic projects generate complex data from small treatment and large control groups that increase the power of the analyses but introduce variation over time. A method is needed to utlize a set of temporally local controls that maximizes analytic power while minimizing noise from unspecified environmental factors. Results: Here we introduce 'soft windowing', a methodological approach that selects a window of time that includes the most appropriate controls for analysis. Using phenotype data from the International Mouse Phenotyping Consortium (IMPC), adaptive windows were applied such that control data collected proximally to mutants were assigned the maximal weight, while data collected earlier or later had less weight. We applied this method to IMPC data and compared the results with those obtained from a standard non-windowed approach. Validation was performed using a resampling approach in which we demonstrate a 10% reduction of false positives from 2.5 million analyses. We applied the method to our production analysis pipeline that establishes genotype-phenotype associations by comparing mutant versus control data. We report an increase of 30% in significant P-values, as well as linkage to 106 versus 99 disease models via phenotype overlap with the soft-windowed and non-windowed approaches, respectively, from a set of 2082 mutant mouse lines. Our method is generalizable and can benefit large-scale human phenomic projects such as the UK Biobank and the All of Us resources.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available