☆ 4.7 Article

Querying multiple sets of P-values through composed hypothesis testing

BIOINFORMATICS (2022)

Journal

BIOINFORMATICS

Volume 38, Issue 1, Pages 141-148

Publisher

OXFORD UNIV PRESS

DOI: 10.1093/bioinformatics/btab592

Keywords

Funding

Indo-French Center for Applied Mathematics (IFCAM)
'Investissement d'Avenir' project [ANR-10-BTBR-0001]
Department of Biotechnology, Govt. of India
Agence Nationale de la Recherche (ANR) [ANR-10-BTBR-0001] Funding Source: Agence Nationale de la Recherche (ANR)

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This study introduces the concept of composed hypothesis and rephrases the problem of testing complex hypotheses as a classification task, demonstrating that finding items for which the composed null hypothesis is rejected boils down to fitting a mixture model and classifying the items according to their posterior probabilities. The study showcases the efficiency and usefulness of the developed method in simulations and on two different applications, providing valuable biological insight.

Motivation: Combining the results of different experiments to exhibit complex patterns or to improve statistical power is a typical aim of data integration. The starting point of the statistical analysis often comes as a set of P-values resulting from previous analyses, that need to be combined flexibly to explore complex hypotheses, while guaranteeing a low proportion of false discoveries. Results: We introduce the generic concept of composed hypothesis, which corresponds to an arbitrary complex combination of simple hypotheses. We rephrase the problem of testing a composed hypothesis as a classification task and show that finding items for which the composed null hypothesis is rejected boils down to fitting a mixture model and classifying the items according to their posterior probabilities. We show that inference can be efficiently performed and provide a thorough classification rule to control for type I error. The performance and the usefulness of the approach are illustrated in simulations and on two different applications. The method is scalable, does not require any parameter tuning, and provided valuable biological insight on the considered application cases.

Querying multiple sets of P-values through composed hypothesis testing

Journal

BIOINFORMATICS

Publisher

OXFORD UNIV PRESS

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Querying multiple sets of P-values through composed hypothesis testing

Journal

BIOINFORMATICS

Publisher

OXFORD UNIV PRESS

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper