4.6 Article

Selective Inference for Testing Trees and Edges in Phylogenetics

期刊

出版社

FRONTIERS MEDIA SA
DOI: 10.3389/fevo.2019.00174

关键词

statistical hypothesis testing; multiple testing; selection bias; model selection; Akaike information criterion; bootstrap resampling; hierarchical clustering; variable selection

类别

资金

  1. JSPS KAKENHI Grant [16H02789, 16K16024]
  2. Grants-in-Aid for Scientific Research [16K16024, 16H02789] Funding Source: KAKEN

向作者/读者索取更多资源

Selective inference is considered for testing trees and edges in phylogenetic tree selection from molecular sequences. This improves the previously proposed approximately unbiased test by adjusting the selection bias when testing many trees and edges at the same time. The newly proposed selective inference p-value is useful for testing selected edges to claim that they are significantly supported if p > 1-alpha, whereas the non-selective p-value is still useful for testing candidate trees to claim that they are rejected if p < alpha. The selective p-value controls the type-I error conditioned on the selection event, whereas the non-selective p-value controls it unconditionally. The selective and non-selective approximately unbiased p-values are computed from two geometric quantities called signed distance and mean curvature of the region representing tree or edge of interest in the space of probability distributions. These two geometric quantities are estimated by fitting a model of scaling-law to the non-parametric multiscale bootstrap probabilities. Our general method is applicable to a wider class of problems; phylogenetic tree selection is an example of model selection, and it is interpreted as the variable selection of multiple regression, where each edge corresponds to each predictor. Our method is illustrated in a previously controversial phylogenetic analysis of human, rabbit and mouse.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据