期刊
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
卷 116, 期 534, 页码 858-873出版社
TAYLOR & FRANCIS INC
DOI: 10.1080/01621459.2020.1778481
关键词
Adaptive LASSO; Consistency; l(1) regularization; Phylogenetics; Model selection; Sparsity
资金
- National Institutes of Health [R01-GM113246, R01-AI120961, U19-AI117891, U54-GM111274]
- National Science Foundation [CISE-1564137]
- Howard Hughes Medical Institute
- Simons Foundation
This research introduces adaptive-LASSO-type regularization estimators to identify zero-length branches in phylogenetic trees, proving regularization to be a practical method in phylogenetics. This approach helps uncover special features in densely sampled phylogenetic trees, such as sampled ancestors and polytomies.
Phylogenetic tree inference using deep DNA sequencing is reshaping our understanding of rapidly evolving systems, such as the within-host battle between viruses and the immune system. Densely sampled phylogenetic trees can contain special features, including sampled ancestors in which we sequence a genotype along with its direct descendants, and polytomies in which multiple descendants arise simultaneously. These features are apparent after identifying zero-length branches in the tree. However, current maximum-likelihood based approaches are not capable of revealing such zero-length branches. In this article, we find these zero-length branches by introducing adaptive-LASSO-type regularization estimators for the branch lengths of phylogenetic trees, deriving their properties, and showing regularization to be a practically useful approach for phylogenetics. Supplementary materials for this article are available online.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据