4.6 Article

Learning Hyperbolic Embedding for Phylogenetic Tree Placement and Updates

期刊

BIOLOGY-BASEL
卷 11, 期 9, 页码 -

出版社

MDPI
DOI: 10.3390/biology11091256

关键词

distance-based phylogenetics; phylogenetic placement; gene sequence embedding; deep learning; metric tree embedding; hyperbolic spaces

类别

资金

  1. National Institute of Health (NIH) [R35GM142725]

向作者/读者索取更多资源

This paper demonstrates how the conventional Euclidean deep learning methods in phylogenetics can benefit from using hyperbolic geometry. The results show that hyperbolic embeddings have lower distance errors and can be used to update species trees.
Simple Summary We show how the conventional (Euclidean) deep learning methods developed for phylogenetics can benefit from using hyperbolic geometry. The results point to lowered distance distortion and better accuracy in updating trees but not necessarily for phylogenetic placement. Phylogenetic placement, used widely in ecological analyses, seeks to add a new species to an existing tree. A deep learning approach was previously proposed to estimate the distance between query and backbone species by building a map from gene sequences to a high-dimensional space that preserves species tree distances. They then use a distance-based placement method to place the queries on that species tree. In this paper, we examine the appropriate geometry for faithfully representing tree distances while embedding gene sequences. Theory predicts that hyperbolic spaces should provide a drastic reduction in distance distortion compared to the conventional Euclidean space. Nevertheless, hyperbolic embedding imposes its own unique challenges related to arithmetic operations, exponentially-growing functions, and limited bit precision, and we address these challenges. Our results confirm that hyperbolic embeddings have substantially lower distance errors than Euclidean space. However, these better-estimated distances do not always lead to better phylogenetic placement. We then show that the deep learning framework can be used not just to place on a backbone tree but to update it to obtain a fully resolved tree. With our hyperbolic embedding framework, species trees can be updated remarkably accurately with only a handful of genes.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据