4.7 Article Proceedings Paper

Bacteriophage classification for assembled contigs using graph convolutional network

期刊

BIOINFORMATICS
卷 37, 期 -, 页码 I25-I33

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btab293

关键词

-

资金

  1. Research Grants Council of the Hong Kong Special Administrative Region, China [CityU 11206819]
  2. HKIDS [9360163]
  3. NSF of China [31972847]

向作者/读者索取更多资源

Bacteriophages, viruses that infect bacteria, play crucial roles in microbial biology, but their classification faces challenges due to high diversity and limited knowledge. A novel semi-supervised learning model called PhaGCN combines DNA and protein sequence features to classify phage contigs effectively, showing competitive performance against existing tools in both simulated and real sequencing data.
Motivation: Bacteriophages (aka phages), which mainly infect bacteria, play key roles in the biology of microbes. As the most abundant biological entities on the planet, the number of discovered phages is only the tip of the iceberg. Recently, many new phages have been revealed using high-throughput sequencing, particularly metagenomic sequencing. Compared to the fast accumulation of phage-like sequences, there is a serious lag in taxonomic classification of phages. High diversity, abundance and limited known phages pose great challenges for taxonomic analysis. In particular, alignment-based tools have difficulty in classifying fast accumulating contigs assembled from metagenomic data. Results: In this work, we present a novel semi-supervised learning model, named PhaGCN, to conduct taxonomic classification for phage contigs. In this learning model, we construct a knowledge graph by combining the DNA sequence features learned by convolutional neural network and protein sequence similarity gained from gene-sharing network. Then we apply graph convolutional network to utilize both the labeled and unlabeled samples in training to enhance the learning ability. We tested PhaGCN on both simulated and real sequencing data. The results clearly show that our method competes favorably against available phage classification tools.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据