4.8 Article

BioSeq-BLM: a platform for analyzing DNA, RNA and protein sequences based on biological language models

期刊

NUCLEIC ACIDS RESEARCH
卷 49, 期 22, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkab829

关键词

-

资金

  1. National Natural Science Foundation of China [61822306, 61861146002, 61732012]
  2. Beijing Natural Science Foundation [JQ19019]
  3. National Key R&D Program of China [2018AAA0100100]

向作者/读者索取更多资源

This study discusses 155 different biological language models for DNA, RNA, and protein sequence analysis, extending them into the BioSeq-BLM system with superior performance in biological sequence analysis. The establishment of a corresponding web server and standalone package aims to assist readers in conducting their own experiments.
In order to uncover the meanings of 'book of life', 155 different biological language models (BLMs) for DNA, RNA and protein sequence analysis are discussed in this study, which are able to extract the linguistic properties of 'book of life'. We also extend the BLMs into a system called BioSeq-BLM for automatically representing and analyzing the sequence data. Experimental results show that the predictors generated by BioSeq-BLM achieve comparable or even obviously better performance than the exiting state-of-the-art predictors published in literatures, indicating that BioSeq-BLM will provide new approaches for biological sequence analysis based on natural language processing technologies, and contribute to the development of this very important field. In order to help the readers to use BioSeq-BLM for their own experiments, the corresponding web server and stand-alone package are established and released, which can be freely accessed at http: //bliulab.net/BioSeq-BLIW.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据