4.7 Review

Elucidating the functional roles of prokaryotic proteins using big data and artificial intelligence

期刊

FEMS MICROBIOLOGY REVIEWS
卷 47, 期 1, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/femsre/fuad003

关键词

hypothetical proteins; annotation; omics data; machine learning; deep learning; metadata; databases

向作者/读者索取更多资源

Annotating protein sequences is crucial for understanding microbial diversity and potential applications, but traditional methods are limited by the vast amount of 'omics' data. Hypothetical proteins (HPs) are a knowledge gap with hidden potential. Artificial intelligence (AI) and machine learning algorithms offer opportunities to leverage 'Big Data' for comprehensive genome annotations. This review explores the aims, methods, and recent research examples of protein annotation, with a focus on machine and deep learning algorithms.
Annotating protein sequences according to their biological functions is one of the key steps in understanding microbial diversity, metabolic potentials, and evolutionary histories. However, even in the best-studied prokaryotic genomes, not all proteins can be characterized by classical in vivo, in vitro, and/or in silico methods-a challenge rapidly growing alongside the advent of next-generation sequencing technologies and their enormous extension of 'omics' data in public databases. These so-called hypothetical proteins (HPs) represent a huge knowledge gap and hidden potential for biotechnological applications. Opportunities for leveraging the available 'Big Data' have recently proliferated with the use of artificial intelligence (AI). Here, we review the aims and methods of protein annotation and explain the different principles behind machine and deep learning algorithms including recent research examples, in order to assist both biologists wishing to apply AI tools in developing comprehensive genome annotations and computer scientists who want to contribute to this leading edge of biological research. Different principles and methods to functionally characterize and annotate prokaryotic proteins are reviewed, with a special emphasis on machine learning and deep learning algorithms.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据