4.6 Review

Biological Sequence Classification: A Review on Data and General Methods

Journal

RESEARCH
Volume 2022, Issue -, Pages -

Publisher

AMER ASSOC ADVANCEMENT SCIENCE
DOI: 10.34133/research.0011

Keywords

-

Funding

  1. Fundamental Research Funds for the Central Universities [YJS2205, JB180307]
  2. Innovation Fund of Xidian University [YJS2205]
  3. Natural Science Foundation of China [62072353, 61922020]
  4. China Postdoctoral Science Foundation [2022T150095]
  5. Sichuan Provincial Science Fund for Distinguished Young Scholars [2021JDJQ0025]
  6. Special Science Foundation of Quzhou [2021D004]

Ask authors/readers for more resources

The rapid growth of biological sequences has driven the application of machine learning in this field, focusing on function and modification classification. Establishing a support website to provide information and datasets for classification methods, discussing current challenges and future prospects.
With the rapid development of biotechnology, the number of biological sequences has grown exponentially. The continuous expansion of biological sequence data promotes the application of machine learning in biological sequences to construct predictive models for mining biological sequence information. There are many branches of biological sequence classification research. In this review, we mainly focus on the function and modification classification of biological sequences based on machine learning. Sequence-based prediction and analysis are the basic tasks to understand the biological functions of DNA, RNA, proteins, and peptides. However, there are hundreds of classification models developed for biological sequences, and the quite varied specific methods seem dizzying at first glance. Here, we aim to establish a long-term support website (http://lab.malab.cn/similar to acy/BioseqData/home.html), which provides readers with detailed information on the classification method and download links to relevant datasets. We briefly introduce the steps to build an effective model framework for biological sequence data. In addition, a brief introduction to single-cell sequencing data analysis methods and applications in biology is also included. Finally, we discuss the current challenges and future perspectives of biological sequence classification research.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available