4.6 Review

Biological Sequence Classification: A Review on Data and General Methods

Related references

Note: Only part of the references are listed.
Article Biochemical Research Methods

Characterizing viral circRNAs and their application in identifying circRNAs in viruses

Mengting Niu et al.

Summary: Circular RNAs (circRNAs) are special non-coding RNAs with a circular structure, playing important roles in various biological activities. Viral circRNAs, encoded by viruses, have been found in different types of viruses. However, the characteristics and functions of viral circRNAs are still unknown. Comparative analysis reveals that viral circRNAs are less conserved compared to animal circRNAs, suggesting rapid evolution. Furthermore, viral circRNAs show similarities in nucleic acid composition but distinct differences in secondary structure and autocorrelation characteristics compared to animal circRNAs. Based on these characteristics, a machine learning model was developed to predict viral circRNAs. Additionally, analysis indicates potential interactions between viral circRNAs and human miRNAs, as well as their involvement in various KEGG pathways related to the nervous system and cancer.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Biochemical Research Methods

MGF6mARice: prediction of DNA N6-methyladenine sites in rice by exploiting molecular graph feature and residual block

Mengya Liu et al.

Summary: This paper proposes a novel deep learning method, MGF6mARice, for predicting 6mA sites in rice by devising a DNA molecular graph feature and a residual block structure. Experimental results show that this method outperforms existing approaches in 6mA prediction.

BRIEFINGS IN BIOINFORMATICS (2022)

Review Cell Biology

A guide to machine learning for biologists

Joe G. Greener et al.

Summary: This review discusses the application of machine learning to the analysis of biological data and provides guidance for experimentalists. The increasing scale and complexity of biological data have led to a growing use of machine learning in biology.

NATURE REVIEWS MOLECULAR CELL BIOLOGY (2022)

Article Biochemical Research Methods

SgRNA-RF: Identification of SgRNA On-Target Activity With Imbalanced Datasets

Mengting Niu et al.

Summary: Single-guide RNA (sgRNA) is a non-coding RNA that guides the insertion or deletion of uridine residues into kinetoplastid during RNA editing. In this paper, a new classifier called SgRNA-RF is developed, which extracts features of nucleic acid composition and structure from the on-target activity sgRNA sequence and identifies them using the random forest algorithm. The classifier significantly improves the identification accuracy and provides a user-friendly web server for implementation.

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS (2022)

Article Biochemical Research Methods

RFhy-m2G: Identification of RNA N2-methylguanosine modification sites based on random forest and hybrid features

Chunyan Ao et al.

Summary: A novel predictor, RFhy-m2G, was developed in this study to identify m2G modification sites using hybrid features and random forest. The predictor achieved high accuracies through feature fusion and optimal feature selection.

METHODS (2022)

Article Biochemical Research Methods

Transcriptome Analysis Reveals Possible Virulence Factors of Paragonimus proliferus

Sheng-Hao Li et al.

Summary: This study identified possible virulence factors of P. proliferus through transcriptome sequencing and homology analysis. Most of the predicted virulence factors that were also differentially expressed homologous genes showed lower expression in P. proliferus.

CURRENT BIOINFORMATICS (2021)

Article Biochemical Research Methods

DeepATT: a hybrid category attention neural network for identifying functional effects of DNA sequences

Jiawei Li et al.

Summary: Quantifying DNA properties is a challenging task in human genomics, and understanding non-coding DNA functions is crucial for biological research. A hybrid deep neural network method, DeepATT, is proposed to identify regulatory functions on millions of DNA sequences, outperforming existing tools and reducing parameters while maintaining accuracy. The model captures regulatory motifs, grammar, and selects features efficiently, providing insights into DNA function correlations.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biochemical Research Methods

AntiCP 2.0: an updated model for predicting anticancer peptides

Piyush Agrawal et al.

Summary: The study developed a computational model for predicting and designing anticancer peptides (ACPs), revealing residue composition preference, positional preference, and motif features of ACPs. Machine learning models were utilized and trained on different datasets, with the best models implemented on the webserver AntiCP 2.0 for free access.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biochemical Research Methods

DeepTL-Ubi: A novel deep transfer learning method for effectively predicting ubiquitination sites of multiple species

Yu Liu et al.

Summary: The paper presents a novel transfer deep learning method named DeepTL-Ubi for predicting ubiquitination sites of multiple species. This method enhances the performance of species-specific ubiquitination site prediction by transferring common knowledge from human data to other species, effectively solving the problem of insufficient training data for other species.

METHODS (2021)

Article Computer Science, Artificial Intelligence

A Convolutional Neural Network Using Dinucleotide One-hot Encoder for identifying DNA N6-Methyladenine Sites in the Rice Genome

Zhibin Lv et al.

Summary: N6-methyladenine (m(6)A) is a crucial epigenetic modification related to the control of various DNA processes. The iRicem6A-CNN protocol, using machine learning, achieved high accuracy in identifying m(6)A sites in the rice genome, outperforming other predictors.

NEUROCOMPUTING (2021)

Review Biochemical Research Methods

Goals and approaches for each processing step for single-cell RNA sequencing data

Zilong Zhang et al.

Summary: Single-cell RNA sequencing has revolutionized the study of gene expression at a cellular level, but the noise and dimensionality of the data pose challenges for statistical analysis. While there are many tools available for scRNA-seq data analysis, a universal gold standard pipeline is still lacking. Understanding bioinformatics and computational issues can help in selecting the appropriate tools for data analysis.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biochemical Research Methods

ITP-Pred: an interpretable method for predicting, therapeutic peptides with fused features low-dimension representation

Lijun Cai et al.

Summary: The development of an Interpretable Therapeutic Peptide Prediction (ITP-Pred) model based on efficient feature fusion showed higher prediction performance in cross-validation and independent verification experiments, providing guidance for designing better models.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biochemical Research Methods

SubLocEP: a novel ensemble predictor of subcellular localization of eukaryotic mRNA based on machine learning

Jing Li et al.

Summary: This study introduces a new two-layer integrated prediction model SubLocEP for more accurate prediction of subcellular localization of eukaryotic mRNA, which comprehensively considers additional feature attributes and is combined with LightGBM to generate single feature classifiers. The model demonstrates good prediction stability and generalization ability on independent datasets.

BRIEFINGS IN BIOINFORMATICS (2021)

Review Biotechnology & Applied Microbiology

Prediction of bio-sequence modifications and the associations with diseases

Chunyan Ao et al.

Summary: This review comprehensively summarizes the predictors for protein, RNA, and DNA modification sites and their association with diseases, emphasizing the importance of accurately identifying and understanding modification sites for disease research.

BRIEFINGS IN FUNCTIONAL GENOMICS (2021)

Article Chemistry, Medicinal

TargetDBP+: Enhancing the Performance of Identifying DNA-Binding Proteins via Weighted Convolutional Features

Jun Hu et al.

Summary: A new method called TargetDBP+ was developed to enhance the performance of identifying DNA-binding proteins, and a new benchmark data set named UniSwiss was created for evaluation. Experimental results showed that TargetDBP+ outperformed other methods in accuracy and precision.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2021)

Article Biochemistry & Molecular Biology

HSM6AP: a high-precision predictor for the Homo sapiens N6-methyladenosine (m6A) based on multiple weights and feature stitching

Jing Li et al.

Summary: Recent studies have shown that RNA methylation modification affects RNA function and diseases. The HSM6AP high-precision predictor, proposed in this study, explores how weighting strategies and feature selection impact model performance.

RNA BIOLOGY (2021)

Article Biotechnology & Applied Microbiology

Predicting Cell Wall Lytic Enzymes Using Combined Features

Xiao-Yang Jing et al.

Summary: An improved method for predicting cell wall lytic enzymes was proposed in this study, utilizing support vector machine and a set of features to address the crisis of antimicrobial resistance. By employing synthetic minority over-sampling technique for data balancing and feature selection, the method shows excellent performance in predicting cell wall lytic enzymes.

FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY (2021)

Article Biochemical Research Methods

Anticancer peptides prediction with deep representation learning features

Zhibin Lv et al.

Summary: The study introduced a computational method named iACP-DRLF for identifying anticancer peptides, utilizing the light gradient boosting machine algorithm and two sequence embedding technologies. Results showed that deep representation learning features significantly enhanced the models' ability to differentiate anticancer peptides.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Oncology

Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries

Hyuna Sung et al.

Summary: The global cancer burden in 2020 saw an estimated 19.3 million new cancer cases and almost 10.0 million cancer deaths. Female breast cancer surpassed lung cancer as the most commonly diagnosed cancer, while lung cancer remained the leading cause of cancer death. The burden is expected to rise by 2040, with transitioning countries experiencing a larger increase compared to transitioned countries due to demographic changes and risk factors associated with globalization and a growing economy. Efforts to improve cancer prevention measures and provision of cancer care in transitioning countries will be crucial for global cancer control.

CA-A CANCER JOURNAL FOR CLINICIANS (2021)

Article Biochemistry & Molecular Biology

iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization

Zhen Chen et al.

Summary: iLearnPlus is the first machine-learning platform with graphical- and web-based interfaces for analysis and predictions using nucleic acid and protein sequences, providing a comprehensive set of algorithms and automating sequence-based feature extraction and analysis. It caters both to experienced bioinformaticians and to biologists with no programming background, showcasing its capabilities through case studies on lncRNA prediction and crotonylation site prediction.

NUCLEIC ACIDS RESEARCH (2021)

Article Biochemistry & Molecular Biology

Computational identification of ubiquitination sites in Arabidopsis thaliana using convolutional neural networks

Xiaofeng Wang et al.

Summary: Two convolutional neural network models were proposed for predicting ubiquitination sites in Arabidopsis thaliana, which outperformed other models. The study also analyzed the physicochemical properties of amino acids and the influence of CNN structure on prediction performance. Additionally, potential ubiquitination sites in the global Arabidopsis proteome were predicted.

PLANT MOLECULAR BIOLOGY (2021)

Article Biochemical Research Methods

Deep6mA: A deep learning framework for exploring similar patterns in DNA N6-methyladenine sites across different species

Zutan Li et al.

Summary: This study introduces a deep learning framework named Deep6mA for predicting DNA 6mA sites with high accuracy. The research reveals that DNA sequences containing 6mA sites show conservation across different species, and 6mA tends to occur at GAGG motifs and in the TATA box of the promoter.

PLOS COMPUTATIONAL BIOLOGY (2021)

Article Genetics & Heredity

DNN-m6A: A Cross-Species Method for Identifying RNA N6-methyladenosine Sites Based on Deep Neural Network with Multi-Information Fusion

Lu Zhang et al.

Summary: In this study, a novel cross-species computational method DNN-m6A based on deep neural network was proposed to identify m6A sites in multiple tissues. Through five-fold cross-validation, the DNN-m6A method showed higher accuracy and AUC compared to existing methods, demonstrating excellent performance.

GENES (2021)

Review Biochemical Research Methods

A comprehensive review of the imbalance classification of protein post-translational modifications

Lijun Dou et al.

Summary: Post-translational modifications (PTMs) are crucial in regulating protein functions and associated with various pathologies. Machine learning-based predictors are developed for rapid identification of PTMs due to the high cost and time consumption of sequencing techniques. However, the imbalance in data distribution poses challenges in reliability and application of prediction tools, and proposed solutions aim to enhance efficiency in imbalance learning for advanced predictors.

BRIEFINGS IN BIOINFORMATICS (2021)

Review Biotechnology & Applied Microbiology

A systematic review of computational methods for predicting long noncoding RNAs

Xinran Xu et al.

Summary: This review introduces the development of computational methods for lncRNA prediction and presents a new Python package, ezLncPred, which provides a convenient way to utilize nine state-of-the-art lncRNA prediction methods. The challenges and future directions of lncRNA prediction are also discussed in the paper.

BRIEFINGS IN FUNCTIONAL GENOMICS (2021)

Article Chemistry, Multidisciplinary

Targeting RNA with Next- and Third-Generation Sequencing Improves Pathogen Identification in Clinical Samples

Na Zhao et al.

Summary: Utilizing RNA sequencing and third-generation sequencing technology for pathogen identification can improve the ratio of microbial reads and accelerate clinical diagnosis, showing significant potential.

ADVANCED SCIENCE (2021)

Article Biochemistry & Molecular Biology

Accurate identification of RNA D modification using multiple features

Lijun Dou et al.

Summary: The researchers proposed a novel predictor, iRNAD_XGBoost, to identify potential D modification sites in tRNAs using multiple RNA sequence representations. The optimized model showed high accuracy in cross-validation tests and demonstrated consistent prediction efficiencies for positive and negative samples.

RNA BIOLOGY (2021)

Article Biochemical Research Methods

PEPRF: Identification of Essential Proteins by Integrating Topological Features of PPI Network and Sequence-Based Features via Random Forest

Chuanyan Wu et al.

Summary: This study presented a computational model (PEPRF) based on machine learning to identify essential proteins, which achieved a high AUC and accuracy by extracting different features and selecting the most contributing ones for identification.

CURRENT BIOINFORMATICS (2021)

Article Biotechnology & Applied Microbiology

CWLy-RF: A novel approach for identifying cell wall lyases based on random forest classifier

Shihu Jiao et al.

Summary: Researchers identified cell wall lyases as effective antibacterial agents against drug-resistant pathogenic bacteria and proposed a new predictor, CWLy-RF, based on the RF algorithm for accurate and efficient lyase identification.

GENOMICS (2021)

Article Multidisciplinary Sciences

Attention-based multi-label neural networks for integrated prediction and interpretation of twelve widely occurring RNA modifications

Zitao Song et al.

Summary: This study introduces MultiRM, a method that predicts and interprets twelve common post-transcriptional RNA modifications simultaneously, revealing potential associations among different types of RNA modifications. This research offers a solution for detecting multiple RNA modifications and gaining a deeper understanding of the mechanisms behind sequence-based RNA modifications.

NATURE COMMUNICATIONS (2021)

Article Computer Science, Information Systems

iPro2L-PSTKNC: A Two-Layer Predictor for Discovering Various Types of Promoters by Position Specific of Nucleotide Composition

Yinuo Lyu et al.

Summary: Promoters, regulatory elements located near transcription start sites, initiate gene transcription. A novel two-layer predictor, iPro2L-PSTKNC, based on a new feature extraction model, PSTKNC, is developed to identify E. coli genome promoters effectively. The ensemble classification SVM shows the best performance with high accuracy and MCC.

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS (2021)

Article Computer Science, Information Systems

rBPDL: Predicting RNA-Binding Proteins Using Deep Learning

Mengting Niu et al.

Summary: Researchers developed a network model rBPDL using a convolutional neural network and long short-term memory for multilabel classification of RBPs, and used a voting algorithm for ensemble learning to achieve better prediction results. The model significantly improved identification performance for the RBP68 dataset, and analysis of the AUC statistics showed similar RBP identification performance in the same domain.

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS (2021)

Article Biochemistry & Molecular Biology

Identifying DNA N4-methylcytosine sites in the rosaceae genome with a deep learning model relying on distributed feature representation

Jhabindra Khanal et al.

Summary: DNA 4mC is a key epigenetic modification involved in biological functions across different species. The computational method 4mC-w2vec enhances feature selection and performance in identifying relevant sites, surpassing current tools in genomic datasets.

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2021)

Article Biochemistry & Molecular Biology

MULocDeep: A deep-learning framework for protein subcellular and suborganellar localization prediction with residue-level interpretation

Yuexu Jiang et al.

Summary: The paper introduces a deep learning-based protein localization prediction framework, MULocDeep, which can predict multiple localizations of a protein at both subcellular and suborganellar levels. Evaluated on a comprehensive, newly collected dataset of suborganellar localizations, MULocDeep outperforms other major methods at both the subcellular and suborganellar levels.

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2021)

Review Biotechnology & Applied Microbiology

Sequence representation approaches for sequence-based protein prediction tasks that use deep learning

Feifei Cui et al.

Summary: The article summarizes the main methods used to represent protein sequence data in bioinformatics, including end-to-end embedding, non-contextual embedding, and embedding methods using transfer learning, as well as other methods applied for specific tasks. It also reviews the theoretical architectures of different types of embedding models and their development to help in selecting the most suitable model for specific requirements.

BRIEFINGS IN FUNCTIONAL GENOMICS (2021)

Article Biotechnology & Applied Microbiology

PseudotimeDE: inference of differential gene expression along cell pseudotime with well-calibrated p-values from single-cell RNA sequencing data

Dongyuan Song et al.

Summary: PseudotimeDE is a differential gene expression identification method that adapts to various pseudotime inference methods, considers pseudotime inference uncertainty, and provides well-calibrated p-values. Comprehensive simulations and real-data applications confirm that PseudotimeDE outperforms existing methods in controlling false discovery rate and power.

GENOME BIOLOGY (2021)

Article Biochemical Research Methods

iEnhancer-XG: interpretable sequence-based enhancers and their strength predictor

Lijun Cai et al.

Summary: The study proposed a two-layer predictor named 'iEnhancer-XG' for enhancer recognition, using XGBoost as the base classifier and five feature extraction methods. By applying ensemble learning and SHapley Additive explanations, the prediction accuracy and credibility were improved.

BIOINFORMATICS (2021)

Review Mathematical & Computational Biology

Post-translational modifications in proteins: resources, tools and prediction methods

Shahin Ramazi et al.

Summary: Posttranslational modifications (PTMs) involve modifications to amino acid side chains in proteins after biosynthesis, affecting various aspects of protein functions. Disruptions in PTMs can lead to diseases, emphasizing the need for computational methods to predict PTMs. High-throughput experimental methods for PTM discovery are laborious, prompting the exploration of computational tools and databases to advance research in this area.

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2021)

Article Biochemical Research Methods

ACPred-Fuse: fusing multi-view information improves the prediction of anticancer peptides

Bing Rao et al.

BRIEFINGS IN BIOINFORMATICS (2020)

Review Biochemistry & Molecular Biology

Machine learning techniques for protein function prediction

Rosalin Bonetta et al.

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2020)

Article Biochemical Research Methods

iMRM: a platform for simultaneously identifying multiple kinds of RNA modifications

Kewei Liu et al.

BIOINFORMATICS (2020)

Article Automation & Control Systems

RBPro-RF: Use Chou's 5-steps rule to predict RNA-binding proteins via random forest with elastic net

Xiaomeng Sun et al.

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS (2020)

Article Biochemical Research Methods

Screening of SLE-susceptible SNPs in One Chinese Family with Systemic Lupus Erythematosus

Juan Luo et al.

CURRENT BIOINFORMATICS (2020)

Review Chemistry, Medicinal

Machine intelligence in peptide therapeutics: A next-generation tool for rapid disease screening

Shaherin Basith et al.

MEDICINAL RESEARCH REVIEWS (2020)

Review Neurosciences

Classification of Midbrain Dopamine Neurons Using Single-Cell Gene Expression Profiling Approaches

Jean-Francois Poulin et al.

TRENDS IN NEUROSCIENCES (2020)

Article Biotechnology & Applied Microbiology

RF-PseU: A Random Forest Predictor for RNA Pseudouridine Sites

Zhibin Lv et al.

FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY (2020)

Article Automation & Control Systems

DNNAce: Prediction of prokaryote lysine acetylation sites through deep neural networks with multi-information fusion

Bin Yu et al.

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS (2020)

Article Medicine, Research & Experimental

Is There Any Sequence Feature in the RNA Pseudouridine Modification Prediction Problem?

Lijun Dou et al.

MOLECULAR THERAPY-NUCLEIC ACIDS (2020)

Article Biochemistry & Molecular Biology

TargetCPP: accurate prediction of cell-penetrating peptides from optimized multi-scale features using gradient boost decision tree

Muhammad Arif et al.

JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN (2020)

Article Multidisciplinary Sciences

Two-Level Protein Methylation Prediction using structure model-based features

Wei Zheng et al.

SCIENTIFIC REPORTS (2020)

Article Biotechnology & Applied Microbiology

Developing a Multi-Layer Deep Learning Based Predictive Model to Identify DNA N4-Methylcytosine Modifications

Rao Zeng et al.

FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY (2020)

Article Biology

CWLy-SVM: A support vector machine-based tool for identifying cell wall lytic enzymes

Chaolu Meng et al.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2020)

Article Biochemistry & Molecular Biology

Exploring Drug Treatment Patterns Based on the Action of Drug and Multilayer Network Model

Liang Yu et al.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2020)

Article Medicine, Research & Experimental

Prediction of m5C Modifications in RNA Sequences by Combining Multiple Sequence Features

Lijun Dou et al.

MOLECULAR THERAPY-NUCLEIC ACIDS (2020)

Review Biotechnology & Applied Microbiology

Review on the Application of Machine Learning Algorithms in the Sequence Data Mining of DNA

Aimin Yang et al.

FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY (2020)

Article Biochemical Research Methods

Identification of sub-Golgi protein localization by use of deep representation learning features

Zhibin Lv et al.

BIOINFORMATICS (2020)

Article Biochemistry & Molecular Biology

Single-cell RNA-seq analysis of mouse preimplantation embryos by third-generation sequencing

Xiaoying Fan et al.

PLOS BIOLOGY (2020)

Article Computer Science, Information Systems

A Dynamic-Time Distance Based on Wavelet Decomposition for Subcellular Localization Classification

Cuifang Gao et al.

IEEE ACCESS (2020)

Article Computer Science, Artificial Intelligence

Power spectrum and dynamic time warping for DNA sequences classification

Abdesselem Dakhli et al.

EVOLVING SYSTEMS (2020)

Review Biochemical Research Methods

Evaluation of different computational methods on 5-methylcytosine sites identification

Hao Lv et al.

BRIEFINGS IN BIOINFORMATICS (2020)

Article Biochemistry & Molecular Biology

CirRNAPL: A web server for the identification of circRNA based on extreme learning machine

Mengting Niu et al.

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2020)

Article Biochemistry & Molecular Biology

Computational identification of N6-methyladenosine sites in multiple tissues of mammals

Fu-Ying Dao et al.

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2020)

Article Biochemical Research Methods

4mCPred: machine learning methods for DNA N4-methylcytosine sites prediction

Wenying He et al.

BIOINFORMATICS (2019)

Article Computer Science, Software Engineering

Comparing Similarity Perception in Time Series Visualizations

Anna Gogolou et al.

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS (2019)

Article Biochemical Research Methods

PyFeat: a Python-based effective feature generation tool for DNA, RNA and protein sequences

Rafsanjani Muhammod et al.

BIOINFORMATICS (2019)

Article Biochemical Research Methods

A deep learning method to more accurately recall known lysine acetylation sites

Meiqi Wu et al.

BMC BIOINFORMATICS (2019)

Article Biochemical Research Methods

Selene: a PyTorch-based deep learning library for sequence data

Kathleen M. Chen et al.

NATURE METHODS (2019)

Article Biochemical Research Methods

Fast interpolation-based t-SNE for improved visualization of single-cell RNA-seq data

George C. Linderman et al.

NATURE METHODS (2019)

Article Biochemistry & Molecular Biology

TriPepSVM: de novo prediction of RNA-binding proteins based on short amino acid motifs

Annkatrin Bressin et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biochemical Research Methods

Iterative feature representations improve N4-methylcytosine site prediction

Leyi Wei et al.

BIOINFORMATICS (2019)

Article Biochemical Research Methods

A Micro-aggregation Algorithm Based on Density Partition Method for Anonymizing Biomedical Data

Xiang Wu et al.

CURRENT BIOINFORMATICS (2019)

Review Biochemistry & Molecular Biology

Current best practices in single-cell RNA-seq analysis: a tutorial

Malte D. Luecken et al.

MOLECULAR SYSTEMS BIOLOGY (2019)

Editorial Material Biochemical Research Methods

Protein Function Prediction: From Traditional Classifier to Deep Learning

Zhibin Lv et al.

PROTEOMICS (2019)

Article Medicine, Research & Experimental

iPseU-CNN: Identifying RNA Pseudouridine Sites Using Convolutional Neural Networks

Muhammad Tahir et al.

MOLECULAR THERAPY-NUCLEIC ACIDS (2019)

Article Medicine, Research & Experimental

Meta-4mCpred: A Sequence-Based Meta-Predictor for Accurate DNA 4mC Site Prediction Using Effective Feature Representation

Balachandran Manavalan et al.

MOLECULAR THERAPY-NUCLEIC ACIDS (2019)

Article Biochemical Research Methods

i6mA-Pred: identifying DNA N6-methyladenine sites in the rice genome

Wei Chen et al.

BIOINFORMATICS (2019)

Article Biotechnology & Applied Microbiology

Dimensionality reduction for visualizing single-cell data using UMAP

Etienne Becht et al.

NATURE BIOTECHNOLOGY (2019)

Review Genetics & Heredity

Challenges in unsupervised clustering of single-cell RNA-seq data

Vladimir Yu Kiselev et al.

NATURE REVIEWS GENETICS (2019)

Review Biotechnology & Applied Microbiology

Research progress in protein posttranslational modification site prediction

Wenying He et al.

BRIEFINGS IN FUNCTIONAL GENOMICS (2019)

Article Biochemical Research Methods

RNA methylation and diseases: experimental results, databases, Web servers and computational models

Xing Chen et al.

BRIEFINGS IN BIOINFORMATICS (2019)

Article Biotechnology & Applied Microbiology

Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model

F. William Townes et al.

GENOME BIOLOGY (2019)

Article Computer Science, Information Systems

iIM-CNN: Intelligent Identifier of 6mA Sites on Different Species by Using Convolution Neural Network

Abdul Wahab et al.

IEEE ACCESS (2019)

Article Computer Science, Information Systems

iPhoPred: A Predictor for Identifying Phosphorylation Sites in Human Protein

Shi-Hao Li et al.

IEEE ACCESS (2019)

Article Biochemical Research Methods

Ultra-fast global homology detection with Discrete Cosine Transform and Dynamic Time Warping

Daniele Raimondi et al.

BIOINFORMATICS (2018)

Article Biochemical Research Methods

ProAcePred: prokaryote lysine acetylation sites prediction based on elastic net feature optimization

Guodong Chen et al.

BIOINFORMATICS (2018)

Review Biochemistry & Molecular Biology

The Human Transcription Factors

Samuel A. Lambert et al.

Article Computer Science, Software Engineering

EventThread: Visual Summarization and Stage Analysis of Event Sequence Data

Shunan Guo et al.

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS (2018)

Article Computer Science, Software Engineering

LSTMVis: A Tool for Visual Analysis of Hidden State Dynamics in Recurrent Neural Networks

Hendrik Strobelt et al.

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS (2018)

Article Computer Science, Hardware & Architecture

Dynamic, Fine-Grained Data Plane Monitoring With Monocle

Peter Peresini et al.

IEEE-ACM TRANSACTIONS ON NETWORKING (2018)

Article Biochemistry & Molecular Biology

BCseq: accurate single cell RNA-seq quantification with bias correction

Liang Chen et al.

NUCLEIC ACIDS RESEARCH (2018)

Article Biochemistry & Molecular Biology

BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach

Yu Huang et al.

INTERNATIONAL JOURNAL OF BIOLOGICAL SCIENCES (2018)

Review Biochemistry & Molecular Biology

A Comprehensive Review of In silico Analysis for Protein S-sulfenylation Sites

Md Mehedi Hasan et al.

PROTEIN AND PEPTIDE LETTERS (2018)

Review Medicine, Research & Experimental

An Introduction to the Analysis of Single-Cell RNA-Sequencing Data

Aisha A. AlJanahi et al.

MOLECULAR THERAPY-METHODS & CLINICAL DEVELOPMENT (2018)

Article Biochemical Research Methods

iRNA-2OM: A Sequence-Based Predictor for Identifying 2′-O-Methylation Sites in Homo sapiens

Hui Yang et al.

JOURNAL OF COMPUTATIONAL BIOLOGY (2018)

Review Biochemistry & Molecular Biology

Single-cell RNA sequencing technologies and bioinformatics pipelines

Byungjin Hwang et al.

EXPERIMENTAL AND MOLECULAR MEDICINE (2018)

Article Biochemical Research Methods

PhosPred-RF: A Novel Sequence-Based Predictor for Phosphorylation Sites Using Sequential Information Only

Leyi Wei et al.

IEEE TRANSACTIONS ON NANOBIOSCIENCE (2017)

Article Biochemical Research Methods

CPPred-RF: A Sequence-based Predictor for Identifying Cell Penetrating Peptides and Their Uptake Efficiency

Leyi Wei et al.

JOURNAL OF PROTEOME RESEARCH (2017)

Article Biochemical Research Methods

Single-cell mRNA quantification and differential analysis with Census

Xiaojie Qiu et al.

NATURE METHODS (2017)

Article Medicine, Research & Experimental

2L-piRNA: A Two-Layer Ensemble Classifier for Identifying Piwi-Interacting RNAs and Their Function

Bin Liu et al.

MOLECULAR THERAPY-NUCLEIC ACIDS (2017)

Article Multidisciplinary Sciences

Detecting N6-methyladenosine sites from RNA transcriptomes using ensemble Support Vector Machines

Wei Chen et al.

SCIENTIFIC REPORTS (2017)

Review Oncology

Single cell sequencing: a distinct new field

Jian Wang et al.

CLINICAL AND TRANSLATIONAL MEDICINE (2017)

Article Computer Science, Software Engineering

ThermalPlot: Visualizing Multi-Attribute Time-Series Data Using a Thermal Metaphor

Holger Stitz et al.

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS (2016)

Article Biology

Protein fold recognition using HMM-HMM alignment and dynamic programming

James Lyons et al.

JOURNAL OF THEORETICAL BIOLOGY (2016)

Article Biotechnology & Applied Microbiology

Wishbone identifies bifurcating developmental trajectories from single-cell data

Manu Setty et al.

NATURE BIOTECHNOLOGY (2016)

Article Biochemistry & Molecular Biology

TSCAN: Pseudo-time reconstruction and evaluation in single-cell RNA-seq analysis

Zhicheng Ji et al.

NUCLEIC ACIDS RESEARCH (2016)

Article Biotechnology & Applied Microbiology

SLICER: inferring branched, nonlinear cellular trajectories from single cell RNA-seq data

Joshua D. Welch et al.

GENOME BIOLOGY (2016)

Article Cell & Tissue Engineering

Single-Cell RNA-Seq with Waterfall Reveals Molecular Cascades underlying Adult Neurogenesis

Jaehoon Shin et al.

CELL STEM CELL (2015)

Article Chemistry, Medicinal

Improving tRNAscan-SE Annotation Results via Ensemble Classifiers

Quan Zou et al.

MOLECULAR INFORMATICS (2015)

Article Multidisciplinary Sciences

Single-cell messenger RNA sequencing reveals rare intestinal cell types

Dominic Grün et al.

NATURE (2015)

Article Immunology

Aire controls gene expression in the thymic epithelium with ordered stochasticity

Matthew Meredith et al.

NATURE IMMUNOLOGY (2015)

Article Multidisciplinary Sciences

Recent advances of DNA sequencing via nanopore-based technologies

Bing-Yuan Guo et al.

SCIENCE BULLETIN (2015)

Article Computer Science, Software Engineering

Visual Analysis of Time-Series Similarities for Anomaly Detection in Sensor Networks

Martin Steiger et al.

COMPUTER GRAPHICS FORUM (2014)

Article Developmental Biology

Single cell dissection of early kidney development: multilineage priming

Eric W. Brunskill et al.

DEVELOPMENT (2014)

Article Biochemical Research Methods

Bayesian approach to single-cell differential expression analysis

Peter V. Kharchenko et al.

NATURE METHODS (2014)

Review Genetics & Heredity

Identifying and mitigating bias in next-generation sequencing methods for chromatin biology

Clifford A. Meyer et al.

NATURE REVIEWS GENETICS (2014)

Article Multidisciplinary Sciences

Bifurcation analysis of single-cell gene expression data reveals epigenetic landscape

Eugenio Marco et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2014)

Article Biochemical Research Methods

Classification of genomic signals using dynamic time warping

Helena Skutkova et al.

BMC BIOINFORMATICS (2013)

Article Computer Science, Software Engineering

TimeBench: A Data Model and Software Library for Visual Analytics of Time-Oriented Data

Alexander Rind et al.

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS (2013)

Review Biochemistry & Molecular Biology

Computational Analysis of Phosphoproteomics: Progresses and Perspectives

Jian Ren et al.

CURRENT PROTEIN & PEPTIDE SCIENCE (2011)

Article Biochemical Research Methods

CD-HIT Suite: a web server for clustering and comparing biological sequences

Ying Huang et al.

BIOINFORMATICS (2010)

Review Biochemistry & Molecular Biology

Peptide and protein de novo sequencing by mass spectrometry

KG Standing

CURRENT OPINION IN STRUCTURAL BIOLOGY (2003)

Review Multidisciplinary Sciences

Mass spectrometry-based proteomics

R Aebersold et al.

NATURE (2003)