4.7 Article

STALLION: a stacking-based ensemble learning framework for prokaryotic lysine acetylation site prediction

Related references

Note: Only part of the references are listed.
Review Biochemistry & Molecular Biology

Recent Trends on the Development of Machine Learning Approaches for the Prediction of Lysine Acetylation Sites

Shaherin Basith et al.

Summary: Acetylation on lysine residues is an important protein post-translational modification, and the development of computational tools based on machine learning has become crucial due to the cost-ineffectiveness and time-consumption of experimental methods. This review provides an overview of the state-of-the-art machine learning predictors for lysine acetylation, including databases, algorithms, feature selection methods, validation techniques, and software utility. It also discusses the limitations and future directions of machine learning approaches for predicting lysine acetylation sites.

CURRENT MEDICINAL CHEMISTRY (2022)

Article Biochemical Research Methods

Computational prediction and interpretation of both general and specific types of promoters in Escherichia coli by exploiting a stacked ensemble-learning framework

Fuyi Li et al.

Summary: Promoters are short DNA sequences responsible for gene transcription activation or repression in bacteria, and identifying them accurately is crucial for understanding their functions. Current computational methods for promoter classification have shown unsatisfactory performance, prompting the development of a novel stacked-ensemble approach called SELECTOR. SELECTOR outperformed existing methods in predicting both general and specific types of promoters in Escherichia coli, providing crucial insights through the Shapley Additive exPlanation algorithm.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biochemical Research Methods

iCarPS: a computational tool for identifying protein carbonylation sites by novel encoded features

Dan Zhang et al.

Summary: In this study, a predictor called iCarPS was developed to identify carbonylation sites based on sequence information, utilizing a novel feature encoding scheme and feature selection technique. Experiments demonstrated the accuracy and robustness of iCarPS, showing its powerful performance in carbonylation sites identification compared to other published methods.

BIOINFORMATICS (2021)

Article Biochemical Research Methods

DeepVF: a deep learning-based hybrid framework for identifying virulence factors using the stacking strategy

Ruopeng Xie et al.

Summary: Virulence factors (VFs) play a crucial role in pathogen-host interactions. Current methods for predicting VFs face challenges in identifying novel factors and lack systematic feature engineering efforts. Addressing these issues could lead to improved accuracy in VF prediction.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biochemical Research Methods

Meta-i6mA: an interspecies predictor for identifying DNA N6-methyladenine sites of plant genomes by exploiting informative features in an integrative machine-learning framework

Md Mehedi Hasan et al.

Summary: DNA N-6-methyladenine (6mA) is a crucial epigenetic modification responsible for various cellular processes. Accurate identification of 6mA sites is challenging in genome analysis, and existing machine learning models have limited practical application across different plant species. In this study, Meta-i6mA, combining 30 baseline models, performed exceptionally well in independent tests on Rosaceae, rice, and Arabidopsis thaliana, showing higher Matthews correlation coefficient values compared to existing predictors.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biochemistry & Molecular Biology

BastionHub: a universal platform for integrating and analyzing substrates secreted by Gram-negative bacteria

Jiawei Wang et al.

Summary: Gram-negative bacteria use secretion systems to export substrates for survival. BastionHub, a comprehensive online database, integrates different types of secreted substrates and provides advanced analysis tools for biologists.

NUCLEIC ACIDS RESEARCH (2021)

Review Biochemical Research Methods

Large-scale comparative review and assessment of computational methods for anti-cancer peptide identification

Xiao Liang et al.

Summary: Anti-cancer peptides (ACPs) are potential therapeutics for cancer with the ability to target cancer cells specifically. Various machine learning methods have been developed for in silico identification of ACPs, leading to extensive research on their therapeutic mechanisms. Summarizing the advantages and disadvantages of existing methods and providing suggestions for improvement is necessary to advance the accurate identification of ACPs.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biochemical Research Methods

BERT4Bitter: a bidirectional encoder representations from transformers (BERT)-based model for improving the prediction of bitter peptides

Phasit Charoenkwan et al.

Summary: The BERT4Bitter model presented in this study utilizes a bidirectional encoder representation from transformers to predict bitter peptides directly from their amino acid sequence, achieving superior performance. Compared to traditional machine learning models, BERT4Bitter shows significant improvements in accuracy and correlation coefficient, offering a new approach for identifying bitter peptides.

BIOINFORMATICS (2021)

Article Biochemistry & Molecular Biology

PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features

Andi Nur Nilamyani et al.

Summary: Nitrotyrosine, a type of protein post-translational modification, is generated by reactive nitrogen species. Computational prediction, such as the PredNTS predictor developed in this study, plays a vital role in understanding nitrated proteins before biological experimentation. The PredNTS predictor outperforms existing predictors and provides a useful computational resource for predicting nitrotyrosine sites.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2021)

Article Biochemical Research Methods

Prediction and analysis of multiple protein lysine modified sites based on conditional wasserstein generative adversarial networks

Yingxi Yang et al.

Summary: The MultiLyGAN machine learning pipeline showed good predictive performance for identifying lysine modified sites in proteins, with CWGAN being effective in addressing data imbalance. CKSAAP, PWM, and structural features were identified as the most important feature-encoding schemes.

BMC BIOINFORMATICS (2021)

Article Biochemical Research Methods

NeuroPred-FRL: an interpretable prediction model for identifying neuropeptide using feature representation learning

Md Mehedi Hasan et al.

Summary: Neuropeptides play a crucial role in regulating immune systems, and rapid and accurate identification of them is essential for basic research and drug development. Using machine learning and feature representation learning, the developed NeuroPred-FRL predictor demonstrates superior prediction performance, serving as a powerful tool for large-scale identification of neuropeptides.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biochemical Research Methods

StackIL6: a stacking ensemble model for improving the prediction of IL-6 inducing peptides

Phasit Charoenkwan et al.

Summary: IL-6 release is stimulated by antigenic peptides and immune cells, making IL-6 inducing peptides useful not only as diagnostic biomarkers, but also as inhibitors for aggressive immune responses. A novel stacking ensemble model, StackIL6, was developed using twelve feature descriptors and five machine learning algorithms, showing better performance than existing methods in identifying IL-6 inducing peptides. Accessible through a web server, StackIL6 has the potential to aid in the rapid screening of promising peptides for diagnostic and immunotherapeutic applications.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biotechnology & Applied Microbiology

mRNALocater: Enhance the prediction accuracy of eukaryotic mRNA subcellular localization by using model fusion strategy

Qiang Tang et al.

Summary: An innovative method called mRNALocater was proposed to detect the subcellular localization of eukaryotic mRNA by adopting a model fusion strategy. The method utilizes electron-ion interaction pseudopotential and pseudo k-tuple nucleotide composition to encode sequences and employs correlation coefficient filtering algorithm and feature forward search technology to mine hidden feature information, improving prediction accuracy. Results from independent dataset tests show promising performances for predicting eukaryotic mRNA sub-cellular localizations, making mRNALocater a powerful tool in practical applications.

MOLECULAR THERAPY (2021)

Article Biochemical Research Methods

Computational prediction and interpretation of cell-specific replication origin sites from multiple eukaryotes by exploiting stacking framework

Leyi Wei et al.

Summary: The study introduced a novel machine learning approach called Stack-ORI to identify replication origin sites (ORIs) in four different eukaryotic species. Results showed that Stack-ORI outperformed baseline models on both training and independent datasets, consistently achieving better performance across all cell-specific models. The novel approach also provided necessary explanations for model success, highlighting the most important feature encoding schemes significant for predicting cell-specific ORIs.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Chemistry, Medicinal

iATP: A Sequence Based Method for identifying Anti-tubercular Peptides

Wei Chen et al.

MEDICINAL CHEMISTRY (2020)

Article Biochemical Research Methods

iMRM: a platform for simultaneously identifying multiple kinds of RNA modifications

Kewei Liu et al.

BIOINFORMATICS (2020)

Review Chemistry, Medicinal

Machine intelligence in peptide therapeutics: A next-generation tool for rapid disease screening

Shaherin Basith et al.

MEDICINAL RESEARCH REVIEWS (2020)

Article Biochemistry & Molecular Biology

Meta-iPVP: a sequence-based meta-predictor for improving the prediction of phage virion proteins using effective feature representation

Phasit Charoenkwan et al.

JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN (2020)

Article Biochemistry & Molecular Biology

ProIn-Fuse: improved and robust prediction of proinflammatory peptides by fusing of multiple feature representations

Mst. Shamima Khatun et al.

JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN (2020)

Review Biochemistry & Molecular Biology

The Chemical Biology of Reversible Lysine Post-translational Modifications

Zhipeng A. Wang et al.

CELL CHEMICAL BIOLOGY (2020)

Review Computer Science, Artificial Intelligence

An Introductory Review of Deep Learning for Prediction Models With Big Data

Frank Emmert-Streib et al.

FRONTIERS IN ARTIFICIAL INTELLIGENCE (2020)

Article Biochemical Research Methods

Iterative feature representations improve N4-methylcytosine site prediction

Leyi Wei et al.

BIOINFORMATICS (2019)

Article Medicine, Research & Experimental

Meta-4mCpred: A Sequence-Based Meta-Predictor for Accurate DNA 4mC Site Prediction Using Effective Feature Representation

Balachandran Manavalan et al.

MOLECULAR THERAPY-NUCLEIC ACIDS (2019)

Article Biochemical Research Methods

Bastion3: a two-layer ensemble predictor of type III secreted effectors

Jiawei Wang et al.

BIOINFORMATICS (2019)

Article Medicine, Research & Experimental

SDM6A: A Web-Based Integrative Machine-Learning Framework for Predicting 6mA Sites in the Rice Genome

Shaherin Basith et al.

MOLECULAR THERAPY-NUCLEIC ACIDS (2019)

Article Biochemistry & Molecular Biology

Prediction of S-nitrosylation sites by integrating support vector machines and random forest

Md Mehedi Hasan et al.

MOLECULAR OMICS (2019)

Article Biochemical Research Methods

ProAcePred: prokaryote lysine acetylation sites prediction based on elastic net feature optimization

Guodong Chen et al.

BIOINFORMATICS (2018)

Article Biochemistry & Molecular Biology

PLMD: An updated data resource of protein lysine modifications

Haodong Xu et al.

JOURNAL OF GENETICS AND GENOMICS (2017)

Article Multidisciplinary Sciences

Large-scale identification of lysine acetylated proteins in vegetative hyphae of the rice blast fungus

Xiaomei Sun et al.

SCIENTIFIC REPORTS (2017)

Article Biochemical Research Methods

iPTM-mLys: identifying multiple lysine PTM sites and their different types

Wang-Ren Qiu et al.

BIOINFORMATICS (2016)

Article Multidisciplinary Sciences

GPS-PAIL: prediction of lysine acetyltransferase-specific modification sites from protein sequences

Wankun Deng et al.

SCIENTIFIC REPORTS (2016)

Review Multidisciplinary Sciences

Deep learning

Yann LeCun et al.

NATURE (2015)

Article Biotechnology & Applied Microbiology

An Intelligent System for Identifying Acetylated Lysine on Histones and Nonhistone Proteins

Cheng-Tsung Lu et al.

BIOMED RESEARCH INTERNATIONAL (2014)

Article Biochemical Research Methods

Proteome-wide Analysis of Amino Acid Variations That Influence Protein Lysine Acetylation

Sheng-Bao Suo et al.

JOURNAL OF PROTEOME RESEARCH (2013)

Article Biochemical Research Methods

Proteolytic Post-translational Modification of Proteins: Proteomic Tools and Methodology

Lindsay D. Rogers et al.

MOLECULAR & CELLULAR PROTEOMICS (2013)

Article Biochemical Research Methods

CD-HIT: accelerated for clustering the next-generation sequencing data

Limin Fu et al.

BIOINFORMATICS (2012)

Article Biochemistry & Molecular Biology

PLMLA: prediction of lysine methylation and lysine acetylation by combining multiple features

Shao-Ping Shi et al.

MOLECULAR BIOSYSTEMS (2012)

Article Chemistry, Multidisciplinary

N-Ace: Using Solvent Accessibility and Physicochemical Properties to Identify Protein N-Acetylation Sites

Tzong-Yi Lee et al.

JOURNAL OF COMPUTATIONAL CHEMISTRY (2010)

Article Biochemistry & Molecular Biology

Improved Prediction of Lysine Acetylation by Support Vector Machines

Songling Li et al.

PROTEIN AND PEPTIDE LETTERS (2009)

Article Biochemistry & Molecular Biology

AAindex: amino acid index database, progress report 2008

Shuichi Kawashima et al.

NUCLEIC ACIDS RESEARCH (2008)

Article Multidisciplinary Sciences

Predictina protein-protein interactions based only on sequences information

Juwen Shen et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2007)

Article Biochemistry & Molecular Biology

Prediction of Nε-acetylation on internal lysines implemented in Bayesian Discriminant Method

Ao Li et al.

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS (2006)

Article Computer Science, Artificial Intelligence

Extremely randomized trees

P Geurts et al.

MACHINE LEARNING (2006)

Article Biochemistry & Molecular Biology

dbPTM: an information repository of protein post-translational modification

Tzong-Yi Lee et al.

NUCLEIC ACIDS RESEARCH (2006)

Article Computer Science, Artificial Intelligence

Is combining classifiers with stacking better than selecting the best one?

S Dzeroski et al.

MACHINE LEARNING (2004)

Article Computer Science, Artificial Intelligence

Random forests

L Breiman

MACHINE LEARNING (2001)

Review Biochemistry & Molecular Biology

Nα-terminal acetylation of eukaryotic proteins

B Polevoda et al.

JOURNAL OF BIOLOGICAL CHEMISTRY (2000)