4.7 Article

Machine learning-based model for accurate identification of druggable proteins using light extreme gradient boosting

Related references

Note: Only part of the references are listed.
Article Chemistry, Medicinal

AFP-SPTS: An Accurate Prediction of Antifreeze Proteins Using Sequential and Pseudo-Tri-Slicing Evolutionary Features with an Extremely Randomized Tree

Adnan Khan et al.

Summary: The development of intracellular ice in cold-blooded organisms can be fatal, but they produce antifreeze proteins (AFPs) to survive in subzero temperatures. AFPs also have applications in various fields. This study introduces a new approach called AFP-SPTS for accurate AFP prediction. By exploring different features and using ensemble learning with machine learning algorithms, the proposed predictor achieved improved accuracies compared to existing models.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2023)

Review Computer Science, Interdisciplinary Applications

Recent Advances in Machine Learning-Based Models for Prediction of Antiviral Peptides

Farman Ali et al.

Summary: Viruses like COVID-19, HIV, and hepatitis have caused illness and death on a global scale. Antiviral peptides (AVPs) have been used to develop drugs for these diseases, and accurate prediction of AVPs is crucial. This study thoroughly examines existing predictors of AVPs, discussing datasets, feature representation methods, classification algorithms, and evaluation parameters. It also highlights limitations of current studies and suggests future improvements for more accurate AVP prediction.

ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING (2023)

Article Automation & Control Systems

Comparative analysis of the existing methods for prediction of antifreeze proteins

Adnan Khan et al.

Summary: Antifreeze proteins (AFPs) are found in various organisms and play a crucial role in preventing the formation of ice crystals. The development of accurate predictors for identifying AFPs is essential. This review article provides a comprehensive summary of existing AFP predictors, including their applied datasets, feature descriptors, model training classifiers, performance assessment parameters, and web servers. The drawbacks of current predictors are highlighted, and suggestions for future improvements, such as more effective feature descriptors and efficient classifiers, are discussed.

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS (2023)

Article Biochemistry & Molecular Biology

Deep-AGP: Prediction of angiogenic protein by integrating two-dimensional convolutional neural network with discrete cosine transform

Farman Ali et al.

Summary: Angiogenic proteins (AGPs) have various applications in cancer and are important in understanding cardiovascular and neurodegenerative diseases. In this research, a computational model using deep learning was established to identify AGPs. By constructing a sequence-based dataset and designing a novel feature encoder, the highest success rate was achieved with a proposed new feature descriptor in a 2D-CNN model. The proposed method (Deep-AGP) has potential for use in cancer research and the development of therapeutic methods for cardiovascular and neurodegenerative diseases.

INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES (2023)

Article Biochemistry & Molecular Biology

A bi-layer model for identification of piwiRNA using deep neural learning

Adnan Adnan et al.

Summary: piwiRNA is a type of ncRNA that cannot produce proteins and plays a crucial role in gamete generation and gene expression regulation. It has various functions such as deadenylation, animal fertility, transposon silencing, anti-viral activity, and endogenous gene regulation. The prediction of piwiRNA is of great significance and improving its prediction is highly desirable.

JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS (2023)

Article Computer Science, Artificial Intelligence

DrugFinder: Druggable Protein Identification Model Based on Pre-Trained Models and Evolutionary Information

Mu Zhang et al.

Summary: Identifying druggable proteins is crucial for drug development. Traditional structure-based methods are time-consuming and expensive, leading to the increasing shift towards sequence-based methods. In this study, we propose a sequence-based model called DrugFinder, which extracts features from the embedding output of a pre-trained protein model and the evolutionary information of the position-specific scoring matrix. We used the random forest method to select features and tested them on various machine learning classifiers, with the XGB model achieving the best results. DrugFinder showed significantly better performance than existing methods, achieving high accuracy, sensitivity, and specificity on independent test sets.

ALGORITHMS (2023)

Article Computer Science, Information Systems

Prediction of Amyloid Proteins Using Embedded Evolutionary & Ensemble Feature Selection Based Descriptors With eXtreme Gradient Boosting Model

Shahid Akbar et al.

Summary: Amyloid proteins (AMYs) are insoluble fibrous aggregates with major pathogenic effects on tissues, and their abnormal deposition can lead to several diseases. Accurately predicting AMYs and their pathogenic characteristics is crucial, and various computational predictors have been proposed. This study introduces an intelligent computational predictor that combines evolutionary features, coupling information from AMY sequences, and machine learning techniques to achieve improved prediction accuracy of 93.10% on training sequences and 89.67% on independent sequences, surpassing existing predictors by 4%. The predictive approach is anticipated to be valuable in drug development and academic research.

IEEE ACCESS (2023)

Article Engineering, Biomedical

Identification of antioxidant proteins using a discriminative intelligent model of k-space amino acid pairs based descriptors incorporating with ensemble feature selection

Ashfaq Ahmad et al.

Summary: This study developed an intelligent model that accurately identifies antioxidant proteins and evaluates their effectiveness in the human body. By utilizing evolutionary and sequence-based numerical descriptors, combined with feature selection and support vector machine algorithm, the model achieved high accuracy on both training and independent datasets, outperforming existing computational models.

BIOCYBERNETICS AND BIOMEDICAL ENGINEERING (2022)

Article Automation & Control Systems

Deep-PCL: A deep learning model for prediction of cancerlectins and non cancerlectins using optimized integrated features

Adnan et al.

Summary: A fast deep learning model was developed to discriminate cancerlectins from noncancerlectins. By combining and selecting different feature descriptive techniques, the model was able to accurately and efficiently distinguish cancerlectins.

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS (2022)

Article Automation & Control Systems

iAFPs-EnC-GA: Identifying antifungal peptides using sequential and evolutionary descriptors based multi-information fusion and ensemble learning approach

Ashfaq Ahmad et al.

Summary: Fungal infections are a global health concern, and existing treatments have severe side effects. This study presents an intelligent learning approach to accurately predict antifungal peptides by exploring sequential and evolutionary features. The proposed iAFPs-EnC-GA model achieves a high prediction accuracy and has the potential to play a key role in drug development and academic research.

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS (2022)

Article Biology

Target-DBPPred: An intelligent model for prediction of DNA-binding proteins using discrete wavelet transform based compression and light eXtreme gradient boosting

Farman Ali et al.

Summary: This study presents an intelligent computational method, Target-DBPPred, to improve the accurate prediction of DBPs. The method combines different evolutionary methods and utilizes a new feature descriptor to capture diverse multivariate features. Experimental results show that the proposed method achieves higher accuracies on both training and testing datasets.

COMPUTERS IN BIOLOGY AND MEDICINE (2022)

Article Multidisciplinary Sciences

XGB-DrugPred: computational prediction of druggable proteins using eXtreme gradient boosting and optimized features set

Rahu Sikander et al.

Summary: This study proposes a machine learning-based method for accurate prediction of drug targets. By extracting features from protein sequences and using feature selection and classifiers for training and prediction, this method achieves the best results in empirical experiments.

SCIENTIFIC REPORTS (2022)

Article Automation & Control Systems

ACP-2DCNN: Deep learning-based model for improving prediction of anticancer peptides using two-dimensional convolutional neural network

Ali Ghulam et al.

Summary: Cancer, the most dangerous disease for humans, has caused numerous deaths and suffering. Researchers have focused on using cancer peptides for cancer treatment. However, predicting anticancer peptides through experimental methods is costly and laborious, leading to a demand for advanced algorithms. This study proposes a deep learning-based method, ACP-2DCNN, which achieves better prediction accuracy for anticancer peptides compared to existing methods.

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

DBP-CNN: Deep learning-based prediction of DNA-binding proteins by coupling discrete cosine transform with two-dimensional convolutional neural network

Omar Barukab et al.

Summary: This paper proposes a deep learning-based method, DBP-CNN, to improve the prediction of DNA-binding Proteins (DBPs). A novel feature descriptor, PSSM-TS-DCT, is designed to efficiently extract important features, and compared with conventional feature descriptors. Experimental results show that DBP-CNN predicts DBPs more accurately than existing predictors.

EXPERT SYSTEMS WITH APPLICATIONS (2022)

Article Multidisciplinary Sciences

Prediction of antifreeze proteins using machine learning

Adnan Khan et al.

Summary: In this research, a novel computational method called AFP-LXGB has been proposed for more precise prediction of antifreeze proteins (AFPs). By exploring information through various feature sets and selecting the best feature set, the method has shown significant improvements in prediction accuracy. This approach has important applications in fields such as medicine, agriculture, industry, and biotechnology.

SCIENTIFIC REPORTS (2022)

Article Multidisciplinary Sciences

Computational prediction and interpretation of druggable proteins using a stacked ensemble-learning framework

Phasit Charoenkwan et al.

Summary: This study introduces a computational tool, SPIDER, for accurate prediction of druggable proteins. By employing various feature descriptors and machine learning algorithms, SPIDER enables more precise and robust prediction compared to baseline models and existing methods in an independent test dataset.

ISCIENCE (2022)

Article Engineering, Biomedical

Deep-GHBP: Improving prediction of Growth Hormone-binding proteins using deep learning model

Farman Ali et al.

Summary: This study presents an innovative approach for the prediction of Growth hormone-binding proteins (GHBPs) using PSSM, CTD, Geary, Moran, and PseAAC. Important features were selected using a novel MMC feature selection algorithm and the prediction was done using DNN as a classifier. The results showed that this method outperformed existing predictors in terms of accuracy.

BIOMEDICAL SIGNAL PROCESSING AND CONTROL (2022)

Article Automation & Control Systems

DBP-DeepCNN: Prediction of DNA-binding proteins using wavelet-based denoising and deep learning

Farman Ali et al.

Summary: In this study, a deep learning-based predictor (DBP-DeepCNN) is proposed to improve the prediction of DNA-binding proteins (DBPs). By using a novel feature extraction method and training with various models, the predictor achieved higher accuracies on both training and independent datasets, indicating its potential for large scale DBP prediction and promising therapeutic strategies for chronic diseases.

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS (2022)

Article Mathematical & Computational Biology

DBP-iDWT: Improving DNA-Binding Proteins Prediction Using Multi-Perspective Evolutionary Profile and Discrete Wavelet Transform

Farman Ali et al.

Summary: DNA-binding proteins (DBPs) play crucial roles in biological activities, and this research designed a novel predictor to accurately identify DBPs by transforming sequence features and introducing new descriptors. Experimental results demonstrate superior performance of the model and its potential value in therapeutic strategies for fatal diseases.

COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE (2022)

Article Biochemical Research Methods

Predicting drug-target binding affinity through molecule representation block based on multi-head attention and skip connection

Li Zhang et al.

Summary: This study presents a novel model called MRBDTA to improve the existing computational models for drug-target binding affinity prediction. MRBDTA achieves better performance in prediction accuracy and can provide interpretability analysis. The case studies demonstrate the reliable performance of MRBDTA in drug design for SARS-CoV-2.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Biochemical Research Methods

DAESTB: inferring associations of small molecule-miRNA via a scalable tree boosting model based on deep autoencoder

Li Peng et al.

Summary: A new computational method based on a deep autoencoder and a scalable tree boosting model is proposed to predict associations between small molecule and miRNA. The method outperforms state-of-the-art methods in terms of accuracy and the predicted associations are confirmed in case studies.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Automation & Control Systems

iDBP-PBMD: A machine learning model for detection of DNA-binding proteins by extending compression techniques into evolutionary profile

Ameen Banjar et al.

Summary: DNA-binding proteins (DBPs) play crucial roles in DNA transcription, recombination, and replication, and are associated with diseases like AIDS/HIV, cancer, and asthma. This research encoded DBPs using different feature descriptors and eliminated noisy and redundant features using compression techniques. The resulting features were used to train models with XGBoost and ERT classifiers. The study demonstrated the superiority of this approach over previous methods.

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS (2022)

Article Automation & Control Systems

Deep-AntiFP: Prediction of antifungal peptides using distanct multi-informative features incorporating with deep neural networks

Ashfaq Ahmad et al.

Summary: Fungal infections have become a serious global issue, making accurate identification of antifungal peptides challenging for researchers. A high-performance intelligent model, DeepAntiFP, was proposed and demonstrated to outperform existing computational models in antifungal peptide identification.

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS (2021)

Article Biology

iAtbP-Hyb-EnC: Prediction of antitubercular peptides via heterogeneous feature representation and genetic algorithm based ensemble learning model

Shahid Akbar et al.

Summary: This study proposes the use of an intelligent and reliable prediction model for identifying antitubercular peptides, using an ensemble learning approach to improve results. The approach aims to compensate for the shortcomings of individual classification algorithms and show promising results in accurately predicting tuberculosis.

COMPUTERS IN BIOLOGY AND MEDICINE (2021)

Article Biology

AFP-CMBPred: Computational identification of antifreeze proteins by extending consensus sequences into multi-blocks evolutionary information

Farman Ali et al.

Summary: In extremely cold environments, organisms produce Antifreeze proteins to survive. A novel predictor AFP-CMBPred was developed based on machine learning for accurate prediction of these proteins, outperforming existing models in terms of prediction accuracy.

COMPUTERS IN BIOLOGY AND MEDICINE (2021)

Article Computer Science, Artificial Intelligence

A privacy-conserving framework based intrusion detection method for detecting and recognizing malicious behaviours in cyber-physical power networks

Izhar Ahmed Khan et al.

Summary: A privacy-conserving framework called PC-IDS is proposed for achieving the privacy and safety features of Smart Power Systems through a hybrid machine learning approach. The framework includes two key components: data pre-processing for privacy preservation and intrusion detection using a probabilistic neural network. Experimental results demonstrate that the framework can effectively protect data and identify malicious events.

APPLIED INTELLIGENCE (2021)

Review Biochemistry & Molecular Biology

Deep Learning in Virtual Screening: Recent Applications and Developments

Talia B. Kimber et al.

Summary: Machine learning plays a crucial role in drug discovery, especially in virtual screening, with a focus on deep learning techniques. Recent research mainly focuses on different compound and protein encodings, as well as data sets used for model training and testing.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2021)

Article Computer Science, Artificial Intelligence

F*: an interpretable transformation of the F-measure

David J. Hand et al.

Summary: The F-measure, also known as the F1-score, is commonly used to evaluate classification algorithms, but some researchers question its intuitive interpretation and the combination of precision and recall. To address this concern, a simple transformation called F* (F-star) with practical interpretation has been introduced.

MACHINE LEARNING (2021)

Article Biochemical Research Methods

Predicting potential small molecule-miRNA associations based on bounded nuclear norm regularization

Xing Chen et al.

Summary: The study developed a computational model for predicting small molecule-miRNA associations using Bounded Nuclear Norm Regularization method. The innovation lies in limiting matrix elements within (0,1) and incorporating a regularization term to tolerate noise.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biology

Prediction of protein-protein interaction sites through eXtreme gradient boosting with kernel principal component analysis

Xue Wang et al.

Summary: The PPISP-XGBoost method proposed in the study uses XGBoost to predict PPI sites by extracting and optimizing features, achieving higher accuracy compared to existing methods on multiple datasets. The results demonstrate the effectiveness of PPISP-XGBoost in enhancing the prediction of PPI sites.

COMPUTERS IN BIOLOGY AND MEDICINE (2021)

Article Biology

COVID-19 cough classification using machine learning and global smartphone recordings

Madhurananda Pahar et al.

Summary: This study introduced a machine learning based COVID-19 cough classifier that can effectively differentiate COVID-19 positive coughs from negative and healthy coughs, showcasing its potential as a cost-effective and deployable non-contact screening method for COVID-19.

COMPUTERS IN BIOLOGY AND MEDICINE (2021)

Article Computer Science, Information Systems

piEnPred: a bi-layered discriminative model for enhancers and their subtypes via novel cascade multi-level subset feature selection algorithm

Zaheer Ullah Khan et al.

Summary: The study focused on the prediction of enhancers by building a more robust computational bi-layered model. The first layer accurately predicts enhancers, while the second layer predicts their subtypes. The proposed method outperformed other state-of-the-art predictors and a user-friendly web server has been developed for accessibility to experimental scientists.

FRONTIERS OF COMPUTER SCIENCE (2021)

Article Biochemical Research Methods

DBP-GAPred: An intelligent method for prediction of DNA-binding proteins types by enhanced evolutionary profile features with ensemble learning

Omar Barukab et al.

Summary: This study extends the feature extraction method for protein sequences and trains a model using SVM to improve the prediction performance of DNA-binding proteins. The new predictor achieved higher accuracies than existing predictors, demonstrating the superiority of the method.

JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (2021)

Article Biochemistry & Molecular Biology

TargetCPP: accurate prediction of cell-penetrating peptides from optimized multi-scale features using gradient boost decision tree

Muhammad Arif et al.

JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN (2020)

Article Biotechnology & Applied Microbiology

Prediction of G Protein-Coupled Receptors With CTDC Extraction and MRMD2.0 Dimension-Reduction Methods

Xingyue Gu et al.

FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY (2020)

Article Automation & Control Systems

iHBP-DeepPSSM: Identifying hormone binding proteins using PsePSSM based evolutionary features and deep learning approach

Shahid Akbar et al.

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS (2020)

Review Biotechnology & Applied Microbiology

Artificial Intelligence in Drug Discovery: A Comprehensive Review of Data-driven and Machine Learning Approaches

Hyunho Kim et al.

BIOTECHNOLOGY AND BIOPROCESS ENGINEERING (2020)

Article Biochemistry & Molecular Biology

DP-BINDER: machine learning model for prediction of DNA-binding proteins by fusing evolutionary and physicochemical information

Farman Ali et al.

JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN (2019)

Article Computer Science, Artificial Intelligence

Accurate prediction of potential druggable proteins based on genetic algorithm and Bagging-SVM ensemble classifier

Jianying Lin et al.

ARTIFICIAL INTELLIGENCE IN MEDICINE (2019)

Review Biology

A review of feature selection methods in medical applications

Beatriz Remeseiro et al.

COMPUTERS IN BIOLOGY AND MEDICINE (2019)

Article Biochemical Research Methods

Prediction of membrane protein types by exploring local discriminative information from evolutionary profiles

Muhammad Kabir et al.

ANALYTICAL BIOCHEMISTRY (2019)

Article Automation & Control Systems

iPredCNC: Computational prediction model for cancerlectins and non-cancerlectins using novel cascade features subset selection

Zaheer Ullah Khan et al.

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS (2019)

Article Biochemistry & Molecular Biology

DrugBank 5.0: a major update to the DrugBank database for 2018

David S. Wishart et al.

NUCLEIC ACIDS RESEARCH (2018)

Editorial Material Pharmacology & Pharmacy

Clinical Success of Drug Targets Prospectively Predicted by In Silico Study

Feng Zhu et al.

TRENDS IN PHARMACOLOGICAL SCIENCES (2018)

Article Biochemical Research Methods

In silico re-identification of properties of drug target proteins

Baeksoo Kim et al.

BMC BIOINFORMATICS (2017)

Article Biochemical Research Methods

Drug-target interaction prediction: databases, web servers and computational models

Xing Chen et al.

BRIEFINGS IN BIOINFORMATICS (2016)

Review Genetics & Heredity

LncRNAs: emerging players in gene regulation and disease pathogenesis

Mina Kazemzadeh et al.

JOURNAL OF GENETICS (2015)

Article Oncology

Molecular Pathways: Targeting Hsp90-Who Benefits and Who Does Not

Maurizio Scaltriti et al.

CLINICAL CANCER RESEARCH (2012)

Article Biochemical Research Methods

Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework

Yoshihiro Yamanishi et al.

BIOINFORMATICS (2010)

Article Biology

Predict potential drug targets from the ion channel proteins based on SVM

Chen Huang et al.

JOURNAL OF THEORETICAL BIOLOGY (2010)

Article Biochemical Research Methods

Properties and identification of human protein drug targets

Tala M. Bakheet et al.

BIOINFORMATICS (2009)

Article Biochemical Research Methods

Supervised prediction of drug-target interactions using bipartite local models

Kevin Bleakley et al.

BIOINFORMATICS (2009)

Article Biochemical Research Methods

Protein Structure Classification Based on Conserved Hydrophobic Residues

Pradeep Chowriappa et al.

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS (2009)

Review Chemistry, Multidisciplinary

GPCR-CA: A Cellular Automaton Image Approach for Predicting G-Protein-Coupled Receptor Functional Classes

Xuan Xiao et al.

JOURNAL OF COMPUTATIONAL CHEMISTRY (2009)

Article Biochemical Research Methods

Prediction of drug-target interaction networks from the integration of chemical and genomic spaces

Yoshihiro Yamanishi et al.

BIOINFORMATICS (2008)

Article Biochemical Research Methods

Prediction of potential drug targets based on simple sequence properties

Qingliang Li et al.

BMC BIOINFORMATICS (2007)

Review Multidisciplinary Sciences

Drug discovery: A historical perspective

J Drews

SCIENCE (2000)