4.6 Article

Deep learning and support vector machines for transcription start site identification

Related references

Note: Only some of the references are listed; download the original article for the complete reference information.

Article Computer Science, Artificial Intelligence

NeuroTIS: Enhancing the prediction of translation initiation sites in mRNA sequences via a hybrid dependency network and deep learning framework

Chao Wei et al.

Summary: This study proposes a novel method for predicting translation initiation sites in mRNA sequences based on a hybrid dependency network and deep learning framework. By explicitly modeling label dependencies among coding regions and between coding regions and translation initiation sites, this method achieves excellent prediction performance on benchmark gene datasets, surpassing existing state-of-the-art methods.

KNOWLEDGE-BASED SYSTEMS (2021)

Article Multidisciplinary Sciences

DeepTFactor: A deep learning-based tool for the prediction of transcription factors

Gi Bae Kim et al.

Summary: Transcription factors are proteins that regulate gene expression by binding to specific DNA sequences, and they have traditionally been predicted through sequence homology analysis. The deep learning-based tool DeepTFactor predicts transcription factors efficiently and accurately, even those with no homology to previously reported ones.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2021)

Article Biochemical Research Methods

Floating Search Methodology for Combining Classification Models for Site Recognition in DNA Sequences

Javier Perez-Rodriguez et al.

Summary: In this paper, a methodology for combining multiple sources of information to recognize functional sites is proposed, showing significant improvement over existing methods. The use of floating search challenges the standard assumption of using genomes that are not too close or too far from the human genome to enhance the recognition of functional sites.

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS (2021)

Article Multidisciplinary Sciences

A deep learning framework combined with word embedding to identify DNA replication origins

Feng Wu et al.

Summary: This study combines word embedding with a deep learning framework to improve the accuracy and efficiency of DNA replication origin (ORI) identification. In experiments on four species, the proposed method achieved high accuracy and correlation coefficients, indicating stable and reliable performance.

SCIENTIFIC REPORTS (2021)

Article Biochemical Research Methods

DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome

Yanrong Ji et al.

Summary: This study introduces a novel pre-trained bidirectional encoder representation called DNABERT for understanding genomic DNA sequences, showing superior performance in predicting genome-wide regulatory elements.

BIOINFORMATICS (2021)

Article Computer Science, Artificial Intelligence

Comparison of machine learning and deep learning techniques in promoter prediction across diverse species

Nikita Bhandari et al.

Summary: Gene promoters, key DNA elements for regulating gene transcription, are challenging to predict because they lack obvious features, prompting the use of machine learning and deep learning models. In this study, frequency-based tokenization was found to be effective for data pre-processing, enhancing the classification performance of 1-D CNN models. The CNN outperformed the other models both in distinguishing promoter sequences from non-promoters and in species-specific classification.

PEERJ COMPUTER SCIENCE (2021)

Article Multidisciplinary Sciences

Highly accurate protein structure prediction with AlphaFold

John Jumper et al.

Summary: Proteins are essential for life, and accurate prediction of their structures is a crucial research problem. Current experimental methods are time-consuming, highlighting the need for accurate computational approaches to address the gap in structural coverage. Despite recent progress, existing methods fall short of atomic accuracy in protein structure prediction.

NATURE (2021)

Article Biochemistry & Molecular Biology

Ensembl 2021

Kevin L. Howe et al.

Summary: The Ensembl project provides genome annotation and data dissemination services for vertebrate species, including detailed annotation of gene structures, regulatory elements, and variants, as well as inferring the evolutionary history of genes and genomes. They offer integrated genomic data through various means such as genome browsers, search interfaces, specialist tools, and download files. Recent developments include the Ensembl Rapid Release and the SARS-CoV-2 genome browser, aiding in the international scientific response to the COVID-19 pandemic.

NUCLEIC ACIDS RESEARCH (2021)

Article Computer Science, Artificial Intelligence

DeepSite: bidirectional LSTM and CNN models for predicting DNA-protein binding

Yongqing Zhang et al.

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS (2020)

Article Biochemical Research Methods

Enhancer prediction in the human genome by probabilistic modelling of the chromatin feature patterns

Maria Osmala et al.

BMC BIOINFORMATICS (2020)

Article Microbiology

Benchmarking Bacterial Promoter Prediction Tools: Potentialities and Limitations

Murilo Henrique Anzolini Cassiano et al.

MSYSTEMS (2020)

Proceedings Paper Biochemical Research Methods

Deep Learning to Identify Transcription Start Sites from CAGE Data

Hansi Zheng et al.

2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (2020)

Article Genetics & Heredity

DeePromoter: Robust Promoter Predictor Using Deep Learning

Mhaned Oubounyt et al.

FRONTIERS IN GENETICS (2019)

Article Multidisciplinary Sciences

TransPrise: a novel machine learning approach for eukaryotic promoter prediction

Stepan Pachganov et al.

Article Biochemical Research Methods

Promoter analysis and prediction in the human genome using sequence-based deep learning models

Ramzan Umarov et al.

BIOINFORMATICS (2019)

Article Mathematical & Computational Biology

TISRover: ConvNets learn biologically relevant features for effective translation initiation site prediction

Jasper Zuallaert et al.

INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS (2018)

Article Biotechnology & Applied Microbiology

A universal SNP and small-indel variant caller using deep neural networks

Ryan Poplin et al.

NATURE BIOTECHNOLOGY (2018)

Article Biochemical Research Methods

bTSSfinder: a novel tool for the prediction of promoters in cyanobacteria and Escherichia coli

Ilham Ayub Shahmuradov et al.

BIOINFORMATICS (2017)

Article Biochemistry & Molecular Biology

TSSPlant: a new tool for prediction of plant Pol II promoters

Ilham A. Shahmuradov et al.

NUCLEIC ACIDS RESEARCH (2017)

Article Biochemical Research Methods

TITER: predicting translation initiation sites by deep learning

Sai Zhang et al.

BIOINFORMATICS (2017)

Article Computer Science, Hardware & Architecture

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky et al.

COMMUNICATIONS OF THE ACM (2017)

Article Mathematical & Computational Biology

Construction of precise support vector machine based models for predicting promoter strength

Hailin Meng et al.

QUANTITATIVE BIOLOGY (2017)

Article Computer Science, Interdisciplinary Applications

SD-MSAEs: Promoter recognition in human genome based on deep feature extraction

Wenxuan Xu et al.

JOURNAL OF BIOMEDICAL INFORMATICS (2016)

Article Biochemical Research Methods

SMOTE for high-dimensional class-imbalanced data

Rok Blagus et al.

BMC BIOINFORMATICS (2013)

Article Biochemistry & Molecular Biology

The sequence read archive: explosive growth of sequencing data

Yuichi Kodama et al.

NUCLEIC ACIDS RESEARCH (2012)

Article Multidisciplinary Sciences

High Sensitivity TSS Prediction: Estimates of Locations Where TSS Cannot Occur

Ulf Schaefer et al.

PLOS ONE (2010)

Article Biochemical Research Methods

Toward a gold standard for promoter prediction evaluation

Thomas Abeel et al.

BIOINFORMATICS (2009)

Article Biotechnology & Applied Microbiology

BioMart - biological queries made easy

Damian Smedley et al.

BMC GENOMICS (2009)

Article Biochemical Research Methods

Comparing sequences without using alignments: application to HIV/SIV subtyping

Gilles Didier et al.

BMC BIOINFORMATICS (2007)

Article Computer Science, Artificial Intelligence

The prediction of bacterial transcription start sites using SVMs

Michael W. Towsey et al.

INTERNATIONAL JOURNAL OF NEURAL SYSTEMS (2006)

Article Biochemical Research Methods

ARTS: accurate recognition of transcription starts in human

Soeren Sonnenburg et al.

BIOINFORMATICS (2006)

Review Biotechnology & Applied Microbiology

Performance assessment of promoter predictions on ENCODE regions in the EGASP experiment

Vladimir B. Bajic et al.

GENOME BIOLOGY (2006)

Article Biochemical Research Methods

RASE: recognition of alternatively spliced exons in C.elegans

G Rätsch et al.

BIOINFORMATICS (2005)

Article Biotechnology & Applied Microbiology

Promoter prediction analysis on the whole human genome

VB Bajic et al.

NATURE BIOTECHNOLOGY (2004)