☆ 3.8 Article

Plant MicroRNA Prediction by Supervised Machine Learning Using C5.0 Decision Trees

JOURNAL OF NUCLEIC ACIDS (2012)

Journal

JOURNAL OF NUCLEIC ACIDS

Volume 2012, Issue -, Pages -

Publisher

HINDAWI LTD

DOI: 10.1155/2012/652979

Keywords

Funding

Australian Research Council [CE0348212, DP0879308]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

MicroRNAs (miRNAs) are nonprotein coding RNAs between 20 and 22 nucleotides long that attenuate protein production. Different types of sequence data are being investigated for novel miRNAs, including genomic and transcriptomic sequences. A variety of machine learning methods have successfully predicted miRNA precursors, mature miRNAs, and other nonprotein coding sequences. MirTools, mirDeep2, and miRanalyzer require read count to be included with the input sequences, which restricts their use to deep-sequencing data. Our aim was to train a predictor using a cross-section of different species to accurately predict miRNAs outside the training set. We wanted a system that did not require read-count for prediction and could therefore be applied to short sequences extracted from genomic, EST, or RNA-seq sources. A miRNA-predictive decision-tree model has been developed by supervised machine learning. It only requires that the corresponding genome or transcriptome is available within a sequence window that includes the precursor candidate so that the required sequence features can be collected. Some of the most critical features for training the predictor are the miRNA: miRNA* duplex energy and the number of mismatches in the duplex. We present a cross-species plant miRNA predictor with 84.08% sensitivity and 98.53% specificity based on rigorous testing by leave-one-out validation.

Plant MicroRNA Prediction by Supervised Machine Learning Using C5.0 Decision Trees

Journal

JOURNAL OF NUCLEIC ACIDS

Publisher

HINDAWI LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Plant MicroRNA Prediction by Supervised Machine Learning Using C5.0 Decision Trees

Journal

JOURNAL OF NUCLEIC ACIDS

Publisher

HINDAWI LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper