Journal
NATURE REVIEWS GENETICS
Volume 16, Issue 6, Pages 321-332Publisher
NATURE PUBLISHING GROUP
DOI: 10.1038/nrg3920
Keywords
-
Categories
Funding
- NCI NIH HHS [R01 CA180777] Funding Source: Medline
Ask authors/readers for more resources
The field of machine learning, which aims to develop computer algorithms that improve with experience, holds promise to enable computers to assist humans in the analysis of large, complex data sets. Here, we provide an overview of machine learning applications for the analysis of genome sequencing data sets, including the annotation of sequence elements and epigenetic, proteomic or metabolomic data. We present considerations and recurrent challenges in the application of supervised, semi-supervised and unsupervised machine learning methods, as well as of generative and discriminative modelling approaches. We provide general guidelines to assist in the selection of these machine learning methods and their practical application for the analysis of genetic and genomic data sets.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available