Journal
SMALL METHODS
Volume 1, Issue 11, Pages -
Publisher
WILEY-V C H VERLAG GMBH
DOI: 10.1002/smtd.201700139
Keywords
clustering; data-integration; descriptors; omics; toxicogenomics
Funding
- European Union [604134, FP7-NMP-2013-SMALL-7]
Abstract
Interest in omics data is growing in toxicology because the diverse knowledge they generate can improve prediction and dosage profiling, enabling more accurate safety assessment. An integration methodology is presented in which high-throughput omics data are enriched with biological-pathway information to produce a novel set of biological (BIO) descriptors. The enriched data are decomposed into clusters that are meaningful in terms of both mechanistic interpretation and correlation affinity. A generalized simulated annealing algorithm estimates the optimal partition of the enriched data, and the novel descriptors are then produced based on gene content similarity. Because BIO descriptors are characterized by the pathway information fused to the data, they refer to groups of genes with similar biological implications rather than to specific genes, which could vary across studies. Applied to an extensive proteomics data set, the methodology demonstrates that BIO descriptors benefit predictive modeling: they outperform the original omics data in prediction accuracy and offer a readily available biological interpretation of the findings.
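The partitioning step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it uses a plain Metropolis acceptance rule with geometric cooling rather than the generalized simulated annealing variant named above, and the function `anneal_partition`, its parameters, and the toy similarity matrix are all hypothetical. The matrix stands in for a pairwise gene similarity that has already been blended from correlation and shared-pathway information and shifted so that dissimilar pairs score below zero; how the paper actually builds that quantity is not stated here.

```python
import math
import random


def anneal_partition(sim, k, steps=5000, t0=1.0, cooling=0.999, seed=0):
    """Search for a k-cluster labelling that maximizes the summed
    within-cluster similarity (hypothetical objective, for illustration)."""
    rng = random.Random(seed)
    n = len(sim)
    labels = [rng.randrange(k) for _ in range(n)]
    # Score of the random starting partition: sum over same-cluster pairs.
    score = sum(sim[i][j] for i in range(n) for j in range(i + 1, n)
                if labels[i] == labels[j])
    best, best_score = labels[:], score
    t = t0
    for _ in range(steps):
        # Propose moving one randomly chosen item to a random cluster.
        i, new = rng.randrange(n), rng.randrange(k)
        old = labels[i]
        if new == old:
            continue
        gain = sum(sim[i][j] for j in range(n) if j != i and labels[j] == new)
        loss = sum(sim[i][j] for j in range(n) if j != i and labels[j] == old)
        delta = gain - loss
        # Metropolis rule: always accept improvements, and accept
        # worsening moves with probability exp(delta / t).
        if delta >= 0 or rng.random() < math.exp(delta / t):
            labels[i] = new
            score += delta
            if score > best_score:
                best, best_score = labels[:], score
        t *= cooling  # geometric cooling schedule (an assumption)
    return best, best_score


# Toy data (invented): six "genes" in two groups of three. Pairs in the
# same group have similarity +0.4, pairs in different groups -0.4.
groups = [0, 0, 0, 1, 1, 1]
sim = [[0.0 if i == j else (0.4 if groups[i] == groups[j] else -0.4)
        for j in range(6)] for i in range(6)]

labels, score = anneal_partition(sim, k=2)
print(labels, round(score, 3))
```

Under these assumptions, each recovered cluster would be the group of genes behind one descriptor. Because the similarity is shifted to allow negative values, merging everything into a single cluster is penalized, so the objective does not collapse to a trivial partition.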