4.6 Article

Dimensionality reduction of longitudinal 'omics data using modern tensor factorizations

期刊

PLOS COMPUTATIONAL BIOLOGY
卷 18, 期 7, 页码 -

出版社

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pcbi.1010212

关键词

-

资金

  1. Israeli Council for Higher Education (CHE) via the Weizmann Data Science Research Center

向作者/读者索取更多资源

Longitudinal 'omics analytical methods are widely used in precision medicine to analyze complex datasets and uncover individual variations in response to perturbations. However, technical limitations often generate feature-rich and sample-limited datasets that are challenging to analyze using conventional methods. This study presents TCAM, a new unsupervised tensor factorization method, which preserves the geometric and statistical traits of the data and allows for out-of-sample extension. re-analyses of real-world datasets confirm TCAM's utility in analyzing longitudinal 'omics data.
Longitudinal 'omics analytical methods are extensively used in the field of evolving precision medicine, by enabling 'big data' recording and high-resolution interpretation of complex datasets, driven by individual variations in response to perturbations such as disease pathogenesis, medical treatment or changes in lifestyle. However, inherent technical limitations in biomedical studies often result in the generation of feature-rich and sample-limited datasets. Analyzing such data using conventional modalities often proves to be challenging since the repeated, high-dimensional measurements overload the outlook with inconsequential variations that must be filtered from the data in order to find the true, biologically relevant signal. Tensor methods for the analysis and meaningful representation of multi-way data may prove useful to the biological research community by their advertised ability to tackle this challenge. In this study, we present TCAM-a new unsupervised tensor factorization method for the analysis of multi-way data. Building on top of cutting-edge developments in the field of tensor-tensor algebra, we characterize the unique mathematical properties of our method, namely, 1) preservation of geometric and statistical traits of the data, which enables uncovering information beyond the inter-individual variation that often takes-over the focus, especially in human studies. 2) Natural and straightforward out-of-sample extension, making TCAM amenable for integration in machine learning workflows. A series of re-analyses of real-world, human experimental datasets showcase these theoretical properties, while providing empirical confirmation of TCAM's utility in the analysis of longitudinal 'omics data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据