Article

Optimization and expansion of non-negative matrix factorization

BMC BIOINFORMATICS (2020)

Journal

BMC BIOINFORMATICS

Volume 21, Issue 1, Pages -

Publisher

BMC

DOI: 10.1186/s12859-019-3312-5

Keywords

Non-negative matrix factorization; Deconvolution; Imputation

Categories

Biochemical Research Methods; Biotechnology & Applied Microbiology; Mathematical & Computational Biology

Funding

Ontario Institute for Cancer Research through Government of Ontario
Prostate Cancer Canada
Movember Foundation [RS2014-01]
Terry Fox Research Institute New Investigator Award
CIHR New Investigator Award
Genome Canada
Natural Sciences and Engineering Research Council (NSERC) of Canada
Canadian Institutes of Health Research (CIHR)
Canada Foundation for Innovation (CFI)
Government of Canada through Genome Canada
Ontario Genomics Institute [OGI-125]
Discovery Frontiers: Advancing Big Data Science in Genomics Research program

Abstract

Background: Non-negative matrix factorization (NMF) is a technique widely used in various fields, including artificial intelligence (AI), signal processing and bioinformatics. However, existing algorithms and R packages cannot be applied to large matrices, because they converge slowly, or to matrices with missing entries. In addition, most NMF research focuses only on blind decompositions, that is, decompositions that use no prior knowledge. Finally, the lack of a well-validated methodology for choosing the rank hyperparameter raises concerns about derived results.

Results: We adapt the idea of sequential coordinate-wise descent to NMF to increase the convergence rate. We demonstrate that NMF can handle missing values naturally, and that this property leads to a novel method for determining the rank hyperparameter. Further, we demonstrate some novel applications of NMF and show how to use masking to inject prior knowledge and desirable properties to achieve a more meaningful decomposition.

Conclusions: We show through complexity analysis and experiments that our implementation converges faster than well-known methods. We also show that using NMF for tumour content deconvolution can achieve results similar to existing methods such as ISOpure. Our proposed missing value imputation is more accurate than conventional methods such as multiple imputation, and comparable to missForest while achieving significantly better computational efficiency. Finally, we argue that the suggested rank tuning method based on missing value imputation is theoretically superior to existing methods. All algorithms are implemented in the R package NNLM, which is freely available on CRAN and GitHub.
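The idea that NMF can skip missing entries, and that held-out entries can then guide the choice of rank, can be illustrated with a minimal sketch. This is not the NNLM implementation and not the sequential coordinate-wise descent solver described in the abstract: it assumes the simpler, well-known multiplicative update rule with a 0/1 observation mask, and the function names `masked_nmf` and `heldout_error` are hypothetical, chosen only for this illustration.

```python
import numpy as np


def masked_nmf(A, k, n_iter=500, eps=1e-9, seed=0):
    """Approximate A by W @ H with W, H >= 0, ignoring NaN entries of A.

    Assumption: multiplicative updates for squared error, restricted to
    observed entries. The paper itself uses a different (faster) solver.
    """
    rng = np.random.default_rng(seed)
    observed = ~np.isnan(A)               # True where an entry is known
    A0 = np.where(observed, A, 0.0)       # missing entries contribute nothing
    n, m = A.shape
    W = rng.random((n, k)) + 0.1
    H = rng.random((k, m)) + 0.1
    for _ in range(n_iter):
        WH = (W @ H) * observed           # compare only on observed entries
        H *= (W.T @ A0) / (W.T @ WH + eps)
        WH = (W @ H) * observed
        W *= (A0 @ H.T) / (WH @ H.T + eps)
    return W, H


def heldout_error(A, k, frac=0.1, seed=0):
    """Hide a random fraction of known entries, refit, and score them.

    Returns the mean squared error of W @ H on the hidden entries; a rank
    that imputes hidden entries well is a candidate for the true rank.
    """
    rng = np.random.default_rng(seed)
    hide = (rng.random(A.shape) < frac) & ~np.isnan(A)
    A_train = A.astype(float).copy()
    A_train[hide] = np.nan
    W, H = masked_nmf(A_train, k, seed=seed)
    return float(np.mean((A[hide] - (W @ H)[hide]) ** 2))


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    A = rng.random((40, 3)) @ rng.random((3, 30))   # synthetic rank-3 data
    for k in range(1, 6):
        print(k, heldout_error(A, k))
```

In this sketch the rank would be chosen where the held-out error stops improving. Because the hidden entries never enter the fit, increasing the rank past the true value is not rewarded automatically, which is the intuition behind tuning rank through imputation.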
