4.5 Article

easyMF: A Web Platform for Matrix Factorization-Based Gene Discovery from Large-scale Transcriptome Data

出版社

SPRINGER HEIDELBERG
DOI: 10.1007/s12539-022-00522-2

关键词

Gene discovery; Maize; Matrix factorization; Metagene; Seed genes; Transcriptome

资金

  1. National Natural Science Foundation of China [31570371]
  2. Youth 1000-Talent Program of China
  3. Hundred Talents Program of Shaanxi Province of China
  4. Projects of Youth Technology New Star of Shaanxi Province [2017KJXX-67]
  5. Fundamental Research Funds for the Central Universities [2452020041]

向作者/读者索取更多资源

In this study, we developed easyMF, a web platform that utilizes matrix factorization algorithms for functional gene discovery from large-scale transcriptome data. Compared with existing software, easyMF offers greater functionality, flexibility, and ease of use. The platform is equipped with user-friendly graphic user interfaces and supports various analyses, including transcriptome analysis, multiple-scenario matrix factorization analysis, and multiple-way gene discovery. We applied easyMF to maize RNA-Seq datasets and successfully identified numerous seed-specific genes. Additionally, easyMF outperformed other systems in gene prioritization.
With the development of high-throughput experimental technologies, large-scale RNA sequencing (RNA-Seq) data have been and continue to be produced, but have led to challenges in extracting relevant biological knowledge hidden in the produced high-dimensional gene expression matrices. Here, we develop easyMF (https://github.com/cma2015/easyMF), a web platform that can facilitate functional gene discovery from large-scale transcriptome data using matrix factorization (MF) algorithms. Compared with existing MF-based software packages, easyMF exhibits several promising features, such as greater functionality, flexibility and ease of use. The easyMF platform is equipped using the Big-Data-supported Galaxy system with user-friendly graphic user interfaces, allowing users with little programming experience to streamline transcriptome analysis from raw reads to gene expression, carry out multiple-scenario MF analysis, and perform multiple-way MF-based gene discovery. easyMF is also powered with the advanced packing technology to enhance ease of use under different operating systems and computational environments. We illustrated the application of easyMF for seed gene discovery from temporal, spatial, and integrated RNA-Seq datasets of maize (Zea mays L.), resulting in the identification of 3,167 seed stage-specific, 1,849 seed compartment-specific, and 774 seed-specific genes, respectively. The present results also indicated that easyMF can prioritize seed-related genes with superior prediction performance over the state-of-art network-based gene prioritization system MaizeNet. As a modular, containerized and open-source platform, easyMF can be further customized to satisfy users' specific demands of functional gene discovery and deployed as a web service for broad applications. [GRAPHICS] .

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据