4.6 Article

A Coding Basis and Three-in-One Integrated Data Visualization Method 'Ana' for the Rapid Analysis of Multidimensional Omics Dataset

期刊

LIFE-BASEL
卷 12, 期 11, 页码 -

出版社

MDPI
DOI: 10.3390/life12111864

关键词

multidimensional dataset; omics; 3D heatmap; hierarchical clustering analysis; principal component analysis; olive; phenolics; phytochemical; MATLAB((R))

资金

  1. California Department of Food and Agriculture, 2020 Specialty Crop Block Grant Program [20-0001-033-SF]

向作者/读者索取更多资源

With the advancements in analytical instruments and computer technology, omics studies based on statistical analysis have become increasingly popular in food chemistry and nutrition science. However, the labor-intensive data processing remains a challenge for researchers without coding backgrounds. In this work, a MATLAB(R) coding basis and three-in-one integrated method called 'Ana' was developed to address this issue. The program allows for rapid and efficient data visualization and statistical analysis, providing researchers with a convenient tool for their analysis needs.
With innovations and advancements in analytical instruments and computer technology, omics studies based on statistical analysis, such as phytochemical omics, oilomics/lipidomics, proteomics, metabolomics, and glycomics, are increasingly popular in the areas of food chemistry and nutrition science. However, a remaining hurdle is the labor-intensive data process because learning coding skills and software operations are usually time-consuming for researchers without coding backgrounds. A MATLAB((R)) coding basis and three-in-one integrated method, 'Ana', was created for data visualizations and statistical analysis in this work. The program loaded and analyzed an omics dataset from an Excel(R) file with 7 samples * 22 compounds as an example, and output six figures for three types of data visualization, including a 3D heatmap, heatmap hierarchical clustering analysis, and principal component analysis (PCA), in 18 s on a personal computer (PC) with a Windows 10 system and in 20 s on a Mac with a MacOS Monterey system. The code is rapid and efficient to print out high-quality figures up to 150 or 300 dpi. The output figures provide enough contrast to differentiate the omics dataset by both color code and bar size adjustments per their higher or lower values, allowing the figures to be qualified for publication and presentation purposes. It provides a rapid analysis method that would liberate researchers from labor-intensive and time-consuming manual or coding basis data analysis. A coding example with proper code annotations and completed user guidance is provided for undergraduate and postgraduate students to learn coding basis statistical data analysis and to help them utilize such techniques for their future research.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据