☆ 4.6 Article

R.ROSETTA: an interpretable machine learning framework

BMC BIOINFORMATICS (2021)

期刊

BMC BIOINFORMATICS

卷 22, 期 1, 页码 -

出版社

BMC

DOI: 10.1186/s12859-021-04049-z

关键词

Transcriptomics; Interpretable machine learning; Big data; Rough sets; Rule-based classification; R package

类别

Biochemical Research Methods Biotechnology & Applied Microbiology Mathematical & Computational Biology

资金

Uppsala University
Foundation for the National Institutes of Health [0925-0001]
Uppsala University, Sweden
eSSENCE grant
Swedish Research Council [2017-01861]
Polish Academy of Sciences
Swedish Research Council [2017-01861] Funding Source: Swedish Research Council

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The R.ROSETTA package is an R wrapper of the ROSETTA framework, allowing for the construction and analysis of non-linear interpretable machine learning models. It gathers combinatorial statistics via rule-based modelling for transparent results, suitable for adoption within the wider scientific community. The package also provides statistical and visualization tools to minimize analysis bias and noise.

BackgroundMachine learning involves strategies and algorithms that may assist bioinformatics analyses in terms of data mining and knowledge discovery. In several applications, viz. in Life Sciences, it is often more important to understand how a prediction was obtained rather than knowing what prediction was made. To this end so-called interpretable machine learning has been recently advocated. In this study, we implemented an interpretable machine learning package based on the rough set theory. An important aim of our work was provision of statistical properties of the models and their components.ResultsWe present the R.ROSETTA package, which is an R wrapper of ROSETTA framework. The original ROSETTA functions have been improved and adapted to the R programming environment. The package allows for building and analyzing non-linear interpretable machine learning models. R.ROSETTA gathers combinatorial statistics via rule-based modelling for accessible and transparent results, well-suited for adoption within the greater scientific community. The package also provides statistics and visualization tools that facilitate minimization of analysis bias and noise. The R.ROSETTA package is freely available at https://github.com/komorowskilab/R.ROSETTA. To illustrate the usage of the package, we applied it to a transcriptome dataset from an autism case-control study. Our tool provided hypotheses for potential co-predictive mechanisms among features that discerned phenotype classes. These co-predictors represented neurodevelopmental and autism-related genes.ConclusionsR.ROSETTA provides new insights for interpretable machine learning analyses and knowledge-based systems. We demonstrated that our package facilitated detection of dependencies for autism-related genes. Although the sample application of R.ROSETTA illustrates transcriptome data analysis, the package can be used to analyze any data organized in decision tables.

R.ROSETTA: an interpretable machine learning framework

期刊

BMC BIOINFORMATICS

出版社

BMC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

R.ROSETTA: an interpretable machine learning framework

期刊

BMC BIOINFORMATICS

出版社

BMC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文