4.5 Article

Overall assessment for selected markers from high-throughput data

期刊

STATISTICS IN MEDICINE
卷 41, 期 30, 页码 5830-5843

出版社

WILEY
DOI: 10.1002/sim.9596

关键词

false discovery rate; reproducibility; selection adjustment; validation study

资金

  1. National Research Foundation of Korea [NRF-2021R1A2C1012865]
  2. National Research Foundation of Korea (NRF) - Korea government (MSIT) [2021R1A2C1014409]
  3. National Research Foundation of Korea [2021R1A2C1014409] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

向作者/读者索取更多资源

This paper focuses on reproducibility assessment in high-throughput studies and proposes a selection-adjusted false-discovery rate (sFDR) as an overall assessment measure. By integrating information from both training and validation studies and considering the effects of non-random selection, sFDR provides a more accurate evaluation. Simulation studies and real metabolomic datasets are used to illustrate the application of sFDR in high-throughput data analysis.
Reproducibility, a hallmark of science, is typically assessed in validation studies. We focus on high-throughput studies where a large number of biomarkers is measured in a training study, but only a subset of the most significant findings is selected and re-tested in a validation study. Our aim is to get the statistical measures of overall assessment for the selected markers, by integrating the information in both the training and validation studies. Naive statistical measures, such as the combined P$$ P $$-value by conventional meta-analysis, that ignore the non-random selection are clearly biased, producing over-optimistic significance. We use the false-discovery rate (FDR) concept to develop a selection-adjusted FDR (sFDR) as an overall assessment measure. We describe the link between the overall assessment and other concepts such as replicability and meta-analysis. Some simulation studies and two real metabolomic datasets are considered to illustrate the application of sFDR in high-throughput data analyses.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据