Proceedings Paper

Controlling False Discoveries During Interactive Data Exploration

Publisher

Association for Computing Machinery (ACM)
DOI: 10.1145/3035918.3064019

Funding

  1. Intel Science and Technology Center for Big Data
  2. DARPA Award [16-43-D3M-FP-040]
  3. NSF CAREER Award [IIS-1453171]
  4. NSF [IIS-1562657, IIS-1514491]
  5. Air Force YIP Award [FA9550-15-1-0144]
  6. Directorate for Computer and Information Science and Engineering
  7. Division of Information and Intelligent Systems [1514491] (funding source: National Science Foundation)

Recent tools for interactive data exploration significantly increase the chance that users make false discoveries. They let users (visually) examine many hypotheses and draw inferences with simple interactions, and so run into the problem known in statistics as multiple hypothesis testing error. In this work, we propose a solution that integrates control of multiple hypothesis testing error into interactive data exploration systems. A key insight is that existing procedures for controlling false discoveries (for example, those that bound the false discovery rate, FDR) are not directly applicable to interactive data exploration. We therefore discuss a set of new control procedures that are better suited to this task and integrate them into our system, QUDE. Through extensive experiments on both real-world and synthetic data sets, we demonstrate how QUDE can help experts and novice users alike to control false discoveries efficiently.
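For readers unfamiliar with the statistical background, the following is a minimal sketch of two textbook ideas the abstract refers to: how quickly the chance of at least one false positive grows with the number of tests, and the standard Benjamini-Hochberg procedure for bounding the FDR. It is not the paper's method and not QUDE's implementation; the function names and the sample p-values are hypothetical and chosen only for illustration.

```python
def family_wise_error(m, alpha=0.05):
    """Chance of at least one false positive across m independent tests,
    each run at significance level alpha, when every null hypothesis is true."""
    return 1 - (1 - alpha) ** m


def benjamini_hochberg(p_values, alpha=0.05):
    """Textbook Benjamini-Hochberg step-up procedure (illustrative only).

    Returns a list of booleans, True where the hypothesis is rejected.
    """
    m = len(p_values)
    # Indices of the hypotheses, ordered by ascending p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest 1-based rank k with p_(k) <= (k / m) * alpha.
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            k_max = rank
    # Reject the k_max hypotheses with the smallest p-values.
    rejected = [False] * m
    for idx in order[:k_max]:
        rejected[idx] = True
    return rejected


# Hypothetical p-values from ten tests, for illustration only.
example_p_values = [0.001, 0.008, 0.039, 0.041, 0.042,
                    0.06, 0.074, 0.205, 0.212, 0.216]
print(family_wise_error(20))
print(benjamini_hochberg(example_p_values))
```

Note that this batch procedure takes the complete list of p-values as input, whereas in an exploration session hypotheses arrive one at a time. Whether that is the specific obstacle the authors have in mind is not stated in the abstract.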
