4.5 Article Proceedings Paper

Identifying consistent statements about numerical data with dispersion-corrected subgroup discovery

Journal

DATA MINING AND KNOWLEDGE DISCOVERY
Volume 31, Issue 5, Pages 1391-1418

Publisher

SPRINGER
DOI: 10.1007/s10618-017-0520-3

Keywords

Subgroup discovery; Local pattern discovery; Branch-and-bound search

Funding

  1. Max Planck Society
  2. Cluster of Excellence Multimodal Computing and Interaction within the Excellence Initiative of the German Federal Government
  3. Alexander von Humboldt-Foundation
  4. European Union's Horizon 2020 research and innovation program [676580]
  5. Novel Materials Discovery (NOMAD) Laboratory, a European Center of Excellence

Ask authors/readers for more resources

Existing algorithms for subgroup discovery with numerical targets do not optimize the error or target variable dispersion of the groups they find. This often leads to unreliable or inconsistent statements about the data, rendering practical applications, especially in scientific domains, futile. Therefore, we here extend the optimistic estimator framework for optimal subgroup discovery to a new class of objective functions: we show how tight estimators can be computed efficiently for all functions that are determined by subgroup size (non-decreasing dependence), the subgroup median value, and a dispersion measure around the median (non-increasing dependence). In the important special case when dispersion is measured using the mean absolute deviation from the median, this novel approach yields a linear time algorithm. Empirical evaluation on a wide range of datasets shows that, when used within branch-and-bound search, this approach is highly efficient and indeed discovers subgroups with much smaller errors.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available