4.5 Article

Mint: MDL-based approach for Mining INTeresting Numerical Pattern Sets

Journal

DATA MINING AND KNOWLEDGE DISCOVERY
Volume 36, Issue 1, Pages 108-145

Publisher

SPRINGER
DOI: 10.1007/s10618-021-00799-9

Keywords

Numerical Pattern Mining; Minimum Description Length principle; Plug-in codes; Numerical Data; Hyper-rectangles

Funding

  1. Basic Research Program of the National Research University Higher School of Economics

Ask authors/readers for more resources

Research shows that there is great potential for development in the field of numerical dataset mining. The Mint algorithm proposed in this paper, based on the MDL principle, is able to discover useful, non-redundant, overlapping patterns that cover meaningful groups of objects.
Pattern mining is well established in data mining research, especially for mining binary datasets. Surprisingly, there is much less work about numerical pattern mining and this research area remains under-explored. In this paper we propose Mint, an efficient MDL-based algorithm for mining numerical datasets. The MDL principle is a robust and reliable framework widely used in pattern mining, and as well in subgroup discovery. In Mint we reuse MDL for discovering useful patterns and returning a set of non-redundant overlapping patterns with well-defined boundaries and covering meaningful groups of objects. Mint is not alone in the category of numerical pattern miners based on MDL. In the experiments presented in the paper we show that Mint outperforms competitors among which IPD, RealKrimp, and Slim.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available