Proceedings Paper

Numerical Pattern Mining Through Compression

Journal

2019 DATA COMPRESSION CONFERENCE (DCC)
Pages 112-121

Publisher

IEEE
DOI: 10.1109/DCC.2019.00019

Funding

  1. Russian Science Foundation [17-11-01294]

Pattern Mining (PM) has a prominent place in Data Science and is applied in a wide range of domains. To avoid the exponential explosion of patterns, different methods have been proposed. They rely on assumptions about interestingness and usually return very different pattern sets. In this paper, we propose using a compression-based objective as a well-justified and robust interestingness measure. We define description lengths for datasets and use the Minimum Description Length (MDL) principle to find the patterns that ensure the best compression. Our experiments show that applying MDL to numerical data yields small and characteristic subsets of patterns that describe the data compactly.
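
To make the compression-based objective concrete, here is a minimal sketch of a two-part MDL score for one-dimensional integer data. It is not the encoding defined in the paper: the function `description_length`, the use of closed intervals as patterns, the fixed grid size, and the toy values are all assumptions chosen for illustration. The score is the number of bits needed to describe the patterns, L(M), plus the bits needed to describe the data given those patterns, L(D | M); the candidate pattern set with the smallest total is preferred.

```python
import math

def description_length(values, intervals, grid_size):
    """Two-part code length in bits, L(M) + L(D | M), under a toy encoding.

    Hypothetical encoding (an assumption, not the paper's):
    - each interval pattern is stored as two endpoints on the grid;
    - each value stores which pattern covers it (or "none"), then its
      position inside that pattern, or on the whole grid if uncovered.
    """
    # L(M): two grid coordinates per interval.
    model_bits = len(intervals) * 2 * math.log2(grid_size)
    # Bits to name one of the patterns, or the extra "no pattern" symbol.
    index_bits = math.log2(len(intervals) + 1)
    data_bits = 0.0
    for v in values:
        widths = [hi - lo + 1 for lo, hi in intervals if lo <= v <= hi]
        # Use the narrowest covering interval; fall back to the full grid.
        span = min(widths) if widths else grid_size
        data_bits += index_bits + math.log2(span)
    return model_bits + data_bits

# Toy data: two tight clusters on a grid of 256 integer positions.
values = [10, 11, 12, 13, 200, 201, 202, 203]
grid_size = 256

candidates = {
    "no patterns": [],
    "one wide interval": [(10, 203)],
    "two tight intervals": [(10, 13), (200, 203)],
}
scores = {name: description_length(values, ivs, grid_size)
          for name, ivs in candidates.items()}
best = min(scores, key=scores.get)
print(best, round(scores[best], 2))
```

Under this toy encoding, the wide interval costs more than using no patterns at all, because it barely narrows down where each value lies, while the two tight intervals pay for themselves. This is the sense in which a compression objective can favour a small, characteristic pattern set.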
