4.7 Article

Multipartition clustering of mixed data with Bayesian networks

Journal

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS
Volume 37, Issue 3, Pages 2188-2218

Publisher

WILEY-HINDAWI
DOI: 10.1002/int.22770

Keywords

Bayesian networks; mixed data; model-based clustering; multipartition clustering

Funding

  1. Ministerio de Economia y Competitividad [PID2019-109247GB-I00]
  2. Horizon 2020 [945539]
  3. Fundacion BBVA

Ask authors/readers for more resources

This paper introduces a multipartition clustering method for mixed data, which efficiently handles multifaceted data with several reasonable interpretations by utilizing Bayesian network factorization and the variational Bayes framework.
Real-world applications often involve multifaceted data with several reasonable interpretations. To cluster this data, we need methods that are able to produce multiple clustering solutions. To this purpose, it is interesting to learn a finite mixture model with multiple latent variables, where each latent variable represents a unique way to partition the data. However, although there is an extensive literature on multipartition clustering methods for categorical data and for continuous data, there is a lack of work for mixed data. In this paper, we propose a multipartition clustering method that is able to efficiently deal with mixed data by exploiting the Bayesian network factorization and the variational Bayes framework. We show the flexibility and applicability of the proposed method by solving clustering, density estimation, and missing data imputation tasks in real-world data sets. For reproducibility, all code, data, and results can be found in the following public repository: .

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available