4.5 Article

Convex covariate clustering for classification

Journal

PATTERN RECOGNITION LETTERS
Volume 151, Issue -, Pages 193-199

Publisher

ELSEVIER
DOI: 10.1016/j.patrec.2021.08.012

Keywords

Alternating direction method of multipliers; Convex optimization; Model selection; Marginal likelihood; Text classification

Ask authors/readers for more resources

This paper presents a covariate clustering method that takes into account sample class label information and formulates the problem as a convex optimization problem. Experimental results confirm the high utility and effectiveness of the proposed method.
Clustering, like covariate selection for classification, is an important step to compress and interpret the data. However, clustering of covariates is often performed independently of the classification step, which can lead to undesirable clustering results that harm interpretability and compression rate. Therefore, we propose a method that can cluster covariates while taking into account class label information of samples. We formulate the problem as a convex optimization problem which uses both, a-priori similarity information between covariates, and information from class-labeled samples. Like ordinary convex clustering [1], the proposed method offers a unique global minima making it insensitive to initialization. In order to solve the convex problem, we propose a specialized alternating direction method of multipliers (ADMM), which scales up to several thousands of variables. Furthermore, in order to circumvent computationally expensive cross-validation, we propose a model selection criterion based on approximating the marginal likelihood. Experiments on synthetic and real data confirm the usefulness of the proposed clustering method and the selection criterion. (c) 2021 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license ( http://creativecommons.org/licenses/by/4.0/ )

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available