4.6 Article

Deep semi-supervised clustering for multi-variate time-series

期刊

NEUROCOMPUTING
卷 516, 期 -, 页码 36-47

出版社

ELSEVIER
DOI: 10.1016/j.neucom.2022.10.033

关键词

Multi-variate time series; Clustering; Semi-Supervised; Constrained clustering

向作者/读者索取更多资源

A huge amount of data from sensors can be organized as multivariate time series. In a limited background knowledge setting, semi-supervised clustering methods can effectively utilize a small amount of knowledge. We propose a constrained deep embedding time series clustering framework that manages the temporal dimension and exploits Must-link and Cannot-link constraints for better clustering results.
Huge amount of data are nowadays produced by a large and disparate family of sensors, which typically measure multiple variables over time. Such rich information can be profitably organized as multivariate time-series. Collect enough labelled samples to set up supervised analysis for such kind of data is chal-lenging while a reasonable assumption is to dispose of a limited background knowledge that can be injected in the analysis process. In this context, semi-supervised clustering methods represent a well sui-ted tool to get the most out of such reduced amount of knowledge. With the aim to deal with multivariate time-series analysis under a limited background knowledge setting, we propose a semi-supervised (con-strained) deep embedding time-series clustering framework that exploits knowledge supervision mod-eled as Must-and Cannot-link constraints. More in detail, our proposal, named conDetSEC (constrained Deep embedding time SEries Clustering), is based on Gated Recurrent Units (GRUs) with the aim to explicitly manage the temporal dimension associated to multi-variate time series data. conDetSEC implements a procedure in which an embedding generation step is combined with a cluster-ing refinement step. Both steps exploit the small amount of available knowledge provided by Must-and Cannot-link constraints. More specifically, during the data embedding generation the constraints are used by jointly optimizing the network parameters via both unsupervised and semi-supervised tasks, while at the refinement step they are used in conjunction with the goal to stretch the embedding man-ifold towards the clustering centroids to recover a more clear cluster structure. Experimental evaluation on real-world benchmarks coming from diverse domains has highlighted the effectiveness of our pro-posal in comparison with state-of-the-art unsupervised and semi-supervised time-series clustering methods. (c) 2022 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据