4.6 Article

An ensemble solution for multivariate time series clustering

Journal

NEUROCOMPUTING
Volume 457, Issue -, Pages 182-192

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2020.09.093

Keywords

Multi-variate time series; Clustering; Prior knowledge; Ensemble of clustering

Funding

  1. Spanish Ministry of Science and Innovation under project MINECO [TIN2017-84804-R]
  2. Asturias Regional Government [FC-GRUPIN-IDI/2018/000226]
  3. Instituto para la Competitividad Empresarial de Castilla y Leon [CCTT2/18/BU/0002]

Ask authors/readers for more resources

This study proposes an ensemble of MTS clustering methods that merges different MTS representations and distance functions, which helps clustering practitioners in selecting suitable prototypes. The results show bias towards the best methods and emphasize the importance of metrics in guiding the clustering process. Further work involves the study of digital markers to compare MTS representations and distance functions, as well as using external metrics to balance the aggregation of methods.
Technologies such as Big Data and IoT have shown the need for intelligent unsupervised processing of Multivariate Time Series (MTS), MTS clustering among them. The challenges in MTS clustering includes not only the selection of the algorithm but also the MTS representation and the similarity measurement among the instances. This study proposes an ensemble of MTS clustering methods that merges different MTS representations and distance functions, aggregating them to obtain a similarity measurement. Furthermore, a proposal for prior knowledge representation is propose to balance the aggregation of the distances. The final clustering is performed either using k-means or hierarchical clustering. The experimentation set up includes the implementation of the ensemble with either 4 or 5 different methods, including an MTS extension of k-Shape. The results show that the ensemble is biased towards the best methods, which helps the clustering practitioner in the selection of the most suitable prototypes. Moreover, the evaluation of the ensemble with the number of clusters set to the number of labels shows that metrics, such as the sensitivity and specificity, must drive the rule of the elbow; alternatively, this value represents the most interesting prior knowledge bit in MTS clustering. Further work includes the study of digital markers to compare MTS representations and distance functions and the use of external metrics to balance the aggregation of the methods. (c) 2021 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available