4.7 Article

SE-shapelets: Semi-supervised Clustering of Time Series Using Representative Shapelets

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 240, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2023.122584

关键词

Time series clustering; Shapelet discovery; Linear discriminant selection

向作者/读者索取更多资源

This paper proposes a semi-supervised time series clustering method (SE-Shapelets) that utilizes a small number of labeled and propagated pseudo-labeled time series to discover representative shapelets and improve clustering accuracy. The method uses a salient subsequence chain (SSC) and a linear discriminant selection (LDS) algorithm to discover and select shapelets, and experiments show higher clustering accuracy compared to other semi-supervised time series clustering methods.
Shapelets that discriminate time series using local features (subsequences) are promising for time series clustering. Existing time series clustering methods may fail to capture representative shapelets because they discover shapelets from a large pool of uninformative subsequences, and thus result in low clustering accuracy. This paper proposes a Semi-supervised Clustering of Time Series Using Representative Shapelets (SE-Shapelets) method, which utilizes a small number of labeled and propagated pseudo-labeled time series to help discover representative shapelets, thereby improving the clustering accuracy. In SE-Shapelets, we propose two techniques to discover representative shapelets for the effective clustering of time series. (1) A salient subsequence chain (SSC) that can extract salient subsequences (as candidate shapelets) of a labeled/pseudo-labeled time series, which helps remove massive uninformative subsequences from the pool. (2) A linear discriminant selection (LDS) algorithm to identify shapelets that can capture representative local features of time series in different classes, for convenient clustering. Experiments on UCR time series datasets demonstrate that SE-shapelets discovers representative shapelets and achieves higher clustering accuracy than counterpart semi-supervised time series clustering methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据