4.7 Article

Variable-Length Subsequence Clustering in Time Series

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2020.2986965

关键词

Time series analysis; Data mining; Optimization; Clustering algorithms; Clustering methods; Adaptation models; Feature extraction; Time series data mining; subsequence clustering; variable-length patterns; time series segmentation

资金

  1. National Natural Science Foundation of China [61901454, 61971404, 61501434]
  2. Youth Innovation Promotion Association CAS [2019168]
  3. Foundation of key Laboratory of Space Utilization, Technology and Engineering Center for Space utilization Chinese Academy of Sciences [CSU-QZKT-2018-08]

向作者/读者索取更多资源

This paper proposes an optimization framework for adaptively estimating the lengths and representations of different patterns in subsequence clustering. By minimizing the errors in subsequence clustering and segmentation under time series cover constraint, our framework can automatically extract unknown variable-length subsequence clusters in time series.
Subsequence clustering is an important issue in time series data mining. Observing that most time series consist of various patterns with different unknown lengths, we propose an optimization framework to adaptively estimate the lengths and representations for different patterns. Our framework minimizes the inner subsequence cluster errors with respect to subsequence clusters and segmentation under time series cover constraint where the subsequence cluster lengths can be variable. To optimize our framework, we first generate abundant initial subsequence clusters with different lengths. Then, three cluster operations, i.e., cluster splitting, combination and removing, are used to iteratively refine the cluster lengths and representations by respectively splitting clusters consisting of different patterns, joining neighboring clusters belonging to the same pattern and removing clusters to the predefined cluster number. During each cluster refinement, we employ an efficient algorithm to alternatively optimize subsequence clusters and segmentation based on dynamic programming. Our method can automatically and efficiently extract the unknown variable-length subsequence clusters in the time series. Comparative results with the state-of-the-art are conducted on various synthetic and real time series, and quantitative and qualitative performances demonstrate the effectiveness of our method.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据