☆ 4.5 Article

Time series shapelets: a novel technique that allows accurate, interpretable and fast classification

DATA MINING AND KNOWLEDGE DISCOVERY (2011)

Journal

DATA MINING AND KNOWLEDGE DISCOVERY

Volume 22, Issue 1-2, Pages 149-182

Publisher

SPRINGER

DOI: 10.1007/s10618-010-0179-5

Keywords

Time series; Data mining; Classification; Decision tree

Funding

NSF [0803410, 0808770]
Direct For Computer & Info Scie & Enginr
Div Of Information & Intelligent Systems [0808770] Funding Source: National Science Foundation
Div Of Information & Intelligent Systems
Direct For Computer & Info Scie & Enginr [0803410] Funding Source: National Science Foundation

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Classification of time series has been attracting great interest over the past decade. While dozens of techniques have been introduced, recent empirical evidence has strongly suggested that the simple nearest neighbor algorithm is very difficult to beat for most time series problems, especially for large-scale datasets. While this may be considered good news, given the simplicity of implementing the nearest neighbor algorithm, there are some negative consequences of this. First, the nearest neighbor algorithm requires storing and searching the entire dataset, resulting in a high time and space complexity that limits its applicability, especially on resource-limited sensors. Second, beyond mere classification accuracy, we often wish to gain some insight into the data and to make the classification result more explainable, which global characteristics of the nearest neighbor cannot provide. In this work we introduce a new time series primitive, time series shapelets, which addresses these limitations. Informally, shapelets are time series subsequences which are in some sense maximally representative of a class. We can use the distance to the shapelet, rather than the distance to the nearest neighbor to classify objects. As we shall show with extensive empirical evaluations in diverse domains, classification algorithms based on the time series shapelet primitives can be interpretable, more accurate, and significantly faster than state-of-the-art classifiers.

Time series shapelets: a novel technique that allows accurate, interpretable and fast classification

Journal

DATA MINING AND KNOWLEDGE DISCOVERY

Publisher

SPRINGER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Time series shapelets: a novel technique that allows accurate, interpretable and fast classification

Journal

DATA MINING AND KNOWLEDGE DISCOVERY

Publisher

SPRINGER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper