3.8 Article

A Geometric Perspective on Functional Outlier Detection

期刊

STATS
卷 4, 期 4, 页码 971-1011

出版社

MDPI
DOI: 10.3390/stats4040057

关键词

functional data analysis; outlier detection; manifold learning; dimension reduction; multidimensional scaling; local outlier factors

资金

  1. German Federal Ministry of Education and Research (BMBF) [01IS18036A]

向作者/读者索取更多资源

Functional outlier detection is explored from a geometric perspective, with a focus on functional datasets drawn from a functional manifold. The conceptualization based on this manifold allows for improved theoretical understanding and practical feasibility in describing and analyzing complex functional outlier scenarios. The use of manifold-learning methods and vector-valued representations learned from these methods enables successful outlier detection performances on a variety of datasets.
We consider functional outlier detection from a geometric perspective, specifically: for functional datasets drawn from a functional manifold, which is defined by the data's modes of variation in shape, translation, and phase. Based on this manifold, we developed a conceptualization of functional outlier detection that is more widely applicable and realistic than previously proposed taxonomies. Our theoretical and experimental analyses demonstrated several important advantages of this perspective: it considerably improves theoretical understanding and allows describing and analyzing complex functional outlier scenarios consistently and in full generality, by differentiating between structurally anomalous outlier data that are off-manifold and distributionally outlying data that are on-manifold, but at its margins. This improves the practical feasibility of functional outlier detection: we show that simple manifold-learning methods can be used to reliably infer and visualize the geometric structure of functional datasets. We also show that standard outlier-detection methods requiring tabular data inputs can be applied to functional data very successfully by simply using their vector-valued representations learned from manifold learning methods as the input features. Our experiments on synthetic and real datasets demonstrated that this approach leads to outlier detection performances at least on par with existing functional-data-specific methods in a large variety of settings, without the highly specialized, complex methodology and narrow domain of application these methods often entail.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据