4.6 Article

CEAT: Curvature Feature Extractor Using Action Based Triplet Learning for Action Segmentation

Journal

IEEE ACCESS
Volume 11, Issue -, Pages 79445-79454

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2023.3298960

Keywords

Action segmentation; Bezier curve approximation; contrastive learning


As untrimmed videos continue to proliferate on the internet, demand is growing for action segmentation methods that can accurately and semantically localize action sequences within lengthy videos. Traditional approaches attempt to overcome the prevalent problem of over-segmentation by smoothing the predictions of consecutive frames, but this can overlook important spatio-temporal characteristics. Other common strategies incorporate supplementary temporal data, which can be difficult to obtain in real-world scenarios. To address these problems more effectively, we propose a novel approach that constructs a geometric curve from frame-wise embeddings and extracts curvature features from it. This procedure lets us leverage the curvature information of the embedded vectors and integrate spatio-temporal information into existing action segmentation models. Our investigation shows that the curvature-based approach enriches the embedding representations, making them more suitable for action segmentation: it draws together the representations of similar actions from different videos while pushing apart dissimilar action frames from the same video. Consequently, our experimental results provide substantial evidence that incorporating curvature information into various existing action segmentation models can significantly improve action segmentation performance.
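
To make the idea of "curvature features from frame-wise embeddings" concrete, here is a minimal sketch in Python with NumPy. It is not the authors' implementation. The abstract and keywords indicate a Bezier curve approximation, whereas this sketch swaps in plain finite differences to estimate the derivatives of the embedding trajectory. It also includes a standard margin-based triplet loss as one plausible reading of "action based triplet learning". The function names, the `margin` value, and the choice of Euclidean distance are all assumptions made for illustration.

```python
import numpy as np


def curvature_features(embeddings, eps=1e-12):
    """Per-frame curvature of the trajectory traced by frame embeddings.

    embeddings: array of shape (T, D), one D-dimensional vector per frame.
    Returns an array of shape (T,).

    Assumption: derivatives are estimated with finite differences
    (np.gradient), not with the Bezier approximation named in the keywords.
    """
    x = np.asarray(embeddings, dtype=np.float64)
    d1 = np.gradient(x, axis=0)   # first derivative along time
    d2 = np.gradient(d1, axis=0)  # second derivative along time

    speed_sq = np.sum(d1 * d1, axis=1)
    accel_sq = np.sum(d2 * d2, axis=1)
    dot = np.sum(d1 * d2, axis=1)

    # Curvature of a curve in R^D:
    # sqrt(|r'|^2 |r''|^2 - (r' . r'')^2) / |r'|^3
    numerator = np.sqrt(np.maximum(speed_sq * accel_sq - dot ** 2, 0.0))
    denominator = np.maximum(speed_sq ** 1.5, eps)  # guard against zero speed
    return numerator / denominator


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Standard margin-based triplet loss with Euclidean distance.

    Hypothetical pairing, following the abstract: the positive is a frame of
    the same action from a different video, the negative is a frame of a
    different action from the same video.
    """
    d_pos = np.linalg.norm(np.asarray(anchor) - np.asarray(positive))
    d_neg = np.linalg.norm(np.asarray(anchor) - np.asarray(negative))
    return max(d_pos - d_neg + margin, 0.0)
```

In a pipeline like the one the abstract describes, the resulting curvature values would presumably be combined with the original frame embeddings before being passed to an existing segmentation model. How the paper performs that fusion is not stated here, so it is left out of the sketch.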
