4.6 Article

CEAT: Curvature Feature Extractor Using Action Based Triplet Learning for Action Segmentation

Journal

IEEE ACCESS
Volume 11, Issue -, Pages 79445-79454

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2023.3298960

Keywords

Action segmentation; Bezier curve approximation; contrastive learning

Ask authors/readers for more resources

With the increasing number of untrimmed videos on the internet, there is a growing demand for advanced action segmentation methods that can accurately localize sequences within lengthy videos. Traditional approaches have tried to address the issue of over-segmentation by smoothing consecutive frame predictions, but this may overlook important spatio-temporal characteristics. To address these challenges more effectively, we propose a novel approach that constructs a geometric curve based on frame-wise embeddings and extracts curvature features. Experimental results show that incorporating curvature information into existing action segmentation models can significantly enhance performance.
With the continued growth of untrimmed videos on the internet, there is an increasing demand for advanced action segmentation methods, capable of accurately and semantically localizing sequences within lengthy videos. Traditional approaches have attempted to overcome the prevalent issue of over-segmentation by smoothing the predictions of consecutive frames. However, this technique can potentially overlook important spatio-temporal characteristics. Other common strategies include the incorporation of supplementary temporal data, which can be difficult to obtain in practical real-world scenarios. To more effectively address these problems, we propose a novel approach that constructs a geometric curve based on frame-wise embeddings and extracts curvature features. This procedure allows us to leverage the curvature information of embedded vectors and seamlessly integrate spatio-temporal information into existing action segmentation models. Our investigation reveals that our novel curvature-based approach enriches embedding representations, making them more suitable for action segmentation. It effectively brings closely together the representations of similar actions from different videos while appropriately distancing dissimilar action frames from the same video. Consequently, our experimental results provide substantial evidence that incorporating curvature information into various existing action segmentation models can significantly enhance action segmentation performances.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available