4.6 Article

Early-stopped learning for action prediction in videos

Journal

Publisher

SPRINGER
DOI: 10.1007/s13735-021-00216-3

Keywords

Early action recognition; Action prediction; Deep learning; Two-stream networks

Ask authors/readers for more resources

This paper introduces a learning framework that enhances the applicability of deep learning methods in early action recognition. By encouraging learners to focus on the early parts of a video and identifying a stopping point for learning, the model can better utilize the information in these early parts.
Action prediction, also called early action recognition, is about recognizing an action in a video with partial observation. Various methods have been developed to tackle either offline or early action recognition, including deep learning approaches. In a family of deep learning methods, video frames or optical flow images are processed sequentially by the network. In this paper, we present a learning framework that can be applied to such methods to make them more appropriate for early recognition. We propose encouraging the learner to learn from earlier parts of the video and stop learning from some point on. By focusing on the earlier parts, we can expect the model to take full advantage of the information lying in these early parts. To this end, it is necessary to find a stopping point up to which enough information has been observed. We measure the amount of information with the help of the loss function. We applied our framework to Temporal Segment Networks and experimented on UCF11 and HMDB51 datasets. The results show that our method improves on Temporal Segment Networks and outperforms other baseline methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available