4.7 Article

Multi-actor activity detection by modeling object relationships in extended videos based on deep learning

Related references

Note: Only part of the references are listed.
Proceedings Paper Computer Science, Artificial Intelligence

MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity Detection

Kellie Corona et al.

Summary: The MEVA dataset is a large-scale dataset for human activity recognition, consisting of over 9300 hours of untrimmed, continuous video with diverse activities and background activities. Annotated with 37 activity types and bounding boxes of actors and props, the dataset includes multiple modalities such as RGB and thermal IR cameras, UAV footage, and GPS locations for actors. With data collected under IRB oversight and approval, the CC-BY-4.0 licensed dataset also includes additional ground camera data, UAV data, and GPS logs.

2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Argus: Efficient Activity Detection System for Extended Video Analysis

Wenhe Liu et al.

2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW) (2020)

Article Computer Science, Artificial Intelligence

Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos

Serena Yeung et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2018)

Article Computer Science, Artificial Intelligence

The THUMOS challenge on action recognition for videos in the wild

Haroon Idrees et al.

COMPUTER VISION AND IMAGE UNDERSTANDING (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Action Tubelet Detector for Spatio-Temporal Action Localization

Vicky Kalogeiton et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

Joao Carreira et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Fast R-CNN

Ross Girshick

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)

Proceedings Paper Computer Science, Artificial Intelligence

Learning Spatiotemporal Features with 3D Convolutional Networks

Du Tran et al.

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)