☆ 4.8 Article

Fast and Robust Multi-Person 3D Pose Estimation and Tracking From Multiple Views

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Journal

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

Volume 44, Issue 10, Pages 6981-6992

Publisher

IEEE COMPUTER SOC

DOI: 10.1109/TPAMI.2021.3098052

Keywords

Three-dimensional displays; Image reconstruction; Pose estimation; Cameras; Solid modeling; Noise measurement; Detectors; 3D human pose estimation; motion capture; multi-view reconstruction

Funding

National Key Research and Development Program of China [2020AAA0108901]
NSFC [61806176]
ZJU-SenseTime Joint Lab of 3D Vision
NSFHDR [TRIPODS-1934932]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This paper addresses the problem of reconstructing 3D poses of multiple people from a few calibrated camera views. The proposed approach uses a multi-way matching algorithm to cluster detected 2D poses and infer the 3D poses of each person efficiently. It also combines geometric and appearance cues for cross-view matching and proposes an efficient tracking method.

This paper addresses the problem of reconstructing 3D poses of multiple people from a few calibrated camera views. The main challenge of this problem is to find the cross-view correspondences among noisy and incomplete 2D pose predictions. Most previous methods address this challenge by directly reasoning in 3D using a pictorial structure model, which is inefficient due to the huge state space. We propose a fast and robust approach to solve this problem. Our key idea is to use a multi-way matching algorithm to cluster the detected 2D poses in all views. Each resulting cluster encodes 2D poses of the same person across different views and consistent correspondences across the keypoints, from which the 3D pose of each person can be effectively inferred. The proposed convex optimization based multi-way matching algorithm is efficient and robust against missing and false detections, without knowing the number of people in the scene. Moreover, we propose to combine geometric and appearance cues for cross-view matching. Finally, an efficient tracking method is proposed to track the detected 3D poses across the multi-view video. The proposed approach achieves the state-of-the-art performance on the Campus and Shelf datasets, while being efficient for real-time applications.

Fast and Robust Multi-Person 3D Pose Estimation and Tracking From Multiple Views

Journal

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

Publisher

IEEE COMPUTER SOC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Fast and Robust Multi-Person 3D Pose Estimation and Tracking From Multiple Views

Journal

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

Publisher

IEEE COMPUTER SOC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper