4.7 Article

Tracking Human Pose Using Max-Margin Markov Models

Journal

IEEE TRANSACTIONS ON IMAGE PROCESSING
Volume 24, Issue 12, Pages 5274-5287

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIP.2015.2473662

Keywords

Pose tracking; pose estimation; max-margin; articulated shapes

Funding

  1. National Basic Research Program of China (973 Program) [2012CB316400]
  2. National Natural Science Foundation of China [61125204, 61432014]
  3. Fundamental Research Funds through the Central Universities [BDZ021403, JB149901]
  4. Program for Changjiang Scholars and Innovative Research Team in University of China [IRT13088]
  5. Shaanxi Innovative Research Team for Key Science and Technology [2012KCT-02]
  6. Australian Research Council [DP-120103730, FT-130101457]
  7. Key Research Program through the Chinese Academy of Sciences [KGZDEW-T03]

Ask authors/readers for more resources

We present a new method for tracking human pose by employing max-margin Markov models. Representing a human body by part-based models, such as pictorial structure, the problem of pose tracking can be modeled by a discrete Markov random field. Considering max-margin Markov networks provide an efficient way to deal with both structured data and strong generalization guarantees, it is thus natural to learn the model parameters using the max-margin technique. Since tracking human pose needs to couple limbs in adjacent frames, the model will introduce loops and will be intractable for learning and inference. Previous work has resorted to pose estimation methods, which discard temporal information by parsing frames individually. Alternatively, approximate inference strategies have been used, which can overfit to statistics of a particular data set. Thus, the performance and generalization of these methods are limited. In this paper, we approximate the full model by introducing an ensemble of two tree-structured sub-models, Markov networks for spatial parsing and Markov chains for temporal parsing. Both models can be trained jointly using the max-margin technique, and an iterative parsing process is proposed to achieve the ensemble inference. We apply our model on three challengeable data sets, which contains highly varied and articulated poses. Comprehensive experimental results demonstrate the superior performance of our method over the state-of-the-art approaches.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available