4.8 Article

Adaptive Multi-View and Temporal Fusing Transformer for 3D Human Pose Estimation

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Computer Science, Information Systems

Exploiting Temporal Contexts With Strided Transformer for 3D Human Pose Estimation

Wenhao Li et al.

Summary: This paper proposes an improved Transformer-based architecture called Strided Transformer, which can transform a redundant 2D pose sequence into a single 3D pose. By replacing the fully-connected layers with strided convolutions, the sequence redundancy is reduced and local context information is aggregated. A full-to-single supervision scheme is designed to enforce temporal smoothness constraints and improve the accuracy of 3D pose estimation.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Article Computer Science, Artificial Intelligence

Limb Pose Aware Networks for Monocular 3D Pose Estimation

Lele Wu et al.

Summary: In this study, we propose a limb pose aware framework consisting of a kinematic constraint aware network and a trajectory aware temporal module to improve the 3D prediction accuracy of limb joint positions. By introducing relative bone angles and absolute bone angles as kinematic constraints, and incorporating a hierarchical Transformer network for trajectory estimation, we successfully alleviate the problem of errors accumulated along limbs and achieve promising results.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Article Computer Science, Artificial Intelligence

Locally Connected Network for Monocular 3D Human Pose Estimation

Hai Ci et al.

Summary: In this paper, an approach for estimating 3D human pose from monocular images is presented. By combining graph convolutional network (GCN) with locally connected network (LCN), the proposed approach achieves better performance on benchmark datasets and demonstrates strong generalization ability.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Computer Science, Software Engineering

MotioNet: 3D Human Motion Reconstruction from Monocular Video with Skeleton Consistency

Mingyi Shi et al.

Summary: MotioNet is a deep neural network that reconstructs 3D human skeleton motion from monocular video. The network decomposes 2D joint position sequences into bone length-encoded skeleton and 3D joint rotation sequences, outputting 3D positions through an integrated FK layer for comparison with ground truth.

ACM TRANSACTIONS ON GRAPHICS (2021)

Article Computer Science, Artificial Intelligence

AdaFuse: Adaptive Multiview Fusion for Accurate Human Pose Estimation in the Wild

Zhe Zhang et al.

Summary: AdaFuse is an adaptive multiview fusion method designed to address occlusion challenges in human pose estimation in the wild. It effectively determines point-point correspondences between different views and learns adaptive fusion weights to optimize feature quality. Through evaluations on public datasets, AdaFuse outperforms state-of-the-art methods in all metrics.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2021)

Article Computer Science, Artificial Intelligence

Towards Efficient Scene Understanding via Squeeze Reasoning

Xiangtai Li et al.

Summary: In this paper, a novel framework called Squeeze Reasoning is proposed to efficiently perform context graph reasoning by squeezing input features into a channel-wise global vector and conducting reasoning within the vector. This approach reduces computational cost and achieves significant results on various semantic segmentation datasets.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2021)

Article Computer Science, Artificial Intelligence

A generalizable approach for multi-view 3D human pose regression

Abdolrahim Kadkhodamohammadi et al.

MACHINE VISION AND APPLICATIONS (2020)

Proceedings Paper Computer Science, Artificial Intelligence

Cascaded Deep Monocular 3D Human Pose Estimation with Evolutionary Training Data

Shichao Li et al.

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2020)

Proceedings Paper Computer Science, Artificial Intelligence

Lightweight Multi-View 3D Pose Estimation through Camera-Disentangled Representation

Edoardo Remelli et al.

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2020)

Proceedings Paper Computer Science, Artificial Intelligence

Deep Kinematics Analysis for Monocular 3D Human Pose Estimation

Jingwei Xu et al.

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2020)

Article Computer Science, Artificial Intelligence

Robust 3D Human Pose Estimation from Single Images or Video Sequences

Chunyu Wang et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Learnable Triangulation of Human Pose

Karim Iskakov et al.

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Cross View Fusion for 3D Human Pose Estimation

Haibo Qiu et al.

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Monocular 3D Human Pose Estimation by Generation and Ordinal Ranking

Saurabh Sharma et al.

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) (2019)

Proceedings Paper Computer Science, Artificial Intelligence

HEMlets Pose: Learning Part-Centric Heatmap Triplets for Accurate 3D Human Pose Estimation

Kun Zhou et al.

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views

Junting Dong et al.

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Generating Multiple Hypotheses for 3D Human Pose Estimation with Mixture Density Network

Chen Li et al.

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Semantic Graph Convolutional Networks for 3D Human Pose Regression

Long Zhao et al.

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)

Article Computer Science, Artificial Intelligence

Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network with Trust Gates

Jun Liu et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Compositional Human Pose Regression

Xiao Sun et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

A simple yet effective baseline for 3d human pose estimation

Julieta Martinez et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

3D Human Pose Estimation from a Single Image via Distance Matrix Regression

Francesc Moreno-Noguer

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Harvesting Multiple Views for Marker-less 3D Human Pose Annotations

Georgios Pavlakos et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose

Georgios Pavlakos et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Article Computer Science, Artificial Intelligence

Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments

Catalin Ionescu et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2014)