JCN: Joint Constraint-Based Human Pose Refinement Networks

2D human pose estimation has essential applications in traffic prediction and human-computer interaction. We propose a pose refinement network for refining human pose features to improve human pose detection accuracy. We propose our approach to two critical problems in pose estimation, occlusion and...

Full description

Saved in:
Bibliographic Details
Main Authors: Yuru Zhang, Jiayuan Zhao, Xiaodong Su, Hongyan Xu, Meijian Jin
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10543053/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:2D human pose estimation has essential applications in traffic prediction and human-computer interaction. We propose a pose refinement network for refining human pose features to improve human pose detection accuracy. We propose our approach to two critical problems in pose estimation, occlusion and background interference, for which the model often needs local information for inference. In contrast, for the background interference problem, the model needs more non-local semantic information to guide the model to make semantic judgments. Specifically, we propose Joint Constraint-Based Human Pose Refinement Networks, which explicitly and implicitly model the relationship between critical points through graph neural networks and self-attention mechanisms to capture local and non-local information of the pose features and refine the keypoints responses. In order to avoid mutual conflicts between explicit and implicit modeling, this paper chooses a parallel structure. It selects an appropriate feature fusion strategy to fuse parallel branch features to improve model robustness. Meanwhile, to make the model pay more attention to the hard-to-identify vital points, this paper adopts the focal loss function to optimize the model and improve the data’s long-tailed distribution. We validate our work on the publicly available dataset MPII with the MSCOCO dataset. The results show that our proposed method has the effect of refining the keypoints response, with a PCKh@0.5 of 92.51% on the official MPII validation dataset, and achieved a result of 78% AP on the official MSCOCO validation dataset, which is comparable to the performance of existing state-of-the-art models.
ISSN:2169-3536