Kinematic-aware Hierarchical Attention Network for Human Pose Estimation in Videos (WACV 2023)

https://openaccess.thecvf.com/content/WACV2023/papers/Jin_Kinematic-Aware_Hierarchical_Attention_Network_for_Human_Pose_Estimation_in_Videos_WACV_2023_paper.pdf

Contributions

We propose a novel approach HANet that utilizes the keypoints’ kinematic features, following the laws of physics. Our method addresses temporal issues with these proposed features, effectively mitigates the jitter, and becomes robust to occlusion.
We propose a hierarchical transformer encoder that incorporates multi-scale spatio-temporal attention. We use multi-scale feature maps, i.e., leverage all layers’ attention maps, and improve performance on benchmarks that provide sparse supervision.
We propose online mutual learning that enables joint optimization between refined input poses and final poses, which chooses an online learning target by their training losses.
We conduct extensive experiments on large datasets and demonstrate that our framework improves performance on tasks: 2D pose estimation, 3D pose estimation, body mesh recovery, and sparsely-annotated multi-human 2D pose estimation.

Getting Started

Environment Requirement

Clone the repo:

https://github.com/KyungMinJin/HANet.git

Install the HANet requirements using conda:

# conda
conda create env --name HANet python=3.6
conda activate HANet
pip install -r requirements.txt

Prepare Data

Sub-JHMDB data used in our experiment can be downloaded here. Refer to Official DeciWatch Repository for more details about the data arrangement.

Google Drive

Dataset	Pose Estimator	3D Pose	2D Pose	SMPL
Sub-JHMDB	SimpleBaseline		✔

Training

Note that datasets should be downloaded and prepared before training.

Run the commands below to start training on Sub-JHMDB:

python train.py --cfg configs/config_jhmdb_simplebaseline_2D.yaml --dataset_name jhmdb --estimator simplebaseline --body_representation 2D

Evaluation

Results on 2D Pose:

Dataset	Estimator	PCK 0.05 (Input/Output):arrow_up:	PCK 0.1 (Input/Output):arrow_up:	PCK 0.2 (Input/Output):arrow_up:	Checkpoint
Sub-JHMDB	simplebaseline	57.3%/91.9%	81.6%/98.3%	93.9%/99.6%	Google Drive

Results on 3D Pose:

Dataset	Estimator	MPJPE (Input/Output):arrow_down:	Accel (Input/Output):arrow_down:
Human3.6M	FCN	54.6/52.8	19.2/1.4
Human3.6M	Mhformer	38.3/35.4	0.8/0.8
3DPW	PARE	78.9/77.1	6.9/6.8
AIST++	SPIN	107.7/69.2	5.7/5.4

Visualization

We prepare all visualization codes as soon as possible.

2D Pose

Visualize comparison on Sub-JHMDB

3D Pose

Visualize comparison on AIST++

3D Body Mesh Recovery

Visualize comparison on 3DPW

Visualize comparison on AIST++

Citation

@inproceedings{jin2023kinematic,
  title={Kinematic-aware Hierarchical Attention Network for Human Pose Estimation in Videos},
  author={Jin, Kyung-Min and Lim, Byoung-Sung and Lee, Gun-Hee and Kang, Tae-Kyung and Lee, Seong-Whan},
  booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision},
  pages={5725--5734},
  year={2023}
}

Acknowledgement

The code is based on Deciwatch. Thanks for their well-organized code!

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.idea		.idea
configs		configs
docs/assets		docs/assets
lib		lib
.gitignore		.gitignore
README.md		README.md
eval.py		eval.py
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kinematic-aware Hierarchical Attention Network for Human Pose Estimation in Videos (WACV 2023)

Contributions

Getting Started

Environment Requirement

Prepare Data

Training

Evaluation

Visualization

2D Pose

3D Pose

3D Body Mesh Recovery

Citation

Acknowledgement

About

Releases

Packages

Languages

KyungMinJin/HANet

Folders and files

Latest commit

History

Repository files navigation

Kinematic-aware Hierarchical Attention Network for Human Pose Estimation in Videos (WACV 2023)

Contributions

Getting Started

Environment Requirement

Prepare Data

Training

Evaluation

Visualization

2D Pose

3D Pose

3D Body Mesh Recovery

Citation

Acknowledgement

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages