FMMP (CVPR'2025)

Official code repository for the paper:
Recurrent Feature Mining and Keypoint Mixup Padding for Category-Agnostic Pose Estimation
[Junjie Chen, Weilong Chen, Yifan Zuo, Yuming Fang]

Abstract

Category-agnostic pose estimation aims to locate keypoints on query images according to a few annotated support images for arbitrary novel classes. Existing methods generally extract support features via heatmap pooling and obtain interacted features from support and query via cross-attention. As a result, these works neglect to mine fine-grained and structure-aware (FGSA) features from both support and query images, which are crucial for pixel-level keypoint localization. To this end, we propose a novel yet concise framework, which recurrently mines FGSA features from both support and query images. Specifically, we design an FGSA mining module based on the deformable attention mechanism. On the one hand, we mine fine-grained features by applying deformable attention heads over multi-scale feature maps. On the other hand, we mine structure-aware features by offsetting the reference points of keypoints to their linked keypoints. By means of the above module, we recurrently mine FGSA features from support and query images, and thus obtain better support features and query estimations. In addition, we propose to use mixup keypoints to pad various classes to a unified keypoint number, which can provide richer supervision than the zero padding used in existing works. We conduct extensive experiments and in-depth studies on the large-scale MP-100 dataset, and dramatically outperform the SOTA method.
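To make the keypoint mixup padding idea concrete, here is a minimal sketch in plain Python. It is not the repository's implementation. It assumes that each padded keypoint is a convex combination of two randomly chosen real keypoints; the function name `mixup_pad_keypoints`, the uniform mixing weight, and the `is_real` mask are hypothetical choices made for illustration only.

```python
import random


def mixup_pad_keypoints(keypoints, target_num, seed=None):
    """Pad a list of (x, y) keypoints to `target_num` entries.

    Assumption (illustrative, not taken from the paper's code): each padded
    keypoint is lam * a + (1 - lam) * b, where a and b are randomly chosen
    real keypoints and lam is drawn uniformly from [0, 1).
    Returns the padded list and a mask marking which entries are real.
    """
    rng = random.Random(seed)
    num_real = len(keypoints)
    if num_real == 0:
        raise ValueError("at least one real keypoint is required")
    if num_real > target_num:
        raise ValueError("target_num must be >= the number of real keypoints")

    padded = [tuple(kp) for kp in keypoints]
    for _ in range(target_num - num_real):
        xa, ya = rng.choice(keypoints)
        xb, yb = rng.choice(keypoints)
        lam = rng.random()
        # Convex combination of two real keypoints instead of a zero entry.
        padded.append((lam * xa + (1 - lam) * xb, lam * ya + (1 - lam) * yb))

    is_real = [i < num_real for i in range(target_num)]
    return padded, is_real


if __name__ == "__main__":
    kps = [(0.0, 0.0), (10.0, 0.0), (10.0, 20.0)]
    padded, is_real = mixup_pad_keypoints(kps, target_num=6, seed=0)
    print(len(padded), is_real)
```

Unlike zero padding, every padded slot in this sketch carries a meaningful location on the object, which is the intuition behind the richer supervision mentioned above.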

Usage

Install

Installation is similar to that of CapeFormer; the detailed package list can be found in cape_environment.yml.

Data preparation

Please follow the official guide to prepare the MP-100 dataset for training and evaluation, and organize the data structure properly.

Alternatively, we employ a unified annotation file (i.e., unified_ann_file.json) and use valid_class_ids to select the various splits.
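The sketch below illustrates how a split could be selected from a unified annotation file by class id. It assumes the file follows a COCO-style layout with `images`, `annotations`, and `categories` lists; this layout and the helper names `select_split` and `load_split` are assumptions for illustration, and the repository's own handling of valid_class_ids may differ.

```python
import json


def select_split(unified_ann, valid_class_ids):
    """Keep only the categories, annotations, and images of the given classes.

    Assumes a COCO-style dict with "images", "annotations", "categories".
    """
    valid = set(valid_class_ids)
    categories = [c for c in unified_ann["categories"] if c["id"] in valid]
    annotations = [
        a for a in unified_ann["annotations"] if a["category_id"] in valid
    ]
    # Keep only images that still have at least one annotation.
    used_images = {a["image_id"] for a in annotations}
    images = [im for im in unified_ann["images"] if im["id"] in used_images]
    return {
        "images": images,
        "annotations": annotations,
        "categories": categories,
    }


def load_split(path, valid_class_ids):
    """Load the unified annotation file from `path` and filter it."""
    with open(path, "r", encoding="utf-8") as f:
        return select_split(json.load(f), valid_class_ids)
```

For example, `load_split("unified_ann_file.json", valid_class_ids=[1, 5, 9])` would return only the entries belonging to those three classes; the ids here are placeholders, not an actual MP-100 split.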

Training and Test

The scripts are similar to those of CapeFormer; the detailed commands can be found in install.sh. Pretrained weights (e.g., mAP: ~78.7%) are available at GtYe.

Citation

@inproceedings{FMMP,
  title={Recurrent Feature Mining and Keypoint Mixup Padding for Category-Agnostic Pose Estimation},
  author={Junjie Chen and Weilong Chen and Yifan Zuo and Yuming Fang},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year={2025}
}

Acknowledgement

Thanks to:

License

This project is released under the Apache 2.0 license.

