RTMPose vs RTMO vs DWPose vs RTMW #3135
Unanswered
davidpagnon
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi everyone,
I would appreciate it if you could help me understand a few details about RTMPose, RTMO, DWPose, and RTMW.
Some RTMW models have the "dw" suffix: rtmw-dw-x-l_simcc-cocktail14_270e-384x288
Some RTMpose models have the "dw" suffix, and seem to be trained on a less extensive dataset: rtmpose-l_simcc-ucoco_dw-ucoco_270e-384x288
Can you help me understand the logic behind it?
Ex for Coco 17: RTMPose-s: AP 72.2, while RTMPose-s*: AP 69.7
Here is what I found for now:
RTMPose: Top down (detection and then pose estimation).
Supports Halpe26 (with feet)
RTMW: RTMPose for whole-body (face, feet, hands), further extends to 3D
RTMO: One stage (no detection needed).
Faster with more than 4 people, slightly worse accuracy, only 17 points
DWPose: Uses distillation (teacher and student models).
Whole-body, faster than RTMW but less accurate (although not trained on same datasets so AP should be taken with caution)
YOLOX: Similar speed as RTMO but less accurate, no feet
Beta Was this translation helpful? Give feedback.
All reactions