Pose Estimation is a general problem in Computer Vision where we detect the position and orientation of an object.
(Image credit: Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose)
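To make "position and orientation" in the definition above concrete, here is a minimal sketch of the simplest case: recovering a 2D rigid pose (a rotation angle plus a translation) from matched keypoints by least squares. It is an illustration only and is not taken from any of the papers listed on this page. The function names `apply_pose_2d` and `estimate_pose_2d` are hypothetical, and the keypoint correspondences are assumed to be known and noise-free.

```python
import math


def apply_pose_2d(points, theta, translation):
    """Rotate each (x, y) point by theta radians, then translate it."""
    c, s = math.cos(theta), math.sin(theta)
    tx, ty = translation
    return [(c * x - s * y + tx, s * x + c * y + ty) for x, y in points]


def estimate_pose_2d(model_pts, image_pts):
    """Least-squares rigid pose mapping model_pts onto image_pts.

    Returns (theta, (tx, ty)). Assumes the two lists are matched
    point-for-point and contain at least two distinct points.
    """
    n = len(model_pts)
    # Centroids of both point sets.
    mcx = sum(p[0] for p in model_pts) / n
    mcy = sum(p[1] for p in model_pts) / n
    icx = sum(p[0] for p in image_pts) / n
    icy = sum(p[1] for p in image_pts) / n

    # Accumulate dot and cross products of the centered correspondences.
    dot = 0.0
    cross = 0.0
    for (mx, my), (ix, iy) in zip(model_pts, image_pts):
        ax, ay = mx - mcx, my - mcy
        bx, by = ix - icx, iy - icy
        dot += ax * bx + ay * by
        cross += ax * by - ay * bx

    # Orientation: the angle that best aligns the centered point sets.
    theta = math.atan2(cross, dot)

    # Position: whatever translation remains after rotating the model centroid.
    c, s = math.cos(theta), math.sin(theta)
    tx = icx - (c * mcx - s * mcy)
    ty = icy - (s * mcx + c * mcy)
    return theta, (tx, ty)


if __name__ == "__main__":
    # Hypothetical object keypoints and a made-up ground-truth pose.
    model = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
    observed = apply_pose_2d(model, math.radians(30.0), (2.0, -1.0))
    theta, (tx, ty) = estimate_pose_2d(model, observed)
    print(f"orientation: {math.degrees(theta):.2f} deg, position: ({tx:.2f}, {ty:.2f})")
```

Real systems usually have to find the keypoints first, for example with a neural network, and often work with 3D poses, where the rotation has three degrees of freedom and a closed-form angle like this one no longer suffices.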
Our approach efficiently detects objects in an image while simultaneously generating a high-quality segmentation mask for each instance.
Ranked #1 on Real-Time Object Detection on COCO minival (MAP metric)
3D INSTANCE SEGMENTATION, HUMAN PART SEGMENTATION, KEYPOINT DETECTION, MULTI-HUMAN PARSING, MULTI-PERSON POSE ESTIMATION, MULTI-TISSUE NUCLEUS SEGMENTATION, NUCLEAR SEGMENTATION, PANOPTIC SEGMENTATION, REAL-TIME OBJECT DETECTION
We demonstrate this framework on 3D pose estimation by proposing a differentiable objective that seeks the optimal set of keypoints for recovering the relative pose between two views of an object.
With recent progress in graphics, it has become more tractable to train models on synthetic images, potentially avoiding the need for expensive annotations.
In this work, we establish dense correspondences between an RGB image and a surface-based representation of the human body, a task we refer to as dense human pose estimation.
Ranked #2 on Pose Estimation on DensePose-COCO
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
We address two problems: first, we establish an easy method for capturing and labeling 3D keypoints on desktop objects with an RGB camera; and second, we develop a deep neural network, called KeyPose, that learns to accurately predict object poses using 3D keypoints, from stereo input, and works even for transparent objects.
High-resolution representations are essential for position-sensitive vision problems, such as human pose estimation, semantic segmentation, and object detection.
Ranked #2 on Semantic Segmentation on CamVid