
2D Articulated Human Pose Estimation and Retrieval in (Almost) Unconstrained Still Images

M. Eichner, M. Marin-Jimenez, A. Zisserman, V. Ferrari
International Journal of Computer Vision
Vol. 99, No. 2, pp. 190-214, September 2012


We present a technique for estimating the spatial layout of humans in still images: the position of the head, torso and arms. The theme we explore is that once a person is localized using an upper body detector, the search for their body parts can be considerably simplified using weak constraints on position and appearance arising from that detection. Our approach is capable of estimating upper body pose in highly challenging uncontrolled images, without prior knowledge of background, clothing, lighting, or the location and scale of the person in the image. People are only required to be upright and seen from the front or the back (not from the side). We evaluate the stages of our approach experimentally using ground truth layout annotation on a variety of challenging material, such as images from the PASCAL VOC 2008 challenge and video frames from TV shows and feature films. We also propose and evaluate techniques for searching a video dataset for people in a specific pose. To this end, we develop three new pose descriptors and compare their classification and retrieval performance to two baselines built on state-of-the-art object detection models.

  author = {M. Eichner and M. Marin-Jimenez and A. Zisserman and V. Ferrari},
  title = {2D Articulated Human Pose Estimation and Retrieval in (Almost) Unconstrained Still Images},
  journal = {International Journal of Computer Vision},
  year = {2012},
  month = {September},
  pages = {190--214},
  volume = {99},
  number = {2},
  keywords = {}