This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Search for Publication

Year(s) from:  to 
Keywords (separated by spaces):

Face recognition from caption-based supervision

Matthieu Guillaumin, Thomas Mensink, Jakob Verbeek, Cordelia Schmid
International Journal of Computer Vision (IJCV)
Vol. 96, No. 1, pp. 64-82, January 2012


In this paper, we present methods for face recognition using a collection of images with captions. We consider two tasks: retrieving all faces of a particular person in a data set, and establishing the correct association between the names in the captions and the faces in the images. This is challenging because of the very large appearance variation in the images, as well as the potential mismatch between images and their captions. For both tasks, we compare generative and discriminative probabilistic models, as well as methods that maximize subgraph densities in similarity graphs. We extend them by considering different metric learning techniques to obtain appropriate face representations that reduce intra person variability and increase inter person separation. For the retrieval task, we also study the benefit of query expansion. To evaluate performance, we use a new fully labeled data set of 31147 faces which extends the recent Labeled Faces in the Wild data set. We present extensive experimental results which show that metric learning significantly improves the performance of all approaches on both tasks.

Link to publisher's page
Download in pdf format
  author = {Matthieu Guillaumin and Thomas Mensink and Jakob Verbeek and Cordelia Schmid},
  title = {Face recognition from caption-based supervision},
  journal = {International Journal of Computer Vision (IJCV)},
  year = {2012},
  month = {January},
  pages = {64-82},
  volume = {96},
  number = {1},
  keywords = {Face recognition, Metric Learning, Weakly supervised learning, Face retrieval, Constrained clustering}