This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Search for Publication

Year(s) from:  to 
Keywords (separated by spaces):

A Large-Scale Database of Images and Captions for Automatic Face Naming

M. Ozcan, L. Jie, V. Ferrari, and B. Caputo
Proceedings of the British Machine Vision Conference
September 2011


We present a large scale database of images and captions, designed for supporting research on how to use captioned images from the Web for training visual classifiers. It consists of more than 125,000 images of celebrities from different fields downloaded from the Web. Each image is associated to its original text caption, extracted from the html page the image comes from. We coin it FAN-Large, for Face And Names Large scale database. Its size and deliberate high level of noise makes it to our knowledge the largest and most realistic database supporting this type of research. The dataset and its annotations are publicly available and can be obtained from We report results on a thorough assessment of FAN-Large using several existing approaches for name-face association, and present and evaluate new contextual features derived from the caption. Our findings provide important cues on the strengths and limitations of existing approaches

Download in pdf format
  author = {M. Ozcan and L. Jie and V. Ferrari and and B. Caputo},
  title = {A Large-Scale Database of Images and Captions for Automatic Face Naming},
  booktitle = {Proceedings of the British Machine Vision Conference},
  year = {2011},
  month = {September},
  keywords = {}