This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.


Learning object classes with generic knowledge

Thomas Deselaers, Bogdan Alexe and Vittorio Ferrari
No. 275, Computer Vision Lab, ETH Zuerich, 2011 (in press)


Learning a new object class from cluttered training images is very challenging when the location of object instances is unknown, i.e. in a weakly supervised setting. Many previous works require the objects to cover a large portion of the images. We present a novel approach that can cope with extensive clutter as well as large scale and appearance variations between object instances. To make this possible, we exploit generic knowledge learned beforehand from images of other classes for which location annotation is available. Generic knowledge facilitates learning any new class from weakly supervised images because it reduces the ambiguity in the location of its object instances. We propose a conditional random field that starts from generic knowledge and then progressively adapts to the new class. Our approach simultaneously localizes object instances and learns an appearance model specific to the class. We demonstrate this on several datasets, including the very challenging Pascal VOC 2007. Furthermore, our method makes it possible to train any state-of-the-art object detector in a weakly supervised fashion, even though such detectors normally require object location annotations.
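
The alternation described in the abstract (start from generic knowledge, then adapt to the new class while localizing its instances) can be illustrated with a toy sketch. This is not the conditional random field of the paper. It is a much-simplified stand-in that assumes each image comes with a few candidate windows, each carrying a generic "objectness" score and a feature vector. The scoring rule, the mean-based appearance model, and all names (`localize_and_learn`, `weight`, `rounds`) are hypothetical choices made only for illustration.

```python
from typing import List, Sequence, Tuple

# A candidate window: (generic objectness score, appearance feature vector).
# Both are assumed to be given; how they are computed is outside this sketch.
Window = Tuple[float, Sequence[float]]


def mean_vector(vectors: List[Sequence[float]]) -> List[float]:
    """Component-wise mean of equally sized vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]


def sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def localize_and_learn(images: List[List[Window]], rounds: int = 5,
                       weight: float = 1.0):
    """Alternate between selecting one window per image and re-estimating
    a class-specific appearance model (here simply a mean feature vector)."""
    # Start from generic knowledge only: the most object-like window per image.
    selected = [max(range(len(wins)), key=lambda i: wins[i][0])
                for wins in images]
    for _ in range(rounds):
        # Learn the class-specific model from the current selections.
        model = mean_vector([images[k][selected[k]][1]
                             for k in range(len(images))])
        # Re-localize: generic score minus distance to the class model.
        new_selected = [
            max(range(len(wins)),
                key=lambda i: wins[i][0] - weight * sq_dist(wins[i][1], model))
            for wins in images
        ]
        if new_selected == selected:
            break  # selections are stable
        selected = new_selected
    # Final model, consistent with the final selections.
    model = mean_vector([images[k][selected[k]][1]
                         for k in range(len(images))])
    return selected, model


# Toy data: in the third image a clutter window has the higher generic score.
images = [
    [(0.9, [1.0, 0.0]), (0.2, [0.0, 1.0])],
    [(0.8, [1.0, 0.1]), (0.1, [0.0, 1.0])],
    [(0.5, [0.0, 1.0]), (0.4, [0.9, 0.0])],
]
selected, model = localize_and_learn(images)
print(selected)  # [0, 0, 1]
```

In this toy run the generic score alone picks the clutter window in the third image. Once an appearance model has been estimated from all three images, the selection switches to the window that resembles the objects found in the other two.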

  author = {Thomas Deselaers and Bogdan Alexe and Vittorio Ferrari},
  title = {Learning object classes with generic knowledge},
  year = {2011},
  month = {August},
  number = {275},
  institution = {Computer Vision Lab, ETH Zuerich},
  keywords = {object detection, weakly supervised learning, transfer learning, conditional random fields},
  note = {in press}