
Retrieving Objects From Videos Based on Affine Regions

V. Ferrari, T. Tuytelaars, and L. Van Gool
Proceedings of the European Signal Processing Conference (EUSIPCO)
Vienna, Austria, September 2004


We present a method to (semi-)automatically annotate video material. More precisely, we focus on recognizing specific objects and scenes in keyframes. Objects are learnt simply by having the user delineate them in one (or a few) images. The basic building block to achieve this goal consists of affine invariant regions. These are local image patches that adapt their shape based on the image content so as to be invariant to viewpoint changes. Instead of simply matching the regions and counting the number of matches, we propose to gather more evidence about the presence of the object by exploring the image around the initial matches. This boosts the performance, especially under difficult, real-world imaging conditions. Experimental results on news broadcast data demonstrate the viability of the approach.
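For orientation, the baseline that the abstract contrasts against ("simply matching the regions and counting the number of matches") might be sketched as follows. This is not the authors' method: the function name `count_matches`, the use of plain descriptor vectors, the Euclidean distance, and the `ratio` threshold are all assumptions made for illustration, and the sketch does not include the exploration step around initial matches that the paper proposes.

```python
import math


def count_matches(model_desc, frame_desc, ratio=0.8):
    """Count model descriptors that have a distinctive nearest neighbour in a keyframe.

    model_desc, frame_desc: sequences of equal-length numeric vectors, one per
    region (hypothetical descriptors; the paper's actual descriptors may differ).
    ratio: assumed threshold for a nearest/second-nearest distance test.
    """
    if len(frame_desc) < 2:
        # The ratio test needs at least two candidates in the keyframe.
        return 0
    matches = 0
    for d in model_desc:
        # Distances from this model region to every keyframe region, ascending.
        dists = sorted(math.dist(d, f) for f in frame_desc)
        # Accept only if the best candidate is clearly better than the runner-up.
        if dists[0] < ratio * dists[1]:
            matches += 1
    return matches
```

A keyframe would then be scored by `count_matches(model, frame)` and ranked or thresholded on that count; the abstract argues that such a count alone is weak evidence under difficult imaging conditions.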

  author = {V. Ferrari and T. Tuytelaars and L. Van Gool},
  title = {Retrieving Objects From Videos Based on Affine Regions},
  booktitle = {Proceedings of the European Signal Processing Conference (EUSIPCO)},
  year = {2004},
  month = {September},
  pages = {128-131},
  keywords = {video retrieval, object recognition, local features}