This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Search for Publication

Year(s) from:  to 
Keywords (separated by spaces):

Distance Metric Learning from Uncertain Side Information with Application to Automated Photo Tagging

L. Wu, S. C. Hoi, R. Jin, J. Zhu and N. Yu
ACM Multimedia 2009
Beijing, China, October 2009


Automated photo tagging is essential to making massive unlabeled photos searchable by text search engines. Conventional image annotation approaches, though working reasonably well on small testbeds, are either computationally expensive or inaccurate when dealing with large-scale photo tagging. Recently, with the popularity of social networking websites, we observe a massive number of user-tagged images, referred to as “social images”, that are available on the web. Unlike traditional web images, social images often contain tags and other user-generated content, which offer a new opportunity to resolve some long-standing challenges in multimedia. In this work, we aim to address the challenge of large-scale automated photo tagging by exploring the social images. We present a retrieval based approach for automated photo tagging. To tag a test image, the proposed approach first retrieves k social images that share the largest visual similarity with the test image. The tags of the test image are then derived based on the tagging of the similar images. Due to the well-known semantic gap issue, a regular Euclidean distance-based retrieval method often fails to find semantically relevant images. To address the challenge of semantic gap, we propose a novel probabilistic distance metric learning scheme that (1) automatically derives constraints from the uncertain side information, and (2) effciently learns a distance metric from the derived constraints. We apply the proposed technique to automated photo tagging tasks based on a social image testbed with over 200,000 images crawled from Flickr. Encouraging results show that the proposed technique is effective and promising for automated photo tagging.

Link to publisher's page
  author = {L. Wu and S. C. Hoi and R. Jin and J. Zhu and N. Yu},
  title = {Distance Metric Learning from Uncertain Side Information with Application to Automated Photo Tagging},
  booktitle = {ACM Multimedia 2009},
  year = {2009},
  month = {October},
  pages = {135-144},
  publisher = {ACM New York, NY, USA},
  keywords = {automated photo tagging, distance metric learning, uncertain side information }