Publications

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Search for Publication


Year(s) from:  to 
Author:
Keywords (separated by spaces):

Action Snippets: How Many Frames Does Human Action Recognition Require?

K. Schindler and L. van Gool
IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08)
Anchorage, AK, June 2008

Abstract

Visual recognition of human actions in video clips has been an active field of research in recent years. However, most published methods either analyse an entire video and assign it a single action label, or use relatively large look-ahead to classify each frame. Contrary to these strategies, human vision proves that simple actions can be recognised almost instantaneously. In this paper, we present a system for action recognition from very short sequences ("snippets") of 1-10 frames, and systematically evaluate it on standard data sets. It turns out that even local shape and optic flow for a single frame are enough to achieve ~90% correct recognitions, and snippets of 5-7 frames (0.3-0.5 seconds of video) are enough to achieve a performance similar to the one obtainable with the entire video sequence.


Download in pdf format
@InProceedings{eth_biwi_00532,
  author = {K. Schindler and L. van Gool},
  title = {Action Snippets: How Many Frames Does Human Action Recognition Require?},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08)},
  year = {2008},
  month = {June},
  publisher = {IEEE Press},
  keywords = {}
}