Combining Densely Sampled Form and Motion for Human Action Recognition

K. Schindler and L. van Gool
DAGM Annual Pattern Recognition Symposium
Munich, Germany, June 2008


We present a method for human action recognition from video that exploits both form (local shape) and motion (local flow). Inspired by models of the human visual system, the two feature sets are processed independently in separate channels. The form channel extracts a dense local shape representation from every frame, while the motion channel extracts dense optic flow from the frame and its immediate predecessor. The same processing pipeline is applied in both channels: feature maps are pooled locally, down-sampled, and compared to a collection of learnt templates, yielding a vector of similarity scores. In a final step, the two score vectors are merged, and recognition is performed with a discriminative classifier. In an evaluation on two standard datasets, our method outperforms the state of the art, confirming that the combination of form and motion improves recognition.
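
The per-channel pipeline described in the abstract (local pooling, down-sampling, comparison against templates, merging of the two score vectors) might be sketched roughly as follows. This is an illustrative outline only, not the authors' implementation: the function names, the choice of non-overlapping max-pooling, the cosine similarity measure, and the window size are all assumptions, and the extraction of the shape and flow feature maps themselves is taken as given.

```python
import numpy as np

def max_pool(feature_map, size=2):
    """Pool over non-overlapping size x size windows (assumed pooling scheme).

    Taking the maximum of each window both pools locally and down-samples.
    """
    h, w = feature_map.shape
    h, w = h - h % size, w - w % size  # crop so the map tiles evenly
    blocks = feature_map[:h, :w].reshape(h // size, size, w // size, size)
    return blocks.max(axis=(1, 3))

def template_scores(pooled_map, templates):
    """Cosine similarity of the flattened map to each template (assumed measure)."""
    v = pooled_map.ravel()
    v = v / (np.linalg.norm(v) + 1e-12)
    t = templates.reshape(len(templates), -1)
    t = t / (np.linalg.norm(t, axis=1, keepdims=True) + 1e-12)
    return t @ v

def describe_frame(form_map, motion_map, form_templates, motion_templates, size=2):
    """Run both channels through the same pipeline and merge the score vectors."""
    form_scores = template_scores(max_pool(form_map, size), form_templates)
    motion_scores = template_scores(max_pool(motion_map, size), motion_templates)
    return np.concatenate([form_scores, motion_scores])
```

The merged vector returned by the hypothetical `describe_frame` would then be passed to a discriminative classifier; the abstract does not say which one, so none is shown here.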

  author = {K. Schindler and L. van Gool},
  title = {Combining Densely Sampled Form and Motion for Human Action Recognition},
  booktitle = {DAGM Annual Pattern Recognition Symposium},
  year = {2008},
  month = {June},
  publisher = {Springer},
  keywords = {}