Hidden Markov models with graph densities for action recognition

Human action recognition in video streams is a fast developing field in pattern recognition and machine learning. Local image representations, e.g. space-time interest points [1], have proven to be the current most reliable choice of feature in sequences in which the region of interest is difficult to determine [2]. However, the question how to deal with more severe occlusions has large been ignored [2]. This work proposes a new approach which directly addresses heavy occlusions by modeling the skeleton-based features using a probability density functions (PDF) defined over graphs. We integrated the proposed density into an hidden Markov model (HMM) to model sequences of graphs of arbitrary sizes, i.e. occlusions setting may change over time. The approach is evaluated using a dataset embracing three action classes, studying six different types of occlusions (involving the removal of subgraphs from the graphical representation of action sequence). The presented study shows clearly that actions from even heavily occluded sequences can be reliably recognized.

[1]  Efstratios Gallopoulos,et al.  Computer Vision and Action Recognition A Guide for Image Processing and Computer Vision Community for Action Understanding , 2012 .

[2]  Hsuan-Sheng Chen,et al.  Human action recognition using star skeleton , 2006, VSSN '06.

[3]  Ronald Poppe,et al.  A survey on vision-based human action recognition , 2010, Image Vis. Comput..

[4]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[5]  Shyamsundar Rajaram,et al.  Human Activity Recognition Using Multidimensional Indexing , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Günther Palm,et al.  Recognizing Human Activities Using a Layered Markov Architecture , 2012, ICANN.

[7]  Edmondo Trentin,et al.  Unbiased SVM Density Estimation with Application to Graphical Pattern Recognition , 2007, ICANN.

[8]  Yaser Sheikh,et al.  Exploring the space of a human action , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[9]  Ivan Laptev,et al.  On Space-Time Interest Points , 2005, International Journal of Computer Vision.

[10]  Bart Selman,et al.  Unstructured human activity detection from RGBD images , 2011, 2012 IEEE International Conference on Robotics and Automation.

[11]  S. Ishikawa,et al.  Human activity recognition: Various paradigms , 2008, 2008 International Conference on Control, Automation and Systems.

[12]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[13]  Günther Palm,et al.  A generic framework for the inference of user states in human computer interaction , 2012, Journal on Multimodal User Interfaces.

[14]  G. West,et al.  Human Action Recognition with an Incomplete Real-Time Pose Skeleton , 2004 .

[15]  Günther Palm,et al.  Detecting Actions by Integrating Sequential Symbolic and Sub-symbolic Information in Human Activity Recognition , 2012, MLDM.

[16]  Martin A. Giese,et al.  Learning Representations for Animated Motion Sequence and Implied Motion Recognition , 2012, ICANN.

[17]  Rama Chellappa,et al.  Machine Recognition of Human Activities: A Survey , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[18]  Md. Atiqur Rahman Ahad Computer Vision and Action Recognition - A Guide for Image Processing and Computer Vision Community for Action Understanding , 2011, Atlantis Ambient and Pervasive Intelligence.

[19]  Edmondo Trentin,et al.  A Simple and Effective Neural Model for the Classification of Structured Patterns , 2007, KES.