Using multiple cues for hand tracking and model refinement

We present a model based approach to the integration of multiple cues for tracking high degree of freedom articulated motions and model refinement. We then apply it to the problem of hand tracking using a single camera sequence. Hand tracking is particularly challenging because of occlusions, shading variations, and the high dimensionality of the motion. The novelty of our approach is in the combination of multiple sources of information, which come from edges, optical flow, and shading information in order to refine the model during tracking. We first use a previously formulated generalized version of the gradient-based optical flow constraint, that includes shading flow i.e., the variation of the shading of the object as it rotates with respect to the light source. Using this model we track its complex articulated motion in the presence of shading changes. We use a forward recursive dynamic model to track the motion in response to data derived 3D forces applied to the model. However, due to inaccurate initial shape, the generalized optical flow constraint is violated. We use the error in the generalized optical flow equation to compute generalized forces that correct the model shape at each step. The effectiveness of our approach is demonstrated with experiments on a number of different hand motions with shading changes, rotations and occlusions of significant parts of the hand.

[1]  Katsushi Ikeuchi,et al.  Toward automatic robot instruction from perception-recognizing a grasp from observation , 1993, IEEE Trans. Robotics Autom..

[2]  Thomas S. Huang,et al.  Vision based hand modeling and tracking for virtual teleconferencing and telecollaboration , 1995, Proceedings of IEEE International Conference on Computer Vision.

[3]  Rómer Rosales,et al.  3D Hand Pose Reconstruction Using Specialized Mappings , 2001, ICCV.

[4]  J. Baumgarte Stabilization of constraints and integrals of motion in dynamical systems , 1972 .

[5]  Tomaso A. Poggio,et al.  Motion Field and Optical Flow: Qualitative Properties , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Dimitris N. Metaxas Physics-Based Deformable Models: Applications to Computer Vision, Graphics, and Medical Imaging , 1996 .

[7]  Dimitris N. Metaxas,et al.  Human Motion Planning Based on Recursive Dynamics and Optimal Control Techniques , 2002 .

[8]  Tosiyasu L. Kunii,et al.  Model-based analysis of hand posture , 1995, IEEE Computer Graphics and Applications.

[9]  Frank Uhlig,et al.  Numerical Algorithms with C , 1996 .

[10]  Yoshiaki Shirai,et al.  Hand gesture estimation and model refinement using monocular camera-ambiguity limitation by inequality constraints , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[11]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[12]  Jorge Angeles,et al.  Dynamic Simulation of n-Axis Serial Robotic Manipulators Using a Natural Orthogonal Complement , 1988, Int. J. Robotics Res..

[13]  Shahriar Negahdaripour,et al.  Revised Definition of Optical Flow: Integration of Radiometric and Geometric Cues for Dynamic Scene Analysis , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Olivier D. Faugeras,et al.  Finding pose of hand in video images: a stereo-based approach , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[15]  Dimitris N. Metaxas,et al.  Human Motion Planning Based on Recursive Dynamics and Optimal Control Techniques , 2000, Proceedings Computer Graphics International 2000.

[16]  Brian A. Barsky,et al.  Using dynamic analysis to animate articulated bodies such as humans and robots , 1985 .

[17]  Dimitris N. Metaxas,et al.  Recursive dynamics and optimal control techniques for human motion planning , 1999, Proceedings Computer Animation 1999.

[18]  Dimitris N. Metaxas,et al.  Adjusting Shape Parameters Using Model-Based Optical Flow Residuals , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  J FleetDavid,et al.  Computing Optical Flow with Physical Models of Brightness Variation , 2001 .

[20]  Reinhard Koch,et al.  Dynamic 3-D Scene Analysis Through Synthesis Feedback Control , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  David J. Fleet,et al.  Computing Optical Flow with Physical Models of Brightness Variation , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Ying Wu,et al.  Capturing articulated human hand motion: a divide-and-conquer approach , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[23]  Kiriakos N. Kutulakos,et al.  Multi-View Scene Capture by Surfel Sampling: From Video Streams to Non-Rigid 3D Motion, Shape and Reflectance , 2002, International Journal of Computer Vision.

[24]  Roy Featherstone,et al.  Robot Dynamics Algorithms , 1987 .

[25]  Takeo Kanade,et al.  Model-based tracking of self-occluding articulated objects , 1995, Proceedings of IEEE International Conference on Computer Vision.

[26]  Ying Wu,et al.  Capturing natural hand articulation , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[27]  Dimitris N. Metaxas,et al.  Incorporating illumination constraints in deformable models , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[28]  A. James Stewart,et al.  Beyond keyframing: an algorithmic approach to animation , 1992 .

[29]  Dimitris N. Metaxas,et al.  Optical Flow Constraints on Deformable Models with Applications to Face Tracking , 2000, International Journal of Computer Vision.

[30]  Y. J. Tejwani,et al.  Robot vision , 1989, IEEE International Symposium on Circuits and Systems,.