Coarse-to-Fine Visual Selection

We study visual selection: Detect and roughly localize all instances of a generic object class, such as a face, in a greyscale scene, measuring performance in terms of computation and false alarms. Our approach is sequential testing which is coarse-tone in both in the exploration of poses and the representation of objects. All the tests are all binary and indicate the presence or absence of loose spatial arrangements of oriented edge fragments. Starting from training examples, we recursively nd larger and larger arrangements which are "decomposable," which implies the probability of an arrangement appearing on an object decays slowly with its size. Detection means nding a suucient number of arrangements of each size along a decreasing sequence of pose cells. At the beginning, the tests are simple and universal, accommodating many poses simultaneously, but the false alarm rate is relatively high. Eventually, the tests are more discriminating, but also more complex and dedicated to speciic poses. As a result, the spatial distribution of processing is highly skewed and detection is rapid, but at the expense of (isolated) false alarms which, presumably, could be eliminated with localized, more intensive, processing.

[1]  Kongqiao Wang,et al.  A hierarchical multiscale and multiangle system for human face detection in a complex background using gravity-center template , 1999, Pattern Recognit..

[2]  Donald Geman,et al.  An Active Testing Model for Tracking Roads in Satellite Images , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Donald Geman,et al.  Decision tree algorithms for handwritten digit recognition , 1998 .

[4]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Christoph von der Malsburg,et al.  Tracking and learning graphs and pose on image sequences of faces , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[6]  Timothy F. Cootes,et al.  Locating faces using statistical feature detectors , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[7]  Yali Amit,et al.  Shape Quantization and Recognition with Randomized Trees , 1997, Neural Computation.

[8]  Takao Akatsuka,et al.  Multi-module method for detection of human face from complex background , 1998 .

[9]  Yehezkel Lamdan,et al.  Object recognition by affine invariant matching , 2011, Proceedings CVPR '88: The Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Yali Amit,et al.  A Neural Network Architecture for Visual Selection , 2000, Neural Computation.

[11]  S. Ullman High-Level Vision: Object Recognition and Visual Cognition , 1996 .

[12]  William Grimson,et al.  Object recognition by computer - the role of geometric constraints , 1991 .

[13]  Vladimir Vapnik,et al.  The Nature of Statistical Learning , 1995 .

[14]  Michael C. Burl,et al.  Finding Faces in Cluttered Scenes Using Labeled Random Graph Matching. , 1995, ICCV 1995.

[15]  Federico Girosi,et al.  Training support vector machines: an application to face detection , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Yali Amit,et al.  A Computational Model for Visual Selection , 1999, Neural Computation.

[17]  Qian Chen,et al.  Face Detection From Color Images Using a Fuzzy Pattern Matching Method , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  David Haussler,et al.  What Size Net Gives Valid Generalization? , 1989, Neural Computation.

[19]  Eli Saber,et al.  Frontal-view face detection and facial feature extraction using color, shape and symmetry based cost functions , 1998, Pattern Recognit. Lett..