论文信息 - Complex-valued convolutional networks yield data-driven multiscale windowed spectra

Complex-valued convolutional networks yield data-driven multiscale windowed spectra

Abstract: A complex-valued convolutional network (convnet) implements the repeated application of the following composition of three operations, recursively applying the composition to an input vector of nonnegative real numbers: (1) convolution with complex-valued vectors followed by (2) taking the absolute value of every entry of the resulting vectors followed by (3) local averaging. For processing real-valued random vectors, complex-valued convnets can be viewed as “data-driven multiscale windowed power spectra,” “data-driven multiscale windowed absolute spectra,” “datadriven multiwavelet absolute values,” or (in their most general configuration) “data-driven nonlinear multiwavelet packets.” Indeed, complex-valued convnets can calculate multiscale windowed spectra when the convnet filters are windowed complex-valued exponentials. Standard real-valued convnets, using rectified linear units (ReLUs), sigmoidal (for example, logistic or tanh) nonlinearities, max. pooling, etc., do not obviously exhibit the same exact correspondence with data-driven wavelets (whereas for complex-valued convnets, the correspondence is much more than just a vague analogy). Courtesy of the exact correspondence, the remarkably rich and rigorous body of mathematical analysis for wavelets applies directly to (complex-valued) convnets.

[1] Olaf Hellwich,et al. Complex-Valued Convolutional Neural Networks for Object Detection in PolSAR data , 2010 .

[2] S. Mallat,et al. Invariant Scattering Convolution Networks , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3] Stéphane Mallat,et al. Deep roto-translation scattering for object classification , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Stéphane Mallat,et al. Locally stationary covariance and signal estimation with macrotiles , 2003, IEEE Trans. Signal Process..

[5] Yuandong Tian,et al. Scale-invariant learning and convolutional networks , 2015, ArXiv.

[6] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..

[7] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .

[8] Y. Meyer. Wavelets and Operators , 1993 .

[9] Luc Van Gool,et al. Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[10] S. Mallat. Recursive interferometric representations , 2010, 2010 18th European Signal Processing Conference.

[11] David J. Schwab,et al. An exact mapping between the Variational Renormalization Group and Deep Learning , 2014, ArXiv.

[12] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[13] Ronald R. Coifman,et al. Signal processing and compression with wavelet packets , 1994 .

[14] D. Donoho,et al. Translation-Invariant DeNoising , 1995 .

[15] Ronald R. Coifman,et al. Local discriminant bases and their applications , 1995, Journal of Mathematical Imaging and Vision.

[16] S. Mallat,et al. Intermittent process analysis with scattering moments , 2013, 1311.4104.

[17] Stéphane Mallat,et al. A Wavelet Tour of Signal Processing - The Sparse Way, 3rd Edition , 2008 .

[18] David G. Lowe,et al. Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[19] William T. Freeman,et al. Presented at: 2nd Annual IEEE International Conference on Image , 1995 .

[20] Lorenzo Rosasco,et al. The computational magic of the ventral stream: sketch of a theory (and why some deep architectures work). , 2012 .

[21] Ronald W. Schafer,et al. Introduction to Digital Speech Processing , 2007, Found. Trends Signal Process..

[22] Eero P. Simoncelli,et al. On Advances in Statistical Modeling of Natural Images , 2004, Journal of Mathematical Imaging and Vision.

[23] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[24] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[25] I. Daubechies. Ten Lectures on Wavelets , 1992 .

[26] Y. Meyer,et al. Wavelets: Calderón-Zygmund and Multilinear Operators , 1997 .

[27] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).