Backpropagation and stochastic gradient descent method

[1]  Shun-ichi Amari,et al.  Four Types of Learning Curves , 1992, Neural Computation.

[2]  Shun-ichi Amari,et al.  Information geometry of Boltzmann machines , 1992, IEEE Trans. Neural Networks.

[3]  Heskes,et al.  Learning processes in neural networks , 1991, Physical review. A, Atomic, molecular, and optical physics.

[4]  Shun-ichi Amari,et al.  Dualistic geometry of the manifold of higher-order neurons , 1991, Neural Networks.

[5]  Richard Lippmann,et al.  Neural Network Classifiers Estimate Bayesian a posteriori Probabilities , 1991, Neural Computation.

[6]  Barak A. Pearlmutter,et al.  Equivalence Proofs for Multi-Layer Perceptron Classifiers and the Bayesian Discriminant Function , 1991 .

[7]  Shun-ichi Amari,et al.  Mathematical foundations of neurocomputing , 1990, Proc. IEEE.

[8]  T Poggio,et al.  Regularization Algorithms for Learning That Are Equivalent to Multilayer Networks , 1990, Science.

[9]  Halbert White,et al.  Learning in Artificial Neural Networks: A Statistical Perspective , 1989, Neural Computation.

[10]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[11]  Shun-ichi Amari,et al.  Differential-geometrical methods in statistics , 1985 .

[12]  S. Amari Differential Geometry of Statistical Models , 1985 .

[13]  T. Kohonen Self-organized formation of topographically correct feature maps , 1982 .

[14]  P. Werbos,et al.  Beyond Regression: New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[15]  M. T. Wasan Stochastic Approximation , 1969 .

[16]  Shun-ichi Amari,et al.  A Theory of Adaptive Pattern Classifiers , 1967, IEEE Trans. Electron. Comput..

[17]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[18]  H. D. Block The perceptron: a model for brain functioning. I , 1962 .

[19]  H. D. Block,et al.  Analysis of a Four-Layer Series-Coupled Perceptron. II , 1962 .

[20]  S. Kullback Information Theory and Statistics , 1959 .