Statistical analysis of learning dynamics

Shun-ichi Amari | Noboru Murata

[1] Vladimir N. Vapnik,et al. The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[2] Andrzej Cichocki,et al. Stability Analysis of Learning Algorithms for Blind Source Separation , 1997, Neural Networks.

[3] Klaus-Robert Müller,et al. Asymptotic statistical theory of overtraining and cross-validation , 1997, IEEE Trans. Neural Networks.

[4] Andreas Ziehe,et al. Adaptive On-line Learning in Changing Environments , 1996, NIPS.

[5] Steve Rogers,et al. Adaptive Filter Theory , 1996 .

[6] Klaus Schulten,et al. A Numerical Study on Learning Curves in Stochastic Multilayer Feedforward Networks , 1996, Neural Computation.

[7] M.H. Hassoun,et al. Fundamentals of Artificial Neural Networks , 1996, Proceedings of the IEEE.

[8] Michael Biehl,et al. Learning by on-line gradient descent , 1995 .

[9] Jong-Hoon Oh,et al. Neural networks : the statistical mechanics perspective : proceedings of the CTP-PBSRI Joint Workshop on Theoretical Physics, POSTECH, Pohang, Korea, 2-4 February 95 , 1995 .

[10] Haim Sompolinsky,et al. On-line Learning of Dichotomies: Algorithms and Learning Curves. , 1995, NIPS 1995.

[11] Yoshua Bengio,et al. Pattern Recognition and Neural Networks , 1995 .

[12] Shun-ichi Amari,et al. Network information criterion-determining the number of hidden units for an artificial neural network model , 1994, IEEE Trans. Neural Networks.

[13] Pierre Comon,et al. Independent component analysis, A new concept? , 1994, Signal Process..

[14] G. Kane. Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol 1: Foundations, vol 2: Psychological and Biological Models , 1994 .

[15] S. Haykin,et al. Neural Networks: A Comprehensive Foundation , 1994 .

[16] Andrew R. Barron,et al. Universal approximation bounds for superpositions of a sigmoidal function , 1993, IEEE Trans. Inf. Theory.

[17] Shun-ichi Amari,et al. A universal theorem on learning curves , 1993, Neural Networks.

[18] B. Widrow,et al. Adaptive inverse control , 1987, Proceedings of 8th IEEE International Symposium on Intelligent Control.

[19] Shun-ichi Amari,et al. Statistical Theory of Learning Curves under Entropic Loss Criterion , 1993, Neural Computation.

[20] Andrzej Cichocki,et al. Neural networks for optimization and signal processing , 1993 .

[21] Hilbert J. Kappen,et al. On-line learning processes in artificial neural networks , 1993 .

[22] O. Kinouchi,et al. Optimal generalization in perceptrons , 1992 .

[23] Shun-ichi Amari,et al. Learning Curves, Model Selection and Complexity of Neural Networks , 1992, NIPS.

[24] P. Comon. Independent Component Analysis , 1992 .

[25] Sompolinsky,et al. Statistical mechanics of learning from examples. , 1992, Physical review. A, Atomic, molecular, and optical physics.

[26] Thomas M. Cover,et al. Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing) , 2006 .

[27] John E. Moody,et al. The Effective Number of Parameters: An Analysis of Generalization and Regularization in Nonlinear Learning Systems , 1991, NIPS.

[28] Heskes,et al. Learning processes in neural networks. , 1991, Physical review. A, Atomic, molecular, and optical physics.

[29] David Haussler,et al. Calculation of the learning curve of Bayes optimal classification algorithm for learning a perceptron with noise , 1991, COLT '91.

[30] Christian Jutten,et al. Blind separation of sources, part I: An adaptive algorithm based on neuromimetic architecture , 1991, Signal Process..

[31] Simon Haykin,et al. Adaptive filter theory (2nd ed.) , 1991 .

[32] Thomas M. Cover,et al. Elements of Information Theory , 2005 .

[33] D. B. Fogel,et al. An Information Criterion for Optimal Neural Network Selection , 1990, Conference Record Twenty-Fourth Asilomar Conference on Signals, Systems and Computers.

[34] Shun-ichi Amari,et al. Mathematical foundations of neurocomputing , 1990, Proc. IEEE.

[35] F. Girosi,et al. Networks for approximation and learning , 1990, Proc. IEEE.

[36] W. Kinzel. Physics of Neural Networks , 1990 .

[37] Halbert White,et al. Learning in Artificial Neural Networks: A Statistical Perspective , 1989, Neural Computation.

[38] Esther Levin,et al. A statistical approach to learning and generalization in layered neural networks , 1989, Proc. IEEE.

[39] Marvin Minsky,et al. Perceptrons: An Introduction to Computational Geometry, Expanded Edition , 1987 .

[40] J. Rissanen. Stochastic Complexity and Modeling , 1986 .

[41] James L. McClelland,et al. Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[42] Shun-ichi Amari,et al. Differential-geometrical methods in statistics , 1985 .

[43] K. Takeuchi,et al. Asymptotic efficiency of statistical estimators : concepts and higher order asymptotic efficiency , 1981 .

[44] Masafumi Akahira,et al. Asymptotic efficiency of statistical estimators : concepts and higher order asymptotic efficiency , 1981 .

[45] H. Akaike. A new look at the statistical model identification , 1974 .

[46] Marvin Minsky,et al. Perceptrons: An Introduction to Computational Geometry , 1969 .

[47] Shun-ichi Amari,et al. A Theory of Adaptive Pattern Classifiers , 1967, IEEE Trans. Electron. Comput..

[48] E. L. Lehmann,et al. Theory of point estimation , 1950 .