Abstract. Use of the logistic derivative in backward error propagation suggests one source of ill-conditioning: the multiplier applied in computing the elements of the gradient shrinks at each successive layer. A compensatory rescaling is suggested, based heuristically on the expected value of the multiplier. Experimental results demonstrate an order-of-magnitude improvement in convergence.
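A minimal sketch of the idea, under stated assumptions: the abstract does not give the exact rescaling rule, so this example assumes the gradient at each layer is multiplied by the inverse of the expected logistic-derivative multiplier. If the activations a are roughly uniform on (0, 1), then E[a(1 - a)] = 1/6, so the error signal shrinks by about a factor of 6 per layer and the compensation multiplies layer gradients by 6, 36, ... counting back from the output. The network, function names, and learning rate are illustrative, not taken from the paper.

```python
import numpy as np

def logistic(x):
    return 1.0 / (1.0 + np.exp(-x))

def backprop_with_rescaling(weights, x, target, lr=0.1):
    """One gradient step on a squared-error loss for a logistic MLP,
    rescaling each layer's gradient to offset the shrinking multiplier."""
    # Forward pass, keeping activations for the backward pass.
    activations = [x]
    for W in weights:
        activations.append(logistic(activations[-1] @ W))

    # Backward pass. delta carries the error signal; each application of
    # the logistic derivative a * (1 - a) shrinks it by ~1/6 on average.
    delta = activations[-1] - target        # dL/d(output) for squared error
    expected_multiplier = 1.0 / 6.0         # E[a(1-a)] for a ~ U(0, 1); an assumption
    scale = 1.0
    for l in reversed(range(len(weights))):
        a = activations[l + 1]
        delta = delta * a * (1.0 - a)       # logistic-derivative multiplier
        scale /= expected_multiplier        # compensate: 6, 36, ... per layer
        grad = activations[l].T @ delta
        delta = delta @ weights[l].T        # propagate before updating weights
        weights[l] -= lr * scale * grad     # rescaled gradient step
    return weights

# Toy usage: a three-layer network on random data.
rng = np.random.default_rng(0)
weights = [rng.normal(scale=0.5, size=(4, 8)),
           rng.normal(scale=0.5, size=(8, 8)),
           rng.normal(scale=0.5, size=(8, 2))]
x = rng.normal(size=(16, 4))
target = rng.uniform(size=(16, 2))
weights = backprop_with_rescaling(weights, x, target)
```

The per-layer scale plays the same role as a layer-dependent learning rate: deeper (earlier) layers, whose raw gradients have passed through more logistic-derivative multipliers, receive proportionally larger steps.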