论文信息 - Bayesian Learning via Stochastic Dynamics

Bayesian Learning via Stochastic Dynamics

The attempt to find a single "optimal" weight vector in conventional network training can lead to overfitting and poor generalization. Bayesian methods avoid this, without the need for a validation set, by averaging the outputs of many networks with weights sampled from the posterior distribution given the training data. This sample can be obtained by simulating a stochastic dynamical system that has the posterior as its stationary distribution.

Radford M. Neal

[1] N. Metropolis,et al. Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[2] H. C. Andersen. Molecular dynamics simulations at constant pressure and/or temperature , 1980 .

[3] S. Duane,et al. Hybrid Monte Carlo , 1987 .

[4] Wray L. Buntine,et al. Bayesian Back-Propagation , 1991, Complex Syst..

[5] David J. C. MacKay,et al. A Practical Bayesian Framework for Backpropagation Networks , 1992, Neural Computation.

[6] Radford M. Neal. Bayesian training of backpropagation networks by the hybrid Monte-Carlo method , 1992 .