Improving model selection by nonconvergent methods

[1]  W. Finnoff,et al.  Detecting structure in small datasets by network fitting under complexity constraints , 1994, COLT 1994.

[2]  William Finnoff,et al.  Diffusion Approximations for the Constant Learning Rate Backpropagation Algorithm and Resistance to Local Minima , 1992, Neural Computation.

[3]  L. Ljung,et al.  Overtraining, Regularization, and Searching for Minimum in Neural Networks , 1992 .

[4]  Hans-Georg Zimmermann,et al.  A comparison of weight elimination methods for reducing complexity in neural networks , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[5]  Vladimir Vapnik,et al.  Principles of Risk Minimization for Learning Theory , 1991, NIPS.

[6]  Isabelle Guyon,et al.  Structural Risk Minimization for Character Recognition , 1991, NIPS.

[7]  Pierre Baldi,et al.  Temporal Evolution of Generalization during Learning in Linear Networks , 1991, Neural Computation.

[8]  D. Rumelhart,et al.  The effective dimension of the space of hidden units , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.

[9]  W. Finnoff.  Complexity measures for classes of neural networks with variable weight bounds , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.

[10]  David E. Rumelhart,et al.  Generalization by Weight-Elimination with Application to Forecasting , 1990, NIPS.

[11]  John E. Moody,et al.  Note on Learning Rate Schedules for Stochastic Optimization , 1990, NIPS.

[12]  Halbert White,et al.  Learning in Artificial Neural Networks: A Statistical Perspective , 1989, Neural Computation.

[13]  William Finnoff,et al.  Diffusion Approximations for the Constant Step Size Backpropagation Algorithm and Resistance to Local Minima , 1992, NIPS.

[14]  Hans-Georg Zimmermann,et al.  Domain Independent Testing and Performance Comparisons for Neural Networks , 1992 .

[15]  Shigeo Abe,et al.  Optimal Input Selection of Neural Networks by Sensitivity Analysis and Its Application to Image Recognition , 1990, MVA.

[16]  David E. Rumelhart,et al.  Predicting the Future: a Connectionist Approach , 1990, Int. J. Neural Syst..

[17]  Christian Lebiere,et al.  The Cascade-Correlation Learning Architecture , 1989, NIPS.

[18]  Hervé Bourlard,et al.  Generalization and Parameter Estimation in Feedforward Nets: Some Experiments , 1989, NIPS.

[19]  Yann LeCun,et al.  Optimal Brain Damage , 1989, NIPS.

[20]  M. C. Jones,et al.  Spline Smoothing and Nonparametric Regression , 1989 .

[21]  Lorien Y. Pratt,et al.  Comparing Biases for Minimal Network Construction with Back-Propagation , 1988, NIPS.

[22]  M. Stone.  Cross-validation: a review , 1978 .