Neural Networks and the Bias/Variance Dilemma
 U. Grenander. On empirical spectral analysis of stochastic processes , 1952 .
 J. Lamperti. ON CONVERGENCE OF STOCHASTIC PROCESSES , 1962 .
 Frank Rosenblatt,et al. PRINCIPLES OF NEURODYNAMICS. PERCEPTRONS AND THE THEORY OF BRAIN MECHANISMS , 1963 .
 R. Bellman,et al. V. Adaptive Control Processes , 1964 .
 Shun-ichi Amari,et al. A Theory of Adaptive Pattern Classifiers , 1967, IEEE Trans. Electron. Comput..
 David R. Cox. The analysis of binary data , 1970 .
 L. Baum,et al. An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .
 H. Akaike,et al. Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .
 Martin A. Fischler,et al. The Representation and Matching of Pictorial Structures , 1973, IEEE Transactions on Computers.
 Richard O. Duda,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.
 G. Wahba,et al. A completely automatic french curve: fitting spline functions by cross validation , 1975 .
 M. Stone,et al. Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .
 Stephen A. Ritz,et al. Distinctive features, categorical perception, and probability learning: some applications of a neural model , 1977 .
 D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
 G. Wahba. Convergence rates of "thin plate" smoothing splines wihen the data are noisy , 1979 .
 J. Friedman,et al. Projection Pursuit Regression , 1981 .
 E. F. Schuster,et al. On the Nonconsistency of Maximum Likelihood Nonparametric Density Estimators , 1981 .
 David J. Burr,et al. Elastic Matching of Line Drawings , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.
 Grace Wahba,et al. Constrained Regularization for Ill Posed Linear Operator Equations, with Applications in Meteorology and Medicine. , 1982 .
 S. Geman,et al. Nonparametric Maximum Likelihood Estimation by the Method of Sieves , 1982 .
 E. Cook,et al. A computer-derived protocol to aid in the diagnosis of emergency room patients with acute chest pain. , 1982, The New England journal of medicine.
 Leo Breiman,et al. Classification and Regression Trees , 1984 .
 L. Shepp,et al. A Statistical Model for Positron Emission Tomography , 1985 .
 J. Friedman,et al. Estimating Optimal Transformations for Multiple Regression and Correlation. , 1985 .
 G. Wahba. A Comparison of GCV and GML for Choosing the Smoothing Parameter in the Generalized Spline Smoothing Problem , 1985 .
 Geoffrey E. Hinton,et al. A Learning Algorithm for Boltzmann Machines , 1985, Cogn. Sci..
 Grace Wahba,et al. A cross validated bayesian retrieval algorithm for nonlinear remote sensing experiments , 1985 .
 C. Malsburg,et al. Statistical Coding and Short-Term Synaptic Plasticity: A Scheme for Knowledge Representation in the Brain , 1986 .
 Geoffrey E. Hinton,et al. Learning representations by back-propagating errors , 1986, Nature.
 Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .
 C. von der Malsburg,et al. Am I Thinking Assemblies , 1986 .
 D. Freedman,et al. On the consistency of Bayes estimates , 1986 .
 J. Rissanen. Stochastic Complexity and Modeling , 1986 .
 Geoffrey E. Hinton,et al. Learning and relearning in Boltzmann machines , 1986 .
 Robin Sibson,et al. What is projection pursuit , 1987 .
 Lawrence D. Jackel,et al. Large Automatic Learning, Rule Extraction, and Generalization , 1987, Complex Syst..
 P. Carnevali,et al. Exhaustive Thermodynamical Analysis of Boolean Learning Networks , 1987 .
 E. Veklerov,et al. Stopping Rule for the MLE Algorithm Based on Statistical Hypothesis Testing , 1987, IEEE Transactions on Medical Imaging.
 D. W. Scott,et al. Biased and Unbiased Cross-Validation in Density Estimation , 1987 .
 R. Lippmann,et al. An introduction to computing with neural nets , 1987, IEEE ASSP Magazine.
 R. Dudley. Universal Donsker Classes and Metric Entropy , 1987 .
 Kevin J. Lang,et al. Speech recognition using time‐delay neural networks , 1988 .
 James A. Anderson,et al. Neurocomputing: Foundations of Research , 1988 .
 J. Marron. Automatic smoothing parameter selection: A survey , 1988 .
 Patrick Gallinari,et al. Multilayer perceptrons and data analysis , 1988, IEEE 1988 International Conference on Neural Networks.
 W. Härdle,et al. How Far are Automatically Chosen Regression Smoothing Parameters from their Optimum , 1988 .
 J. Fodor,et al. Connectionism and cognitive architecture: A critical analysis , 1988, Cognition.
 P. Smolensky. On the proper treatment of connectionism , 1988, Behavioral and Brain Sciences.
 Michael C. Mozer,et al. Skeletonization: A Technique for Trimming the Fat from a Network via Relevance Assessment , 1988, NIPS.
 S. Ghosh,et al. An application of a multiple neural network learning system to emulation of mortgage underwriting judgements , 1988, IEEE 1988 International Conference on Neural Networks.
 Isabelle Guyon. Réseaux de neurones pour la reconnaissance des formes : architectures et apprentissage , 1988 .
 Richard Lippmann,et al. Review of Neural Networks for Speech Recognition , 1989, Neural Computation.
 E Bienenstock,et al. Elastic matching and pattern recognition in neural networks. , 1989 .
 David Haussler,et al. What Size Net Gives Valid Generalization? , 1989, Neural Computation.
 Ruzena Bajcsy,et al. Multiresolution elastic matching , 1989, Comput. Vis. Graph. Image Process..
 Ken-ichi Funahashi,et al. On the approximate realization of continuous mappings by neural networks , 1989, Neural Networks.
 Robert Azencott. Synchronous Boltzmann Machines and Gibbs Fields: Learning Algorithms , 1989, NATO Neurocomputing.
 Halbert White,et al. Learning in Artificial Neural Networks: A Statistical Perspective , 1989, Neural Computation.
 Francis Crick,et al. The recent excitement about neural networks , 1989, Nature.
 Hervé Bourlard,et al. Generalization and Parameter Estimation in Feedforward Netws: Some Experiments , 1989, NIPS.
 A. Barron,et al. Statistical properties of artificial neural networks , 1989, Proceedings of the 28th IEEE Conference on Decision and Control,.
 Yves Chauvin. Dynamic Behavior of Constained Back-Propagation Networks , 1989, NIPS.
 George Cybenko,et al. Approximation by superpositions of a sigmoidal function , 1989, Math. Control. Signals Syst..
 Kurt Hornik,et al. Neural networks and principal component analysis: Learning from examples without local minima , 1989, Neural Networks.
 David Haussler,et al. Generalizing the PAC model: sample size bounds from metric dimension-based uniform convergence results , 1989, 30th Annual Symposium on Foundations of Computer Science.
 Lawrence D. Jackel,et al. Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.
 Naftali Tishby,et al. Consistent inference of probabilities in layered networks: predictions and generalizations , 1989, International 1989 Joint Conference on Neural Networks.
 Eric B. Baum,et al. A Proposal for More Powerful Learning Algorithms , 1989, Neural Computation.
 Kurt Hornik,et al. Multilayer feedforward networks are universal approximators , 1989, Neural Networks.
 T Poggio,et al. Regularization Algorithms for Learning That Are Equivalent to Multilayer Networks , 1990, Science.
 Alan L. Yuille,et al. Generalized Deformable Models, Statistical Physics, and Matching Problems , 1990, Neural Computation.
 Eric B. Baum,et al. The Perceptron Algorithm is Fast for Nonmalicious Distributions , 1990, Neural Computation.
 James D. Keeler,et al. Layered Neural Networks with Gaussian Hidden Units as Universal Approximations , 1990, Neural Computation.
 M. L. Rossen,et al. Experiments with Representation in Neural Networks: Object Motion, Speech, and Arithmetic , 1990 .
 Halbert White,et al. Connectionist nonparametric regression: Multilayer feedforward networks can learn arbitrary mappings , 1990, Neural Networks.
 Jenq-Neng Hwang,et al. Projection pursuit learning networks for regression , 1990,  Proceedings of the 2nd International IEEE Conference on Tools for Artificial Intelligence.
 Eric B. Baum,et al. When Are k-Nearest Neighbor and Back Propagation Accurate for Feasible Sized Sets of Examples? , 1990, EURASIP Workshop.
 Geoffrey E. Hinton,et al. The Bootstrap Widrow-Hoff Rule as a Cluster-Formation Algorithm , 1990, Neural Computation.
 Ehud D. Karnin,et al. A simple procedure for pruning back-propagation trained neural networks , 1990, IEEE Trans. Neural Networks.
 J. Faraway,et al. Bootstrap choice of bandwidth for density estimation , 1990 .
 Vijay K. Samalam,et al. Exhaustive Learning , 1990, Neural Computation.
 H. Bourlard,et al. Links Between Markov Models and Multilayer Perceptrons , 1990, IEEE Trans. Pattern Anal. Mach. Intell..
 James A. Pittman,et al. Recognizing Hand-Printed Letters and Digits Using Backpropagation Learning , 1991, Neural Computation.
 Andrew R. Barron,et al. Complexity Regularization with Application to Artificial Neural Networks , 1991 .
 U. Grenander,et al. Structural Image Restoration through Deformable Templates , 1991 .
 J. Freidman,et al. Multivariate adaptive regression splines , 1991 .
 Shun-ichi Amari,et al. Dualistic geometry of the manifold of higher-order neurons , 1991, Neural Networks.
 David Haussler,et al. Decision Theoretic Generalizations of the PAC Model for Neural Net and Other Learning Applications , 1992, Inf. Comput..
 Christoph von der Malsburg,et al. The Correlation Theory of Brain Function , 1994 .