Multi-layer Perceptron Error Surfaces: Visualization, Structure and Modelling
 Lutz Prechelt. A study of experimental evaluations of neural network learning algorithms: current research practice , 1994 .
 Peter Auer,et al. Exponentially many local minima for single neurons , 1995, NIPS.
 Heidar A. Malki,et al. Using the Karhunen-Loe've transformation in the back-propagation training algorithm , 1991, IEEE Trans. Neural Networks.
 Joachim Diederich,et al. Survey and critique of techniques for extracting rules from trained artificial neural networks , 1995, Knowl. Based Syst..
 Bryan P. Bergeron. Using a spreadsheet metaphor to visualize neural network behavior , 1990 .
 Brian Everitt,et al. Graphical Techniques for Multivariate Data. , 1978 .
 Stefan M. Rüger,et al. Clustering in Weight Space of Feedforward Nets , 1996, ICANN.
 Norio Baba,et al. A new approach for finding the global minimum of error function of neural networks , 1989, Neural Networks.
 Jude W. Shavlik,et al. Combining the Predictions of Multiple Classifiers: Using Competitive Learning to Initialize Neural Networks , 1995, IJCAI.
 Benjamin W. Wah,et al. Global Optimization for Neural Network Training , 1996, Computer.
 Wim Hordijk,et al. A Measure of Landscapes , 1996, Evolutionary Computation.
 R. H. Glendinning,et al. Multivariate Density Estimation, Theory, Practice and Visualization , 1992 .
 Tony Plate,et al. Visualizing the Function Computed by a Feedforward Neural Network , 2000, Neural Computation.
 Jeffrey L. Elman,et al. Distributed Representations, Simple Recurrent Networks, and Grammatical Structure , 1991, Mach. Learn..
 Ralf Salomon,et al. Raising Theoretical Questions About the Utility of Genetic Algorithms , 1997, Evolutionary Programming.
 Seunghwan Kim,et al. Chaotic dynamics and the geometry of the error surface in neural networks , 1992 .
 Bhaskar D. Rao,et al. A generalized learning paradigm exploiting the structure of feedforward neural networks , 1996, IEEE Trans. Neural Networks.
 Emile Fiesler,et al. Neural Network Initialization , 1995, IWANN.
 Roberto Battiti,et al. Accelerated Backpropagation Learning: Two Optimization Methods , 1989, Complex Syst..
 Steve R. White,et al. Configuration Space Analysis for Optimization Problems , 1986 .
 Brian Everitt,et al. Cluster analysis , 1974 .
 J. Beasley. Population Heuristics , 1999 .
 Tom Tollenaere,et al. SuperSAB: Fast adaptive back propagation with good scaling properties , 1990, Neural Networks.
 Kurt Hornik,et al. Neural networks and principal component analysis: Learning from examples without local minima , 1989, Neural Networks.
 Marco Dorigo,et al. Ant system: optimization by a colony of cooperating agents , 1996, IEEE Trans. Syst. Man Cybern. Part B.
 Marco Gori,et al. Optimal convergence of on-line backpropagation , 1996, IEEE Trans. Neural Networks.
 YoungJu Choie,et al. Local minima and back propagation , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.
 Adrian J. Shepherd,et al. Second-Order Methods for Neural Networks , 1997 .
 Wojtek J. Krzanowski,et al. Principles of multivariate analysis : a user's perspective. oxford , 1988 .
 Jürgen Schmidhuber,et al. Simplifying Neural Nets by Discovering Flat Minima , 1994, NIPS.
 Virginia L. Stonick,et al. 488 Solutions to the XOR Problem , 1996, NIPS.
 Lee Altenberg,et al. Fitness Distance Correlation Analysis: An Instructive Counterexample , 1997, ICGA.
 David A. Medler. A Brief History of Connectionism , 1998 .
 Martin A. Riedmiller,et al. Advanced supervised learning in multi-layer perceptrons — From backpropagation to adaptive learning algorithms , 1994 .
 Ray A. Jarvis,et al. Adaptive Global Search by the Process of Competitive Evolution , 1975, IEEE Transactions on Systems, Man, and Cybernetics.
 Ah Chung Tsoi,et al. Comments on local minima free conditions in multilayer perceptrons , 1998, IEEE Trans. Neural Networks.
 Russell W. Anderson. Biased Random-Walk Learning: A Neurobiological Correlate to Trial-and-Error , 1993, adap-org/9305002.
 Jondarr Gibb. Back propagation Family Album , 1996 .
 Stuart A. Kauffman,et al. ORIGINS OF ORDER , 2019, Origins of Order.
 Martin Pelikan,et al. Hill Climbing with Learning (An Abstraction of Genetic Algorithm) , 1995 .
 Mohamed Slimane,et al. A Critical and Empirical Study of Epistasis Measures for Predicting GA Performances: A Summary , 1997, Artificial Evolution.
 Leonid Kruglyak. How to Solve the N Bit Encoder Problem with Just Two Hidden Units , 1990, Neural Computation.
 Shumeet Baluja,et al. Genetic Algorithms and Explicit Search Statistics , 1996, NIPS.
 R. Fletcher. Practical Methods of Optimization , 1988 .
 Tamás D. Gedeon,et al. An improved technique in porosity prediction: a neural network approach , 1995, IEEE Trans. Geosci. Remote. Sens..
 Douglass J. Wilde,et al. Foundations of Optimization. , 1967 .
 Helen G. Cobb. Is the Genetic Algorithm a Cooperative Learner? , 1992, FOGA.
 Eduardo D. Sontag,et al. Backpropagation Can Give Rise to Spurious Local Minima Even for Networks without Hidden Layers , 1989, Complex Syst..
 Shin'ichi Tamura,et al. Capabilities of a four-layered feedforward neural network: four layers versus three , 1997, IEEE Trans. Neural Networks.
 E. K. Blum,et al. Approximation of Boolean Functions by Sigmoidal Networks: Part I: XOR and Other Two-Variable Functions , 1989, Neural Computation.
 Louise Travé-Massuyès,et al. Telephone Network Traffic Overloading Diagnosis and Evolutionary Computation Techniques , 1997, Artificial Evolution.
 Catherine Blake,et al. UCI Repository of machine learning databases , 1998 .
 Jason Williams,et al. Neuralis: an artificial neural network package , 1996, ITiCSE '96.
 Michael A. Arbib,et al. Part II: road maps , 1998 .
 Shumeet Baluja,et al. A Method for Integrating Genetic Search Based Function Optimization and Competitive Learning , 1994 .
 Christopher M. Bishop,et al. A Hierarchical Latent Variable Model for Data Visualization , 1998, IEEE Trans. Pattern Anal. Mach. Intell..
 Alberto Tesi,et al. On the Problem of Local Minima in Backpropagation , 1992, IEEE Trans. Pattern Anal. Mach. Intell..
 D. Rumelhart,et al. The effective dimension of the space of hidden units , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.
 Tim Jones. Evolutionary Algorithms, Fitness Landscapes and Search , 1995 .
 Janet Wiles,et al. The N-2-N Encoder: A Matter of Representation , 1993 .
 M. Servais,et al. Function Optimisation Using Multiple-base Population Based Incremental Learning , 1997 .
 Len Hamey. Analysis of the error surface of the XOR network with two hidden nodes , 1996 .
 Kenneth Dean Boese,et al. Models for iterative global optimization , 1996 .
 Jan Paredis,et al. Exploiting constraints as background knowledge for evolutionary algorithms , 1997 .
 S. J. Huang,et al. Training algorithm based on Newton's method with dynamic error control , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.
 D. Hamad,et al. Interactive pattern classification by means of artificial neural networks , 1995, 1995 IEEE International Conference on Systems, Man and Cybernetics. Intelligent Systems for the 21st Century.
 Paul A. Viola,et al. MIMIC: Finding Optima by Estimating Probability Densities , 1996, NIPS.
 Jude W. Shavlik,et al. Visualizing Learning and Computation in Artificial Neural Networks , 1992, Int. J. Artif. Intell. Tools.
 W. Cleveland,et al. The elements of graphing data , 1985 .
 Leonard G. C. Hamey,et al. The structure of neural network error surfaces , 1995 .
 Ida G. Sprinkhuizen-Kuyper,et al. The error surface of the 2-2-1 XOR network: The finite stationary points , 1998, Neural Networks.
 Lawrence D. Jackel,et al. Large Automatic Learning, Rule Extraction, and Generalization , 1987, Complex Syst..
 Penny Rheingans,et al. Visualizing structure in high-dimensional multivariate data , 1991, IBM J. Res. Dev..
 David Birnbaum. WS Cleveland .The Elements of Graphing Data. , 1996 .
 Sheng-De Wang,et al. A self growing learning algorithm for determining the appropriate number of hidden units , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.
 David H. Wolpert,et al. What makes an optimization problem hard? , 1995, Complex..
 Andrew B. Kahng. Exploiting fractalness of error surfaces: New methods for neural network learning , 1992, [Proceedings] 1992 IEEE International Symposium on Circuits and Systems.
 Bernard Widrow,et al. Scaled stochastic methods for training neural networks , 1996 .
 Derek Partridge. Network generalization differences quantified , 1996, Neural Networks.
 Lutz Prechelt. Some notes on neural learning algorithm benchmarking , 1995, Neurocomputing.
 K. Lang,et al. Learning to tell two spirals apart , 1988 .
 David A. Landgrebe,et al. Supervised classification in high-dimensional space: geometrical, statistical, and asymptotical properties of multivariate data , 1998, IEEE Trans. Syst. Man Cybern. Part C.
 Heekuck Oh,et al. Neural Networks for Pattern Recognition , 1993, Adv. Comput..
 Paul C. Kainen,et al. Functionally Equivalent Feedforward Neural Networks , 1994, Neural Computation.
 S. Ergezinger,et al. An accelerated learning algorithm for multilayer perceptrons: optimization layer by layer , 1995, IEEE Trans. Neural Networks.
 Partha Pratim Kanjilal,et al. On the application of orthogonal transformation for the design and analysis of feedforward networks , 1995, IEEE Trans. Neural Networks.
 Paolo Frasconi,et al. Learning in multilayered networks used as autoassociators , 1995, IEEE Trans. Neural Networks.
 S. Baluja. An Empirical Comparison of Seven Iterative and Evolutionary Function Optimization Heuristics , 1995 .
 Yoshio Mogami,et al. A hybrid algorithm for finding the global minimum of error function of neural networks and its applications , 1994, Neural Networks.
 Rich Caruana,et al. Removing the Genetics from the Standard Genetic Algorithm , 1995, ICML.
 R. Salomon. Re-evaluating genetic algorithm performance under coordinate rotation of benchmark functions. A survey of some theoretical and practical aspects of genetic algorithms. , 1996, Bio Systems.
 Emile Fiesler,et al. High-order and multilayer perceptron initialization , 1997, IEEE Trans. Neural Networks.
 Doris Aaronson,et al. Visualization of multivariate data: Human-factors considerations , 1995 .
 J. Urgen Branke. Evolutionary Algorithms for Neural Network Design and Training , 1995 .
 Russell Reed,et al. Pruning algorithms-a survey , 1993, IEEE Trans. Neural Networks.
 Mohammad Bagher Menhaj,et al. Training feedforward networks with the Marquardt algorithm , 1994, IEEE Trans. Neural Networks.
 Christian J. Darken. Stochastic approximation and neural network learning , 1998 .
 Martin Fodslette Meiller. A Scaled Conjugate Gradient Algorithm for Fast Supervised Learning , 1993 .
 Terrence J. Sejnowski,et al. Tempering Backpropagation Networks: Not All Weights are Created Equal , 1995, NIPS.
 Reuven Y. Rubinstein,et al. Optimization of computer simulation models with rare events , 1997 .
 David E. Goldberg,et al. Genetic Algorithms in Search Optimization and Machine Learning , 1988 .
 Markus Höhfeld,et al. Improving the Generalization Performance of Multi-Layer-Perceptrons with Population-Based Incremental Learning , 1996, PPSN.
 Luís B. Almeida,et al. Speeding up Backpropagation , 1990 .
 Xin Yao,et al. Evolutionary Artificial Neural Networks , 1993, Int. J. Neural Syst..
 Andrew B. Kahng,et al. Simulated annealing of neural networks: The 'cooling' strategy reconsidered , 1993, 1993 IEEE International Symposium on Circuits and Systems.
 W. Kinzel. Physics of Neural Networks , 1990 .
 Rajesh Parekh,et al. Analysis of Decision Boundaries Generated by Constructive Neural Network Learning Algorithms , 1995 .
 S. Thomas Alexander,et al. Adaptive Signal Processing , 1986, Texts and Monographs in Computer Science.
 Robert A. Jacobs,et al. Increased rates of convergence through learning rate adaptation , 1987, Neural Networks.
 D. Rumelhart,et al. Generalization through Minimal Networks with Application to Forecasting , 1992 .
 S. Baluja,et al. Using Optimal Dependency-Trees for Combinatorial Optimization: Learning the Structure of the Search Space , 1997 .
 Geoffrey E. Hinton,et al. Distributed Representations , 1986, The Philosophy of Artificial Intelligence.
 Roger J.-B. Wets,et al. Minimization by Random Search Techniques , 1981, Math. Oper. Res..
 Radford M. Neal. Assessing Relevance determination methods using DELVE , 1998 .
 G. Toulouse,et al. Ultrametricity for physicists , 1986 .
 I. Cloete,et al. Animating neural network training , 1992 .
 R. Hecht-Nielsen,et al. Theory of the Back Propagation Neural Network , 1989 .
 Tamás D. Gedeon,et al. Simulated annealing and weight decay in adaptive learning: the SARPROP algorithm , 1998, IEEE Trans. Neural Networks.
 J. Elman. Distributed Representations, Simple Recurrent Networks, And Grammatical Structure , 1991 .
 D. R. Hush,et al. Error surfaces for multi-layer perceptrons , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.
 Albert Y. Zomaya,et al. Toward generating neural network structures for function approximation , 1994, Neural Networks.
 Martin A. Riedmiller,et al. A direct adaptive method for faster backpropagation learning: the RPROP algorithm , 1993, IEEE International Conference on Neural Networks.
 Héctor J. Sussmann,et al. Uniqueness of the weights for minimal feedforward nets with a given input-output map , 1992, Neural Networks.
 Ida G. Sprinkhuizen-Kuyper,et al. The Error Surface of the Simplest XOR Network Has Only Global Minima , 1996, Neural Computation.
 James Kennedy,et al. Particle swarm optimization , 1995, Proceedings of ICNN'95 - International Conference on Neural Networks.
 M. Conrad. The geometry of evolution. , 1990, Bio Systems.
 Tad Hogg,et al. Solving the Really Hard Problems with Cooperative Search , 1993, AAAI.
 Hans-Paul Schwefel,et al. Numerical optimization of computer models , 1981 .
 M. Conrad,et al. M.V. Volkenstein, evolutionary thinking and the structure of fitness landscapes. , 1992, Bio Systems.
 Roberto Battiti,et al. First- and Second-Order Methods for Learning: Between Steepest Descent and Newton's Method , 1992, Neural Computation.
 B. Orsier,et al. Another Hybrid Algorithm for Nding a Global Mimimum of Mlp Error Functions , 1996 .
 David A. Thomas,et al. Integrated mathematics, science, and technology: an introduction to scientific visualization , 1996 .
 Gerald Tesauro,et al. Visualizing processes in neural networks , 1991, IBM J. Res. Dev..
 Etienne Barnard,et al. Optimization for training neural nets , 1992, IEEE Trans. Neural Networks.
 Michèle Sebag,et al. Extending Population-Based Incremental Learning to Continuous Search Spaces , 1998, PPSN.
 Fabio Stella,et al. Some numerical aspects of the training problem for feed-forward neural nets , 1997, Neural Networks.
 Esther Levin,et al. Accelerated Learning in Layered Neural Networks , 1988, Complex Syst..
 Michèle Sebag,et al. Mimetic Evolution , 1997, Artificial Evolution.
 Gary J. Koehler,et al. Deterministic global optimal FNN training algorithms , 1994, Neural Networks.
 Dimitris A. Karras,et al. An efficient constrained training algorithm for feedforward networks , 1995, IEEE Trans. Neural Networks.
 Andrew G. Barto,et al. Learning as hill-climbing in weight space , 1998 .
 Simon Haykin,et al. Neural Networks: A Comprehensive Foundation , 1998 .
 Robert Hecht-Nielsen,et al. On the Geometry of Feedforward Neural Network Error Surfaces , 1993, Neural Computation.
 Marvin Minsky,et al. Perceptrons - an introduction to computational geometry , 1969 .
 Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .
 Yann LeCun,et al. Second Order Properties of Error Surfaces: Learning Time and Generalization , 1990, NIPS 1990.
 Peter F. Stadler,et al. Towards a theory of landscapes , 1995 .
 Leonard G. C. Hamey,et al. XOR has no local minima: A case study in neural network error surface analysis , 1998, Neural Networks.
 Anders Krogh,et al. Introduction to the theory of neural computation , 1994, The advanced book program.
 Stefan M. Rüger,et al. An analysis of the metric structure of the weight space of feedforward networks and its application to time series modeling and prediction , 1996, ESANN.
 Peter F. Stadler,et al. Amplitude Spectra of Fitness Landscapes , 1998, Adv. Complex Syst..
 Bernardo A. Huberman,et al. The performance of cooperative processes , 1990 .
 Thomas G. Dietterich. Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.
 William I. Grosky,et al. A fast algorithm for finding global minima of error functions in layered neural networks , 1990, 1990 IJCNN International Joint Conference on Neural Networks.
 Paul W. Munro. Visualizations of 2-D hidden unit space , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.
 William S. Cleveland,et al. Visualizing Data , 1993 .
 F. Jordan,et al. Using the symmetries of a multi-layered network to reduce the weight space , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.
 L. Darrell Whitley,et al. GENITOR II: a distributed genetic algorithm , 1990, J. Exp. Theor. Artif. Intell..
 Raúl Rojas. Oscillating iteration paths in neural networks learning , 1994, Comput. Graph..
 Elie Bienenstock,et al. Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.
 Raymond Lister. Visualizing weight dynamics in the N-2-N encoder , 1993, IEEE International Conference on Neural Networks.
 Harold M. Hastings,et al. The may-wigner stability theorem , 1982 .
 Yo Horikawa. Landscapes of basins of local minima in the XOR problem , 1993, Proceedings of 1993 International Conference on Neural Networks (IJCNN-93-Nagoya, Japan).
 Kurt Hornik,et al. Learning in linear neural networks: a survey , 1995, IEEE Trans. Neural Networks.
 Masanao Ohbayashi,et al. A new random search method for neural network learning-RasID , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).
 Jürgen Schmidhuber,et al. Flat Minima , 1997, Neural Computation.
 David B. Fogel,et al. Alternative Neural Network Training Methods , 1995, IEEE Expert.
 Raymond Lister. Fractal strategies for neural network scaling , 1998 .
 Werner Purgathofer,et al. Selected new trends in scientific visualization , 1998, Other Conferences.
 B. Manly. Multivariate Statistical Methods : A Primer , 1986 .
 Bruce E. Rosen,et al. VFSR trained artificial neural networks , 1993, Proceedings of 1993 International Conference on Neural Networks (IJCNN-93-Nagoya, Japan).
 Etienne Barnard,et al. A comparison between criterion functions for linear classifiers, with an application to neural nets , 1989, IEEE Trans. Syst. Man Cybern..
 John F. Kolen,et al. Backpropagation is Sensitive to Initial Conditions , 1990, Complex Syst..
 P. Lisboa,et al. Complete solution of the local minima in the XOR problem , 1991 .
 Simon Dennis,et al. Analysis Tools for Neural Networks , 1991 .
 Warren T. Jones,et al. DENDRITE: A system for visual interpretation of neural network data , 1992, Proceedings IEEE Southeastcon '92.
 Jack Sklansky,et al. A neural network that visualizes what it classifies , 1997, Pattern Recognit. Lett..
 John G. Taylor,et al. The New ERA in Supervised Learning , 1997, Neural Networks.
 Hans-Paul Schwefel,et al. Evolution and optimum seeking , 1995, Sixth-generation computer technology series.
 Lutz Prechelt. Investigation of the CasCor Family of Learning Algorithms , 1997, Neural Networks.
 John H. Holland,et al. Adaptation in natural and artificial systems , 1975 .
 Chih-Cheng Chen,et al. A fast multilayer neural-network training algorithm based on the layer-by-layer optimizing procedures , 1996, IEEE Trans. Neural Networks.
 Brian D. Ripley,et al. Pattern Recognition and Neural Networks , 1996 .
 N. Parga,et al. Ultrametricity, frustration and the graph colouring problem , 1989 .
 R. Summers,et al. Artificial neural networks: from black-box to grey-box modelling , 1994, Proceedings of 16th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.
 John F. Kolen,et al. Learning in parallel distributed processing networks: Computational complexity and information content , 1991, IEEE Trans. Syst. Man Cybern..
 Singiresu S. Rao. Engineering Optimization : Theory and Practice , 2010 .
 John W. Tukey,et al. Exploratory Data Analysis , 1980, ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems.
 S. Phillips. The E ect of Representation on Error Surface , 1993 .
 J. van Leeuwen,et al. Neural Networks: Tricks of the Trade , 2002, Lecture Notes in Computer Science.
 Stephen Robert Lawrence. Neural networks for real world tasks : limitations and solutions , 1997 .
 Robert M. Burton,et al. Convergence and divergence in neural networks: Processing of chaos and biological analogy , 1992, Neural Networks.
 M. Opper,et al. 5 Statistical Mechanics of Generalization , .
 Horst Bischof,et al. Constructing a neural network for the interpretation of the species of trees in aerial photographs , 1990,  Proceedings. 10th International Conference on Pattern Recognition.
 Jacques de Villiers,et al. Backpropagation neural nets with one and two hidden layers , 1993, IEEE Trans. Neural Networks.
 David B. Fogel,et al. An introduction to simulated evolutionary optimization , 1994, IEEE Trans. Neural Networks.
 Javier E. Vitela,et al. Premature saturation in backpropagation networks: Mechanism and necessary conditions , 1995 .
 J. Elman. Representation and structure in connectionist models , 1991 .
 Berndt Müller,et al. Neural networks: an introduction , 1990 .
 Stephen B. Vardeman. Graphical Methods for Data Analysis , 1984 .
 Bing J. Sheu,et al. Optimization schemes for neural network training , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).
 Roberto Battiti,et al. Training neural nets with the reactive tabu search , 1995, IEEE Trans. Neural Networks.
 Ultrametricity Transition in the Graph Colouring Problem , 1986 .
 Michael N. Vrahatis,et al. Geometry of learning: visualizing the performance of neural network supervised training methods , 1997 .
 G. J. Gibson,et al. On the decision regions of multilayer perceptrons , 1990, Proc. IEEE.
 Raúl Rojas. The fractal geometry of backpropagation , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).
 Oliver Wendt,et al. Cooperative Simulated Annealing: How much cooperation is enough ? , 1998 .
 Kennetb A. De. Genetic Algorithms Are NOT Function Optimizers , 1992 .
 R. Hecht-Nielsen. ON THE ALGEBRAIC STRUCTURE OF FEEDFORWARD NETWORK WEIGHT SPACES , 1990 .
 Thomas Bäck,et al. An Overview of Evolutionary Algorithms for Parameter Optimization , 1993, Evolutionary Computation.
 Robert Hecht-Nielsen. The Munificence of High Dimensionality , 1992 .
 Lutz Prechelt. PROBEN 1 - a set of benchmarks and benchmarking rules for neural network training algorithms , 1994 .
 Raúl Rojas. Visualizing the learning process for neural networks , 1994, ESANN.
 J. Edward Jackson,et al. A User's Guide to Principal Components. , 1991 .
 D. Wolpert,et al. No Free Lunch Theorems for Search , 1995 .
 Alexander Linden,et al. Inversion of neural networks by gradient descent , 1990, Parallel Comput..
 S. Kirkpatrick,et al. Configuration space analysis of travelling salesman problems , 1985 .
 John T. Behrens,et al. Applications of multivariate visualization to behavioral sciences , 1995 .
 George Cybenko,et al. Ill-Conditioning in Neural Network Training Problems , 1993, SIAM J. Sci. Comput..
 Geoffrey E. Hinton,et al. Bayesian Learning for Neural Networks , 1995 .
 Gerald Tesauro,et al. Neural Network Visualization , 1989, NIPS.
 Michael A. Arbib,et al. Part I: The Background , 1998 .
 Robert P. W. Duin,et al. Initializations, back-propagation and generalization of feed-forward classifiers , 1993, IEEE International Conference on Neural Networks.
 Wj Fitzgerald,et al. Optimization schemes for neural networks , 1993 .
 Todd K. Leen,et al. Weight Space Probability Densities in Stochastic Learning: II. Transients and Basin Hopping Times , 1992, NIPS.