Paper information - Two Timescale Analysis of the Alopex Algorithm for Optimization

Alopex is a correlation-based, gradient-free optimization technique useful in many learning problems. However, there are no analytical results on the asymptotic behavior of this algorithm. This article presents a new version of Alopex that can be analyzed using techniques from two-timescale stochastic approximation. It is shown that the algorithm asymptotically behaves like a gradient-descent method, even though it neither needs nor estimates any gradient information. Simulations also show that the algorithm is quite effective.
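To make the "correlation-based, gradient-free" idea concrete, here is a minimal sketch of a basic Alopex-style update as it is commonly described. It is not the two-timescale variant that the article analyzes. The function names (`alopex_minimize`, `sphere`), the fixed temperature, the fixed step size, and the quadratic test objective are all illustrative assumptions. Each coordinate moves by plus or minus a fixed step, and the direction is drawn from a probability that depends on the correlation between the previous move and the previous change in the objective.

```python
import math
import random


def alopex_minimize(objective, x0, step=0.01, temperature=1e-4,
                    iterations=5000, seed=0):
    """Basic correlation-based Alopex-style minimizer (illustrative sketch).

    Assumption: fixed step size and fixed temperature. This is NOT the
    two-timescale version analyzed in the article.
    """
    rng = random.Random(seed)
    x = list(x0)
    f_prev = objective(x)
    best_x, best_f = list(x), f_prev
    # First move: an unbiased random +/- step in every coordinate.
    dx = [rng.choice((-step, step)) for _ in x]
    for _ in range(iterations):
        x = [xi + di for xi, di in zip(x, dx)]
        f_new = objective(x)
        if f_new < best_f:
            best_x, best_f = list(x), f_new
        df = f_new - f_prev
        f_prev = f_new
        next_dx = []
        for di in dx:
            # Correlation between this coordinate's last move and the change
            # in the objective. Positive means the move coincided with an
            # increase, so a negative step becomes more likely.
            z = max(-60.0, min(60.0, di * df / temperature))  # clamp to avoid overflow
            p_neg = 1.0 / (1.0 + math.exp(-z))
            next_dx.append(-step if rng.random() < p_neg else step)
        dx = next_dx
    return best_x, best_f


def sphere(x):
    """Hypothetical test objective: sum of squares, minimized at the origin."""
    return sum(xi * xi for xi in x)


if __name__ == "__main__":
    point, value = alopex_minimize(sphere, [1.0, -1.0])
    print(point, value)
```

Only objective values are used; no gradient is computed or estimated. The temperature sets how strongly the correlation biases each step: a large value makes the search close to a random walk, a small value makes it nearly deterministic.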
