论文信息 - Scalable Techniques from Nonparametric Statistics for Real Time Robot Learning

Scalable Techniques from Nonparametric Statistics for Real Time Robot Learning

Locally weighted learning (LWL) is a class of techniques from nonparametric statistics that provides useful representations and training algorithms for learning about complex phenomena during autonomous adaptive control of robotic systems. This paper introduces several LWL algorithms that have been tested successfully in real-time learning of complex robot tasks. We discuss two major classes of LWL, memory-based LWL and purely incremental LWL that does not need to remember any data explicitly. In contrast to the traditional belief that LWL methods cannot work well in high-dimensional spaces, we provide new algorithms that have been tested on up to 90 dimensional learning problems. The applicability of our LWL algorithms is demonstrated in various robot learning examples, including the learning of devil-sticking, pole-balancing by a humanoid robot arm, and inverse-dynamics learning for a seven and a 30 degree-of-freedom robot. In all these examples, the application of our statistical neural networks techniques allowed either faster or more accurate acquisition of motor control than classical control engineering.

[1] V. Mountcastle. Modality and topographic properties of single neurons of cat's somatic sensory cortex. , 1957, Journal of neurophysiology.

[2] D. Hubel,et al. Receptive fields of single neurones in the cat's striate cortex , 1959, The Journal of physiology.

[3] E. Nadaraya. On Estimating Regression , 1964 .

[4] G. S. Watson,et al. Smooth regression analysis , 1964 .

[5] Athanasios Papoulis,et al. Probability, Random Variables and Stochastic Processes , 1965 .

[6] J. Hájek. A course in nonparametric statistics , 1969 .

[7] Stephen R. McReynolds,et al. The computation and theory of optimal control , 1970 .

[8] Richard O. Duda,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[9] M. Stone. Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[10] G. Wahba,et al. A completely automatic french curve: fitting spline functions by cross validation , 1975 .

[11] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[12] Carl de Boor,et al. A Practical Guide to Splines , 1978, Applied Mathematical Sciences.

[13] W. Cleveland. Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[14] W. W. Muir,et al. Regression Diagnostics: Identifying Influential Data and Sources of Collinearity , 1980 .

[15] J. Friedman,et al. Projection Pursuit Regression , 1981 .

[16] Leo Breiman,et al. Classification and Regression Trees , 1984 .

[17] Lennart Ljung,et al. Theory and Practice of Recursive Identification , 1983 .

[18] D. J. Felleman,et al. Topographic reorganization of somatosensory cortical areas 3b and 1 in adult monkeys following restricted deafferentation , 1983, Neuroscience.

[19] J. Friedman. A VARIABLE SPAN SMOOTHER , 1984 .

[20] William H. Press,et al. Numerical Recipes in C The Art of Scientific Computing , 1995 .

[21] John J. Craig,et al. Introduction to Robotics Mechanics and Control , 1986 .

[22] R. H. Myers. Classical and modern regression with applications , 1986 .

[23] Farmer,et al. Predicting chaotic time series. , 1987, Physical review letters.

[24] Helge Ritter,et al. Topology conserving mappings for learning motor tasks , 1987 .

[25] Stephen Grossberg,et al. A massively parallel architecture for a self-organizing neural pattern recognition machine , 1988, Comput. Vis. Graph. Image Process..

[26] M. J. D. Powell,et al. Radial basis functions for multivariable interpolation: a review , 1987 .

[27] William H. Press,et al. Numerical Recipes in FORTRAN - The Art of Scientific Computing, 2nd Edition , 1987 .

[28] W. Cleveland,et al. Regression by local fitting: Methods, properties, and computational algorithms , 1988 .

[29] W. Cleveland,et al. Locally Weighted Regression: An Approach to Regression Analysis by Local Fitting , 1988 .

[30] D. Sparks,et al. Population coding of saccadic eye movements by neurons in the superior colliculus , 1988, Nature.

[31] J. Doyne Farmer,et al. Exploiting Chaos to Predict the Future and Reduce Noise , 1989 .

[32] Christopher G. Atkeson,et al. Using Local Models to Control Movement , 1989, NIPS.

[33] William H. Press,et al. Book-Review - Numerical Recipes in Pascal - the Art of Scientific Computing , 1989 .

[34] Terence D. Sanger,et al. Optimal unsupervised learning in a single-layer linear feedforward neural network , 1989, Neural Networks.

[35] C. Atkeson,et al. Learning arm kinematics and dynamics. , 1989, Annual review of neuroscience.

[36] T Poggio,et al. Regularization Algorithms for Learning That Are Equivalent to Multilayer Networks , 1990, Science.

[37] G. Wahba. Spline models for observational data , 1990 .

[38] Andrew W. Moore,et al. Efficient memory-based learning for robot control , 1990 .

[39] John Baillieul,et al. Resolution of kinematic redundancy , 1990 .

[40] Marcus Frean,et al. The Upstart Algorithm: A Method for Constructing and Training Feedforward Neural Networks , 1990, Neural Computation.

[41] R. Tibshirani,et al. Generalized Additive Models , 1991 .

[42] John C. Platt. A Resource-Allocating Network for Function Interpolation , 1991, Neural Computation.

[43] J. Friedman. Multivariate adaptive regression splines , 1990 .

[44] Peter J. Millington,et al. Associative reinforcement learning for optimal control , 1991 .

[45] Terence D. Sanger,et al. A tree-structured adaptive network for function approximation in high-dimensional spaces , 1991, IEEE Trans. Neural Networks.

[46] A. Georgopoulos. Higher order motor control. , 1991, Annual review of neuroscience.

[47] Andrew W. Moore,et al. Fast, Robust Adaptive Control by Learning only Forward Models , 1991, NIPS.

[48] Richard S. Sutton,et al. Iterative Construction of Sparse Polynomial Approximations , 1991, NIPS.

[49] Geoffrey E. Hinton,et al. Adaptive Mixtures of Local Experts , 1991, Neural Computation.

[50] Phillip J. McKerrow,et al. Introduction to robotics , 1991 .

[51] David W. Scott,et al. Multivariate Density Estimation: Theory, Practice, and Visualization , 1992, Wiley Series in Probability and Statistics.

[52] Jianqing Fan,et al. Variable Bandwidth and Local Linear Regression Smoothers , 1992 .

[53] E. Littmann. Generalization Abilities of Cascade Network Architectures , 1992 .

[54] Martin Casdagli,et al. Nonlinear Modeling And Forecasting , 1992 .

[55] Helge J. Ritter,et al. Generalization Abilities of Cascade Network Architecture , 1992, NIPS.

[56] L. Cooper,et al. When Networks Disagree: Ensemble Methods for Hybrid Neural Networks , 1992 .

[57] Elie Bienenstock,et al. Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.

[58] Richard S. Sutton,et al. Adapting Bias by Gradient Descent: An Incremental Version of Delta-Bar-Delta , 1992, AAAI.

[59] J. Friedman,et al. A Statistical View of Some Chemometrics Regression Tools , 1993 .

[60] Stefan Schaal,et al. Assessing the Quality of Learned Local Models , 1993, NIPS.

[61] T. Hastie,et al. Local Regression: Automatic Kernel Carpentry , 1993 .

[62] Heekuck Oh,et al. Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[63] F A Mussa-Ivaldi,et al. Adaptive representation of dynamics during learning of a motor task , 1994, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[64] Thomas Martinetz,et al. Topology representing networks , 1994, Neural Networks.

[65] Michael I. Jordan,et al. Hierarchical Mixtures of Experts and the EM Algorithm , 1994, Neural Computation.

[66] S. Schaal,et al. Robot juggling: implementation of memory-based learning , 1994, IEEE Control Systems.

[67] Ferdinando A. Mussa-Ivaldi,et al. Interference in Learning Internal Models of Inverse Dynamics in Humans , 1994, NIPS.

[68] R. J. Tibshirani,et al. Nonparametric Regression and Classification Part I—Nonparametric Regression , 1994 .

[69] Cesare Furlanello,et al. Connectionist Speaker Normalization with Generalized Resource Allocating Networks , 1994, NIPS.

[70] Bernd Fritzke,et al. Growing cell structures--A self-organizing network for unsupervised and supervised learning , 1994, Neural Networks.

[71] David J. Field,et al. What Is the Goal of Sensory Coding? , 1994, Neural Computation.

[72] Jean-Jacques E. Slotine,et al. Space-frequency localized basis function networks for nonlinear system estimation and control , 1995, Neurocomputing.

[73] Patrick van der Smagt,et al. Approximation with neural networks: between local and global approximation , 1995, Proceedings of ICNN'95 - International Conference on Neural Networks.

[74] Carl E. Rasmussen,et al. In Advances in Neural Information Processing Systems , 2011 .

[75] Joydeep Ghosh,et al. Ridge polynomial networks , 1995, IEEE Trans. Neural Networks.

[76] Gerald Sommer,et al. Dynamic Cell Structure Learns Perfectly Topology Preserving Map , 1995, Neural Computation.

[77] Mark J. L. Orr,et al. Regularization in the Selection of Radial Basis Function Centers , 1995, Neural Computation.

[78] Stefan Schaal,et al. Memory-based neural networks for robot learning , 1995, Neurocomputing.

[79] M. Kawato,et al. Internal representations of the motor apparatus: implications from generalization in visuomotor learning. , 1995, Journal of experimental psychology. Human perception and performance.

[80] Michael A. Arbib,et al. The handbook of brain theory and neural networks , 1995, A Bradford book.

[81] David G. Lowe,et al. Similarity Metric Learning for a Variable-Kernel Classifier , 1995, Neural Computation.

[82] C. Furlanello,et al. Combining local PCA and radial basis function networks for speaker normalization , 1995, Proceedings of 1995 IEEE Workshop on Neural Networks for Signal Processing.

[83] Stefan Schaal,et al. From Isolation to Cooperation: An Alternative View of a System of Experts , 1995, NIPS.

[84] D. Yeung,et al. Constructive feedforward neural networks for regression problems : a survey , 1995 .

[85] Jianqing Fan,et al. Data‐Driven Bandwidth Selection in Local Polynomial Fitting: Variable Bandwidth and Spatial Adaptation , 1995 .

[86] Volker Tresp,et al. Improved Gaussian Mixture Density Estimates Using Bayesian Penalty Terms and Network Averaging , 1995, NIPS.

[87] G. Deco,et al. An Information-Theoretic Approach to Neural Computing , 1997, Perspectives in Neural Computing.

[88] Alexander J. Smola,et al. Support Vector Method for Function Approximation, Regression Estimation and Signal Processing , 1996, NIPS.

[89] David J. Field,et al. Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[90] Harry Wechsler,et al. From Statistics to Neural Networks: Theory and Pattern Recognition Applications , 1996 .

[91] Terrence J. Sejnowski,et al. The Computational Brain , 1996, Artif. Intell..

[92] W. Cleveland,et al. Smoothing by Local Regression: Principles and Methods , 1996 .

[93] Stefan Schaal,et al. Local dimensionality reduction for locally weighted learning , 1997, Proceedings 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA'97. 'Towards New Computational Principles for Robotics and Automation'.

[94] Stefan Schaal,et al. Receptive Field Weighted Regression , 1997 .

[95] Stefan Schaal,et al. Local Dimensionality Reduction , 1997, NIPS.

[96] T. Sejnowski,et al. Irresistible environment meets immovable neurons , 1997, Behavioral and Brain Sciences.

[97] D. Wolpert,et al. Internal models in the cerebellum , 1998, Trends in Cognitive Sciences.

[98] Christopher G. Atkeson,et al. Constructive Incremental Learning from Only Local Information , 1998, Neural Computation.

[99] John Daugman,et al. Gabor wavelets for statistical pattern recognition , 1998 .

[100] Alexander J. Smola,et al. Support Vector Machine Reference Manual , 1998 .

[101] Mitsuo Kawato,et al. Internal models for motor control and trajectory planning , 1999, Current Opinion in Neurobiology.

[102] Stefan Schaal,et al. Is imitation learning the route to humanoid robots? , 1999, Trends in Cognitive Sciences.

[103] Stefan Schaal,et al. Locally Weighted Projection Regression : An O(n) Algorithm for Incremental Real Time Learning in High Dimensional Space , 2000 .

[104] Stefan Schaal,et al. Locally Weighted Projection Regression: Incremental Real Time Learning in High Dimensional Space , 2000, ICML.

[105] Mikael Johansson. Piecewise Linear Modeling , 2003 .

[106] Andrew W. Moore,et al. Locally Weighted Learning for Control , 1997, Artificial Intelligence Review.

[107] Corinna Cortes,et al. Support-Vector Networks , 1995, Machine Learning.

[108] D. L. Reilly,et al. A neural model for category learning , 1982, Biological Cybernetics.

[109] Andrew W. Moore,et al. Locally Weighted Learning , 1997, Artificial Intelligence Review.

[110] Christian Jutten,et al. A new scheme for incremental learning , 2006, Neural Processing Letters.

[111] V. Vapnik. Estimation of Dependences Based on Empirical Data , 2006 .

[112] R. Sutton. Gain Adaptation Beats Least Squares , 2006 .