Bayesian backfitting for high-dimensional regression

Whenever a graphical model contains connections from multiple nodes to a single node, statistical inference of the model parameters may require evaluating, and possibly inverting, the covariance matrix of all variables contributing to such a fan-in, particularly in the context of regression and classification. For high-dimensional fan-ins, statistical inference can therefore become computationally expensive and numerically brittle. In this paper, we propose an EM-based estimation method that statistically decouples the inputs by introducing a hidden variable in each branch of the fan-in. As a result, the algorithm has a per-iteration complexity that is only linear in the order of the fan-in. Interestingly, the resulting algorithm can be interpreted as a probabilistic version of backfitting and, consequently, is ideally suited for applications of backfitting that require probabilities to be propagated cleanly, as in Bayesian inference. We demonstrate the effectiveness of Bayesian backfitting in dealing with extremely high-dimensional, underconstrained regression problems. In addition, we highlight its connection to probabilistic partial least squares regression, as well as its extensions to nonlinear datasets through variational Bayesian mixture of experts regression and nonparametric locally weighted learning.
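For intuition, the following is a minimal NumPy sketch of the resulting EM update for the linear case, in which each input branch carries one hidden variable and the posterior over these hidden variables distributes the fit residual across branches. It is a simplified illustration, not the full algorithm: the per-branch noise variances are held fixed rather than estimated, no priors are placed on the regression coefficients, and the function name and defaults are purely illustrative. Each iteration touches only per-dimension sufficient statistics, so its cost is linear in the number of inputs and no covariance matrix over all inputs is inverted.

```python
import numpy as np


def probabilistic_backfitting(X, y, n_iter=1000, psi_y=1.0, psi_z=None):
    """Sketch of EM-based probabilistic backfitting for y = sum_d b_d * x_d + noise.

    Each input branch d has a hidden variable z_d ~ N(b_d * x_d, psi_z[d]) and the
    output is y ~ N(sum_d z_d, psi_y).  Noise variances are assumed fixed here.
    """
    N, d = X.shape
    if psi_z is None:
        psi_z = np.ones(d)           # per-branch hidden-variable noise variances (assumed)
    b = np.zeros(d)                  # regression coefficients
    s = psi_y + psi_z.sum()          # total output variance of the fan-in
    xx = (X ** 2).sum(axis=0)        # per-dimension sufficient statistics, computed once

    for _ in range(n_iter):
        resid = y - X @ b            # residual of the current fit, O(N * d)
        # E-step (implicit): the posterior mean of z_d assigns a share psi_z[d] / s
        # of the residual to branch d.  M-step: regress these expected hidden
        # targets on x_d, which collapses to a backfitting-style coefficient update.
        b = b + (psi_z / s) * (X.T @ resid) / xx
    return b


# Toy usage on a well-conditioned synthetic problem.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 5))
b_true = np.array([1.0, -2.0, 0.5, 0.0, 3.0])
y = X @ b_true + 0.1 * rng.standard_normal(200)
print(probabilistic_backfitting(X, y, n_iter=2000).round(2))
```

Because the residual is shared out according to the per-branch variances, the update never forms or inverts the d-by-d input covariance matrix, which is the source of the linear per-iteration complexity claimed above.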