Dependency Networks for Inference, Collaborative Filtering, and Data Visualization

We describe a graphical model for probabilistic relationships--an alternative to the Bayesian network--called a dependency network. The graph of a dependency network, unlike a Bayesian network, is potentially cyclic. The probability component of a dependency network, like a Bayesian network, is a set of conditional distributions, one for each node given its parents. We identify several basic properties of this representation and describe a computationally efficient procedure for learning the graph and probability components from data. We describe the application of this representation to probabilistic inference, collaborative filtering (the task of predicting preferences), and the visualization of acausal predictive relationships.

[1]  Feller William,et al.  An Introduction To Probability Theory And Its Applications , 1950 .

[2]  J. Kiefer,et al.  An Introduction to Stochastic Processes. , 1956 .

[3]  D. Brook On the distinction between the conditional probability and the joint probability approaches in the specification of nearest-neighbour systems , 1964 .

[4]  W. H. Sewell,et al.  Social Class, Parental Encouragement, and Educational Aspirations , 1968, American Journal of Sociology.

[5]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[6]  J. Besag Statistical Analysis of Non-Lattice Data , 1975 .

[7]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  E. Fowlkes,et al.  Evaluating Logistic Models for Large Contingency Tables , 1988 .

[9]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[10]  Steffen L. Lauritzen,et al.  Independence properties of directed markov fields , 1990, Networks.

[11]  Wray L. Buntine Theory Refinement on Bayesian Networks , 1991, UAI.

[12]  Wray L. BuntineRIACS Theory Reenement on Bayesian Networks , 1991 .

[13]  H. Kyburg Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference by Judea Pearl , 1991 .

[14]  Rafael Molina,et al.  On the Bayesian Approach to Learning , 1992 .

[15]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[16]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[17]  J. Besag,et al.  Bayesian Computation and Stochastic Systems , 1995 .

[18]  Brendan J. Frey,et al.  Does the Wake-sleep Algorithm Produce Good Density Estimators? , 1995, NIPS.

[19]  Peter Green,et al.  Markov chain Monte Carlo in Practice , 1996 .

[20]  Nir Friedman,et al.  Learning Bayesian Networks with Local Structure , 1996, UAI.

[21]  David Maxwell Chickering,et al.  A Bayesian Approach to Learning Bayesian Networks with Local Structure , 1997, UAI.

[22]  David Heckerman,et al.  Models and Selection Criteria for Regression and Classification , 1997, UAI.

[23]  Volker Tresp,et al.  Nonlinear Markov Networks for Continuous Variables , 1997, NIPS.

[24]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[25]  Michael I. Jordan Graphical Models , 1998 .

[26]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[27]  B. Schölkopf,et al.  Advances in kernel methods: support vector learning , 1999 .