Formalizing Neurath’s Ship: Approximate Algorithms for Online Causal Learning

Higher-level cognition depends on the ability to learn models of the world. We can characterize this at the computational level as a structure-learning problem with the goal of best identifying the prevailing causal relationships among a set of relata. However, the computational cost of performing exact Bayesian inference over causal models grows rapidly as the number of relata increases. This implies that the cognitive processes underlying causal learning must be substantially approximate. A powerful class of approximations that focuses on the sequential absorption of successive inputs is captured by the Neurath’s ship metaphor in philosophy of science, where theory change is cast as a stochastic and gradual process shaped as much by people’s limited willingness to abandon their current theory when considering alternatives as by the ground truth they hope to approach. Inspired by this metaphor and by algorithms for approximating Bayesian inference in machine learning, we propose an algorithmic-level model of causal structure learning under which learners represent only a single global hypothesis that they update locally as they gather evidence. We propose a related scheme for understanding how, under these limitations, learners choose informative interventions that manipulate the causal system to help elucidate its workings. We find support for our approach in the analysis of 3 experiments.
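To give a rough sense of how quickly the hypothesis space grows with the number of relata, the sketch below counts labeled directed acyclic graphs on n variables using a standard recurrence. It assumes that candidate causal models are DAGs over the relata; the function name count_dags is illustrative and does not come from the paper.

```python
from functools import lru_cache
from math import comb


@lru_cache(maxsize=None)
def count_dags(n: int) -> int:
    """Number of labeled directed acyclic graphs on n nodes.

    Uses the inclusion-exclusion recurrence
        a(n) = sum_{k=1..n} (-1)^(k+1) * C(n, k) * 2^(k*(n-k)) * a(n-k),
    with a(0) = 1, where k ranges over the size of the set of nodes
    that have no incoming edges.
    """
    if n == 0:
        return 1
    return sum(
        (-1) ** (k + 1) * comb(n, k) * 2 ** (k * (n - k)) * count_dags(n - k)
        for k in range(1, n + 1)
    )


if __name__ == "__main__":
    # Print the size of the hypothesis space for small numbers of relata.
    for n in range(1, 7):
        print(n, count_dags(n))
```

Under this assumption, three variables already admit 25 structures, four admit 543, and five admit 29,281, so maintaining a full posterior over structures becomes costly even for small systems.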
