Fairness through Causal Awareness: Learning Causal Latent-Variable Models for Biased Data

How do we learn from biased data? Historical datasets often reflect historical prejudices; sensitive or protected attributes may affect the observed treatments and outcomes. Classification algorithms tasked with predicting outcomes accurately from these datasets tend to replicate these biases. We advocate a causal modeling approach to learning from biased data, exploring the relationship between fair classification and intervention. We propose a causal model in which the sensitive attribute confounds both the treatment and the outcome. Building on prior work in deep learning and generative modeling, we describe how to learn the parameters of this causal model from observational data alone, even in the presence of unobserved confounders. We show experimentally that fairness-aware causal modeling provides better estimates of the causal effects between the sensitive attribute, the treatment, and the outcome. We further present evidence that estimating these causal effects can help learn policies that are both more accurate and fair, when presented with a historically biased dataset.

[1]  David Sontag,et al.  Why Is My Classifier Discriminatory? , 2018, NeurIPS.

[2]  Enhancing the outcomes of low-birth-weight, premature infants. A multisite, randomized trial. The Infant Health and Development Program. , 1990, JAMA.

[3]  Elias Bareinboim,et al.  Fairness in Decision-Making - The Causal Explanation Formula , 2018, AAAI.

[4]  Amos J. Storkey,et al.  Towards a Neural Statistician , 2016, ICLR.

[5]  Katrina Ligett,et al.  Penalizing Unfairness in Binary Classification , 2017 .

[6]  Volker Roth,et al.  Causal Deep Information Bottleneck , 2018, ArXiv.

[7]  Alexandra Chouldechova,et al.  Fair prediction with disparate impact: A study of bias in recidivism prediction instruments , 2016, Big Data.

[8]  Alexander M. Rush,et al.  Semi-Amortized Variational Autoencoders , 2018, ICML.

[9]  John Langford,et al.  A Reductions Approach to Fair Classification , 2018, ICML.

[10]  Michael I. Jordan,et al.  Graphical Models, Exponential Families, and Variational Inference , 2008, Found. Trends Mach. Learn..

[11]  Daan Wierstra,et al.  Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.

[12]  P. Holland Statistics and Causal Inference , 1985 .

[13]  Jure Leskovec,et al.  The Selective Labels Problem: Evaluating Algorithmic Predictions in the Presence of Unobservables , 2017, KDD.

[14]  Nathan Srebro,et al.  Equality of Opportunity in Supervised Learning , 2016, NIPS.

[15]  Suresh Venkatasubramanian,et al.  Runaway Feedback Loops in Predictive Policing , 2017, FAT.

[16]  Toniann Pitassi,et al.  Fairness through awareness , 2011, ITCS '12.

[17]  Aditya Krishna Menon,et al.  The cost of fairness in binary classification , 2018, FAT.

[18]  Robert O. Keohane,et al.  Designing Social Inquiry: Scientific Inference in Qualitative Research. , 1995 .

[19]  Alexander A. Alemi,et al.  Fixing a Broken ELBO , 2017, ICML.

[20]  Uri Shalit,et al.  Learning Representations for Counterfactual Inference , 2016, ICML.

[21]  Uri Shalit,et al.  Estimating individual treatment effect: generalization bounds and algorithms , 2016, ICML.

[22]  Sepp Hochreiter,et al.  Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.

[23]  K. Lum,et al.  To predict and serve? , 2016 .

[24]  Krishna P. Gummadi,et al.  Fairness Constraints: Mechanisms for Fair Classification , 2015, AISTATS.

[25]  Illtyd Trethowan Causality , 1938 .

[26]  T. VanderWeele,et al.  On the causal interpretation of race in regressions adjusting for confounding and mediating variables. , 2014, Epidemiology.

[27]  Matt J. Kusner,et al.  Causal Interventions for Fairness , 2018, ArXiv.

[28]  Max Welling,et al.  The Variational Fair Autoencoder , 2015, ICLR.

[29]  J. Brooks-Gunn,et al.  Effects of Early Intervention on Cognitive Function of Low Birth Weight Preterm Infants, , 1992, The Journal of pediatrics.

[30]  M. Kenward,et al.  Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls , 2009, BMJ : British Medical Journal.

[31]  D. Rubin Causal Inference Using Potential Outcomes , 2005 .

[32]  Percy Liang,et al.  Fairness Without Demographics in Repeated Loss Minimization , 2018, ICML.

[33]  Stef van Buuren,et al.  Flexible Imputation of Missing Data , 2012 .

[34]  Jennifer L. Hill,et al.  Bayesian Nonparametric Modeling for Causal Inference , 2011 .

[35]  Suresh Venkatasubramanian,et al.  The (Im)possibility of fairness , 2016, Commun. ACM.

[36]  Ilya Shpitser,et al.  Fair Inference on Outcomes , 2017, AAAI.

[37]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[38]  J. Robins,et al.  Identifiability and Exchangeability for Direct and Indirect Effects , 1992, Epidemiology.

[39]  Alexandra Chouldechova,et al.  Learning under selective labels in the presence of expert consistency , 2018, ArXiv.

[40]  A. Gelman,et al.  An Analysis of the New York City Police Department's “Stop-and-Frisk” Policy in the Context of Claims of Racial Bias , 2007 .

[41]  Matt J. Kusner,et al.  Counterfactual Fairness , 2017, NIPS.

[42]  Nathan Kallus,et al.  Residual Unfairness in Fair Machine Learning from Prejudiced Data , 2018, ICML.

[43]  Joichi Ito,et al.  Interventions over Predictions: Reframing the Ethical Debate for Actuarial Risk Assessment , 2017, FAT.

[44]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[45]  M. Sen,et al.  Race as a Bundle of Sticks: Designs that Estimate Effects of Seemingly Immutable Characteristics , 2016 .

[46]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[47]  Dustin Tran,et al.  Automatic Differentiation Variational Inference , 2016, J. Mach. Learn. Res..

[48]  Max Welling,et al.  Causal Effect Inference with Deep Latent-Variable Models , 2017, NIPS 2017.

[49]  Bernhard Schölkopf,et al.  Avoiding Discrimination through Causal Reasoning , 2017, NIPS.

[50]  Francisco J. R. Ruiz,et al.  Model Criticism for Bayesian Causal Inference , 2016, 1610.09037.

[51]  B. Singer,et al.  Causality in the Social Sciences , 1988 .