Preserving Causal Constraints in Counterfactual ... · Preserving Causal Constraints in...

Preserving Causal Constraints in Counterfactual Explanations for Machine Learning ClassifiersDivyat Mahajan1; Chenhao Tan2; Amit Sharma1

1Microsoft Research, 2University of Colorado Boulder

“Do the right thing”: machine learning and causal inference for improved decision making

Vancouver, Canada

NeurIPS 2019 Workshop

[1] Amit Dhurandhar, Pin-Yu Chen, Ronny Luss, Chun-Chen Tu, Paishun Ting, Karthikeyan Shanmugam, and Payel Das. Explanations based on the missing: Towards contrastive explanations with pertinent negatives. In Advances in Neural Information Processing Systems, pages 592–603, 2018

References

- Counterfactual Explanations promise to provide faithful

and actionable explanations for ML classifiers

- Actionability of counterfactual explanations rests on

preserving certain feasibility constraints

Feasibility of Counterfactual Explanations

- Knowledge of Structural Causal Models might be

impractical in real life datasets

- Oracle implicitly models the constraint and provides a

black box access via feasibility score

- Oracle can represent user feedback to preserve

user specific / local constraints

- Oracle could be used to represent complex global

constraints which are hard to optimize directly

- Score corresponding to Labelled CFs (𝑞𝑐𝑓) via Oracle:

OracleGenCF: e−(𝑥𝑐𝑓 – 𝑞𝑐𝑓 )𝑇(𝑥𝑐𝑓 – 𝑞𝑐𝑓 )

Causal Connection to Feasibility

Generative Modelling of CF Explanations

BaseGenCF: Variational Inference based loss

AEGenCF: BaseGenCF + Reconstruction Loss on CF via

Auto Encoder [1]

SCMGenCF: BaseGenCF + Causal Proximity Regularizer

ModelApproxGenCF: BaseGenCF + Constraint Based

OracleGenCF: BaseGenCF + Loss with CFs labelled via

Oracle

Our Approaches- Global Feasibility:

A counterfactual explanation < 𝑥𝑐𝑓, 𝑦𝑐𝑓 > is globally

feasible if it is valid ( 𝑦𝑐𝑓 = 𝑦′ ) and changes from

𝑥 to 𝑥𝑐𝑓 satisfies all the constraints given by the

underlying causal model

- We can use the causal knowledge to define a better

notion of Distance to preserve constraints (SCMGenCF)

𝐷𝑖𝑠𝑡𝐶𝑎𝑢𝑠𝑎𝑙 𝑥𝑣 , 𝑥𝑣𝑐𝑓

= 𝐷𝑖𝑠𝑡𝑎𝑛𝑐𝑒 𝑥𝑣𝑐𝑓 , 𝑓 𝑥𝑣𝑝1

𝑐𝑓, … . , 𝑥𝑣𝑝𝑘𝑐𝑓

where 𝑣𝑝1, . . , 𝑣𝑝𝑘 are the direct causes of

𝑣 and 𝑓 represents the ML Classifier to be explained

Preserving Feasibility via Oracle Results

- Simple-BN: Synthetic dataset with monotonic

constraint

- Sangiovese: Bayesian Network with monotonic

constraint

- Adult: Real World dataset with unary and monotonic

constraint

- Evaluation Metrics:

- Validity, Proximity, Constraint Feasibility, Causal

Edge Score, Causal Graph Score

- Variational Inference based approach: - Encoder 𝑞(𝑧|𝑥, 𝑦’) embeds data point into the latent space

- Decoder 𝑝(𝑥𝑐𝑓|𝑧, 𝑦’) generates the counterfactual in class 𝑦’ from

the latent encoding

- Learn the encoder and decoder by minimizing the following loss:

𝑚𝑖𝑛 Е𝑞(𝑧|𝑥, 𝑦′) [𝐷𝑖𝑠𝑡𝑎𝑛𝑐𝑒(𝑥, 𝑥𝑐𝑓) + 𝜆 ∗

𝐻𝑖𝑛𝑔𝑒𝐿𝑜𝑠𝑠(𝑓(𝑥𝑐𝑓), 𝑦’, β) ] + 𝐾𝐿(𝑞(𝑧|𝑥, 𝑦’)||𝑝(𝑧))

Our approach scales

better with data points

as compared to the

state of the art [1]

Preserving Causal Constraints in Counterfactual ... · Preserving Causal Constraints in...

Documents

Introducing Counterfactual Causal Inference

How, whether, why: Causal judgments as counterfactual

Alternative realities: Counterfactual historical fiction ...shura.shu.ac.uk/19154/1/Raghunath_2017_PhD_AlternativeRealities... · Alternative Realities: Counterfactual Historical

3 Causal Models Part II: Counterfactual Theory and Traditional Approaches to Confounding (Bias?) Confounding, Identifiability, Collapsibility and Causal

Counterfactual Dependence and Arrow - PhilSci-Archivephilsci-archive.pitt.edu › 10841 › 1 › Counterfactual... · Counterfactual Dependence and Arrow Thomas Kroedel Humboldt

The Role of Counterfactual Reasoning in Causal Judgements

Counterfactual Imagination

5.1.2 counterfactual framework

Reliable Decision Support using Counterfactual Models · Reliable Decision Support using Counterfactual Models ... and propose the counterfactual GP (CGP), a ... In this paper,

Counterfactual Spaces

Superexplanations for Counterfactual Knowledge

Counterfactual Thinkingschaller/Psyc590... · Counterfactual thinking has a net benefit for the individual. To begin with, Counterfactual thinking is activated by negative af-fect

Causal inference in statistics: An overviewapophenia.wdfiles.com/local--files/start/pearl_2009_causal_inference… · 1. Counterfactual analysis 2. Nonparametric structural equations

Counterfactual Desirability 4th Revision2

Causal Structuralism, Dispositional Actualism, and ... · Causal Structuralism, Dispositional Actualism, and Counterfactual Conditionals Antony Eagle Forthcoming in Toby Handﬁeld

The role of learning data in causal reasoning about ......Causal reasoning about observations and interventions 251 the factual reading—i.e., a counterfactual intervention), you

Introduction to Causal Inference\Confounding bias" 5/68. World War II Abraham Wald ... The observed outcome is the counterfactual corresponding to the treatment you did indeed take

Causal inference and counterfactual prediction in machine

DOES MARRIAGE REDUCE CRIME? A COUNTERFACTUAL …faculty.washington.edu/matsueda/courses/517/...DOES MARRIAGE REDUCE CRIME? A COUNTERFACTUAL APPROACH TO WITHIN-INDIVIDUAL CAUSAL EFFECTS

Causal inference in statistics: An overview · 1. Counterfactual analysis 2. Nonparametric structural equations 3. Graphical models 4. Symbiosis between counterfactual and graphical