You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Ask your question
I couldn't find information on whether dowhy supports estimation with certain carefully chosen missing pieces of data. Consider the "voucher as instrumental variable" example from the tutorial. The problem assumes each row in the dataframe has values for all 3 observed variables (voucher, education, income). However, the estimand is
### Estimand : 1
Estimand name: iv
Estimand expression:
⎡ -1⎤
⎢ d ⎛ d ⎞ ⎥
E⎢──────────(income)⋅⎜──────────([education])⎟ ⎥
⎣d[voucher] ⎝d[voucher] ⎠ ⎦
Estimand assumption 1, As-if-random: If U→→income then ¬(U →→{voucher})
Estimand assumption 2, Exclusion: If we remove {voucher}→{education}, then ¬({voucher}→income)
which means if I can estimate the association of voucher -> income and voucher -> education, I should be able to estimate the effect of education on income.
Expected behavior
If I construct a dataset as follows -
Estimand type: EstimandType.NONPARAMETRIC_ATE
### Estimand : 1
Estimand name: backdoor
No such variable(s) found!
### Estimand : 2
Estimand name: iv
Estimand expression:
⎡ -1⎤
⎢ d ⎛ d ⎞ ⎥
E⎢──────────(income)⋅⎜──────────([education])⎟ ⎥
⎣d[voucher] ⎝d[voucher] ⎠ ⎦
Estimand assumption 1, As-if-random: If U→→income then ¬(U →→{voucher})
Estimand assumption 2, Exclusion: If we remove {voucher}→{education}, then ¬({voucher}→income)
### Estimand : 3
Estimand name: frontdoor
No such variable(s) found!
*** Causal Estimate ***
## Identified estimand
Estimand type: EstimandType.NONPARAMETRIC_ATE
### Estimand : 1
Estimand name: iv
Estimand expression:
⎡ -1⎤
⎢ d ⎛ d ⎞ ⎥
E⎢──────────(income)⋅⎜──────────([education])⎟ ⎥
⎣d[voucher] ⎝d[voucher] ⎠ ⎦
Estimand assumption 1, As-if-random: If U→→income then ¬(U →→{voucher})
Estimand assumption 2, Exclusion: If we remove {voucher}→{education}, then ¬({voucher}→income)
## Realized estimand
Realized estimand: Wald Estimator
Realized estimand type: EstimandType.NONPARAMETRIC_ATE
Estimand expression:
⎡ d ⎤
E⎢────────(income)⎥
⎣dvoucher ⎦
──────────────────────
⎡ d ⎤
E⎢────────(education)⎥
⎣dvoucher ⎦
Estimand assumption 1, As-if-random: If U→→income then ¬(U →→{voucher})
Estimand assumption 2, Exclusion: If we remove {voucher}→{education}, then ¬({voucher}→income)
Estimand assumption 3, treatment_effect_homogeneity: Each unit's treatment ['education'] is affected in the same way by common causes of ['education'] and ['income']
Estimand assumption 4, outcome_effect_homogeneity: Each unit's outcome ['income'] is affected in the same way by common causes of ['education'] and ['income']
Target units: ate
## Estimate
Mean value: nan
Version information:
DoWhy version [e.g. 0.12]
Additional context
I'm new to both the theory of structural causal modeling as well as the implementation in dowhy. Perhaps I'm wrong about this being estimable or perhaps there is a way to specify "unknown" variables in dowhy that I'm not aware of. Please forgive me in either case.
The text was updated successfully, but these errors were encountered:
While trying to understand the source code, I found the place where I believe we should be able to mask out missing values. I created a PR as a proof of concept - #1296
Ask your question
I couldn't find information on whether dowhy supports estimation with certain carefully chosen missing pieces of data. Consider the "voucher as instrumental variable" example from the tutorial. The problem assumes each row in the dataframe has values for all 3 observed variables (voucher, education, income). However, the estimand is
which means if I can estimate the association of voucher -> income and voucher -> education, I should be able to estimate the effect of education on income.
Expected behavior
If I construct a dataset as follows -
I expect us to be able to run the estimation procedure and get the correct result
But the estimate is nan
Version information:
Additional context
I'm new to both the theory of structural causal modeling as well as the implementation in dowhy. Perhaps I'm wrong about this being estimable or perhaps there is a way to specify "unknown" variables in dowhy that I'm not aware of. Please forgive me in either case.
The text was updated successfully, but these errors were encountered: