DiD: allow for multiple pre and post intervention observations #76

drbenvincent · 2022-11-19T13:51:02Z

… a named variable

drbenvincent · 2022-11-19T19:26:57Z

Model formula will be bib ~ 1 + district + year + district:treated

drbenvincent · 2022-11-19T21:22:12Z

At this point it seems that model fitting does work, although we'll have to wait for the visualisation to see if the results make sense.

Plotting is going to get more complex. The initial plot that I had only makes sense when we have two time points (pre/post) AND we have multiple observed units per group.

So we've potentially got a 2*2 grid of different plot types to produce

	pre/post only	multiple time points
one unit observed per group
multiple units per group	existing plot

drbenvincent · 2022-11-19T21:32:49Z

fix green arrow for causal impact
fix plotting of data

drbenvincent · 2022-11-19T21:34:27Z

The magnitude of the causal impact is wrong. I think this might be fixed by enforcing an order on the levels of the groups.

drbenvincent · 2022-11-20T12:32:31Z

Plotting now works for the original dataset (multiple units in the treatment and control conditions) and the new banking example (one unit in the treatment and one in the control condition).

Although the aesthetics could do with some work, the priority is to focus on getting it working in the case where we have more observations over time, not just the pre/post times.

And the inferences are particularly bad. But this is because we are just using whatever default priors at the moment and because we only have one observation per condition. This is not an ideal scenario as a lot rides on the sigma parameter. But this can be worked on when we use Bambi (see #22).

Original dataset

Banking dataset

drbenvincent · 2022-11-20T12:59:26Z

Currently sampling from the posterior works with

result = DifferenceInDifferences(
    df_long,
    formula="bib ~ 1 + district + year + district:treated",
    time_variable_name="year",
    group_variable_name="district",
    treated="Sixth District",
    untreated="Eighth District",
    prediction_model=LinearRegression()
    )

But breaks when we get to doing the other stuff and expected. This is the next step.

drbenvincent · 2022-12-25T18:40:21Z

Currently working in the did_multiple_observations branch.

drbenvincent · 2022-12-25T21:34:06Z

At this point we have DiD working for:

Single pre and post treatment observation (although we have some shape issues, calculating posterior predictions and counterfactuals for each item)

But there are clearly some issues to be resolved for the banks dataset.

Multiple pre and post treatment observations.

Looks like this for the full banks dataset

drbenvincent · 2022-12-25T22:03:46Z

In the banks dataset, we are getting 1 surplus degree of freedom. I think it is because the group is coded as a string (therefore treated as a category) rather than as a numerical 0/1.

drbenvincent · 2022-12-26T16:59:18Z

I've made meaningful improvements to DiD at this point. There were a number of things about the code before which were muddled and a bit wrong. I've fixed those up, made the code cleaner, got DiD working for multiple pre and post treatment observations, and improved the plotting.

drbenvincent added enhancement New feature or request plotting Improve or fix plotting outputs Quantitative outputs of the model labels Nov 19, 2022

drbenvincent added a commit that referenced this issue Nov 19, 2022

#76 #44 add data + notebook with data import + time_variable_name now…

1831dc4

… a named variable

drbenvincent added a commit that referenced this issue Nov 19, 2022

#76 #44 progress on banking example data processing

10a9c82

drbenvincent added a commit that referenced this issue Nov 19, 2022

#76 #44 add banking notebooks to examples.rst

d348ee8

drbenvincent added a commit that referenced this issue Nov 19, 2022

#76 #44 DID now works generalises to custom varnames + level values

7840611

drbenvincent added a commit that referenced this issue Nov 19, 2022

#76 #44 start to fix did plot

90cc898

drbenvincent added a commit that referenced this issue Nov 20, 2022

#76 improve DID plotting + improve data pre-processing

b1310a6

drbenvincent added a commit that referenced this issue Nov 20, 2022

#76 add input validation

e33ce25

drbenvincent added a commit that referenced this issue Nov 23, 2022

#76 #44 tweaks to DID banks example

648c498

drbenvincent mentioned this issue Nov 23, 2022

Improvements to DID #85

Merged

drbenvincent mentioned this issue Dec 4, 2022

Add examples for 'classic' causal inference datasets #44

Open

13 tasks

drbenvincent added this to the Stabilise the feature set milestone Dec 5, 2022

drbenvincent added a commit that referenced this issue Dec 25, 2022

#76 DiD now 'works' for multiple pre/post treatment observations

a0faff9

drbenvincent added a commit that referenced this issue Dec 25, 2022

#76 DiD tests now pass

0ab2fcf

drbenvincent added a commit that referenced this issue Dec 25, 2022

#76 add full banks example to the integration tests

bcf38f9

drbenvincent added a commit that referenced this issue Dec 26, 2022

#76 fix tests which strangely warn locally but fail remotely

0ed7240

drbenvincent added a commit that referenced this issue Dec 26, 2022

#76 stop evaluating for multiple units per time point

e3bc8cd

drbenvincent added a commit that referenced this issue Dec 26, 2022

#76 improve DiD plotting + rerun notebooks

36b4511

drbenvincent mentioned this issue Dec 26, 2022

DiD: allow multiple pre/post intervention observations + correctness fixes #140

Merged

drbenvincent closed this as completed in #140 Dec 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DiD: allow for multiple pre and post intervention observations #76

DiD: allow for multiple pre and post intervention observations #76

drbenvincent commented Nov 19, 2022 •

edited

Loading

drbenvincent commented Nov 19, 2022

drbenvincent commented Nov 19, 2022

drbenvincent commented Nov 19, 2022 •

edited

Loading

drbenvincent commented Nov 19, 2022

drbenvincent commented Nov 20, 2022 •

edited

Loading

drbenvincent commented Nov 20, 2022

drbenvincent commented Dec 25, 2022

drbenvincent commented Dec 25, 2022

drbenvincent commented Dec 25, 2022

drbenvincent commented Dec 26, 2022

DiD: allow for multiple pre and post intervention observations #76

DiD: allow for multiple pre and post intervention observations #76

Comments

drbenvincent commented Nov 19, 2022 • edited Loading

Bank failure dataset/example + robustifying

Classic 2x2 DID

'Extended' DID with more than 2 observed time points

drbenvincent commented Nov 19, 2022

drbenvincent commented Nov 19, 2022

drbenvincent commented Nov 19, 2022 • edited Loading

drbenvincent commented Nov 19, 2022

drbenvincent commented Nov 20, 2022 • edited Loading

drbenvincent commented Nov 20, 2022

drbenvincent commented Dec 25, 2022

drbenvincent commented Dec 25, 2022

drbenvincent commented Dec 25, 2022

drbenvincent commented Dec 26, 2022

drbenvincent commented Nov 19, 2022 •

edited

Loading

drbenvincent commented Nov 19, 2022 •

edited

Loading

drbenvincent commented Nov 20, 2022 •

edited

Loading