Anyone interested in practicing hands-on causal inference?

rahulB · December 18, 2022, 3:56pm

Hi All,

I was thinking of getting some hands-on experience in causal inference by applying some of the learnings on real-world datasets. There are some good open and anonymized datasets from companies available on scikit-uplift package website here. The X5_RetailHero in particular looks really interesting. Moreover, this dataset was a part of a competition held some 2 years ago.
Would anyone be interested in trying this out in the next few weeks before beginning with the new topic?

RavinKumar · December 18, 2022, 4:17pm

This is a great idea. More hands on practice will reinforce the concepts we’ve learned. I’ve been kicking around some ideas as well.

What format were you thinking?

ChadDelany · December 18, 2022, 5:35pm

I’m very interested. I’ve been looking for different datasets to do some diff-in-diff analysis on.

rahulB · December 18, 2022, 8:04pm

These datasets are obtained from randomized control experiments and as such one can simply calculate ATE = Y_1 - Y_0. But, we can treat these as observational studies and apply the causal inference methods such as matching, sub-classification, DiD etc. to see how close we can come to the real ATE. We can also compute heterogeneous treatment effects for individual users. I was thinking of

Manually computing ATE, ATT using matching methods or IPW etc.
Using the libraries dowhy and econml from Microsoft to compare the values calculated manually.
Compare methods to see which one does the best.

I do not have a strong opinion on the format. I was thinking of starting a public git repo and put my code in a folder under my name. Others can refer it, or, fork the repo and create a folder under their name and subsequently create a pull request to main. This way all code is in one place and everyone can refer. But, please feel free to suggest other methods that you think would be better.
It would be interesting to talk about the methods others apply (there is always some degree of subjectivity to causal analysis) and then talk about the results in the next 2-3 weeks time.

Let me know how this sounds and any suggestions or comments are welcome.

Topic		Replies	Views
New Bayesian Causal Inference package Causal Inference Book Club	1	351	December 20, 2022
Microsoft transitioning Causal Inference library to Open Source community Causal Inference Book Club	3	310	June 8, 2022
Interview with Scott on Jan 8th. Post your questions here Causal Inference Book Club	6	774	January 8, 2023
Causal Inference Bookshelf Causal Inference Book Club	9	601	March 30, 2024
What are you looking to get out of this? Share your thoughts! Causal Inference Book Club	18	360	June 15, 2022

Anyone interested in practicing hands-on causal inference?

Related topics