NSW study code replication?

rahulB · August 2, 2022, 11:05pm

Has anyone tried to replicate the steps for the NSW study? I am not able to get the propensity scores in the same range as shown in the table 5.15. I am getting the following instead.

Also, the Figure 5.3 comes out to be like this

Let me know if anyone was able to replicate it.

Thanks

rahulB · August 3, 2022, 5:41pm

Do you think the number of Treatment Vs non-treatment samples should also balanced. Though the book does not talk about it with respect to the NSW study. In this case the CPS samples are far greater in number when compared to the NSW study and a very small proportion of the individuals would be considered eligible to be included in the NSW program (so, mostly non-treated units). So, does this cause the model to predict almost everyone in the joint dataframe as belonging to the control group - just because of the sheer number of non-treated units?

RavinKumar · August 4, 2022, 2:18am

I havent run the code but checking the additional python reference I’m using when reading this book your histograms look similar. My guess is just that the bin size is different leading to very different plots.

github.com

tomcaputo/mixtape_learnr/blob/main/Python/Matching_and_Subclassification.ipynb

{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Welcome\n",
    "\n",
    "This is material for the **Matching and Subclassification** chapter in Scott Cunningham's book, [Causal Inference: The Mixtape.](https://mixtape.scunning.com/)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {},
   "outputs": [],
   "source": [
    "import pandas as pd\n",
    "import numpy as np\n",
    "import plotnine as p\n",

This file has been truncated. show original

That’s not how regression should work with OLS. OLS regression should return the same coefficients.

rahulB · August 4, 2022, 11:04pm

Thanks for your answer and sharing the git repo. I do have a questions though:

The range of propensity scores for the notebook you shared and what I got are roughly the same. But, both of these are very different from The mixtape book. Am I missing anything?

RavinKumar · August 4, 2022, 11:19pm

I noticed that as well. To be honest I also don’t know, i don’t know how to effectively read Stata code alone run it…

Sorry I can’t give you a better answer. I hope someone else able to chime in

Topic		Replies	Views
When propensity scores might not be the right tool Matching and Subclassification	1	307	September 24, 2022
Matching on pre-treatment outcome variable Matching and Subclassification	1	257	November 16, 2022
Additional free python resources on causal inference Causal Inference Book Club	0	305	July 21, 2022
Prob and Regression Hands on Exercse Probability and Regression Review	0	251	June 27, 2022
Introduction Chapter Livestream Details and Q&A Introduction	8	672	June 14, 2022

NSW study code replication?

Related topics