Suppose we have a causal model:
\[ Y = \mu + \alpha D + \eta \]
where \(D\) is the policy or treatment of interest. We are unwilling to assume that \(D\) is randomly assigned, so
\[ \mathbb{E}[\eta | D] = 0 \]
is a bad assumption. Where to next?
We can use the variation in \(D\) that is induced by \(Z_{1}\) or \(Z_{2}\) (or both) to estimate \(\alpha\)
We can use the variation in \(D\) that is induced by \(Z\) to estimate \(\alpha\)
\[ \log(Q_{D}) = \alpha_{0} - \alpha_{1}\log(P) + \eta_{D} \]
And
\[ \log(Q_{S}) = \gamma_{0} + \gamma_{1}\log(P) + \gamma_{2}Z + \eta_{S} \]
We can use the variation in \(P\) that is induced by \(Z\) to estimate \(\alpha_{1}\)
Exercise: \[ \alpha = \frac{\mathbb{C}(Y,Z)}{\mathbb{C}(D,Z)} \]
So an estimator is:
\[ \hat{\alpha} = \frac{\widehat{\mathbb{C}(Y,Z)}}{\widehat{\mathbb{C}(D,Z)}} = \frac{\sum_{n}\hat{Y}_{n}\hat{Z}_{n}}{\sum_{n}\hat{D}_{n}\hat{Z}_{n}} \]
where \(\hat{Y} = Y - \overline{Y}\).
Unbiased? Consistent? Asymptotically normal?
Exercise: \[ \alpha = \frac{\mathbb{C}(Y,Z)}{\mathbb{C}(D,Z)} = \frac{\mathbb{E}[Y|Z=1] - \mathbb{E}[Y|Z=0]}{\mathbb{E}[D|Z=1] - \mathbb{E}[D|Z=0]} \]
So one estimator is:
\[ \hat{\alpha} = \frac{\overline{Y}_{1} - \overline{Y}_{0}}{\overline{D}_{1}-\overline{D}_{0}} \]
\[ \alpha_{1} = -\frac{\mathbb{C}(\log(Q),Z)}{\mathbb{C}(\log(P),Z)} \]
Exercise: show consistent, asymptotically normal, biased.
Now we get:
\[ \mathbb{E}[Y|Z,\text{zip}] = \mu + \alpha\mathbb{E}[D|Z,\text{zip}] + \mathbb{E}[\eta|\text{zip}] \]
Giving a system:
\[ Y = \mu + \gamma\text{zip} + \alpha D + \epsilon,\qquad \mathbb{E}[\epsilon|Z_{2},\text{zip}] = 0 \]
\[ D = \pi_{0} + \pi_{1}Z_{2} + \pi_{2}\text{zip} + \varepsilon \]
And we run 2SLS on this system. We need college openings over time to identify the model.
Take the system:
\[ Y = X\beta + \eta \]
and
\[ X = Z\pi + \varepsilon \]
where \(X = [W,\ D]\) includes both controls and the treatment of interest and \(Z\) includes the controls plus all instruments.
The 2SLS estimator is:
\[ \hat{\beta}_{IV} = (\hat{\mathbf{X}}^\prime \hat{\mathbf{X}})^{-1}\hat{\mathbf{X}}^\prime \mathbf{Y} \]
with \(\hat{\mathbf{X}} = \mathbf{Z}(\mathbf{Z}'\mathbf{Z})^{-1}\mathbf{Z}'\mathbf{X} = \mathbf{Z}\hat{\pi}\)
Exercises:
Define \(\mathbb{E}[X'Z]=\mathbf{Q}_{XZ}\), \(\mathbb{E}[Z'Z]=\mathbf{Q}_{ZZ}\) and so forth. Here are the assumptions we were using for the previous exercise:
As before, A5 only matters for the calculation of standard errors.
Let \(V_{\beta} = \mathbb{V}[\hat{\beta}_{2SLS}]\). We can estimate \(V_\beta\).
Without A5 we have: \[V_\beta = \frac{1}{N}\mathbf{A}\mathbb{E}[Z'Z\eta^2]\mathbf{A}',\qquad \mathbf{A}=(\mathbf{Q}_{XZ}\mathbf{Q}_{ZZ}^{-1}\mathbf{Q}_{XZ}')^{-1}\mathbf{Q}_{XZ}\mathbf{Q}^{-1}_{ZZ} \] which we estimate as: \[\hat{V}_\beta = \frac{1}{N}\hat{\mathbf{A}}\left(\frac{1}{N}\sum_{i=1}^NZ_n'Z_n\hat{\eta}_n^2\right) \hat{\mathbf{A}}' \]
“Mother’s Education and the Intergenerational Transmission of Human Capital: Evidence from College Openings” - Currie & Moretti, 2003
The county/year effects control for many characteristics of the local area that may affect outcomes, such as the availability and quality of medical services, the local business cycle, pollution, etc. Identification in our models comes from the fact that *within* each county and year of birth of the baby, there are mothers who were seventeen before a college opening, and mothers who were seventeen after a college opening.
Finding: higher maternal ed. increases probability of marriage, increases use of prenatal care.
“The Effects of School Spending on Educational and Economic Outcomes: Evidence from School Finance Reforms” - Jackson et al, 2016
Stage 1: \[ \begin{multline} \ln(PPE_{5-17})_{idb} = \pi_1(\text{Exp}_{idb}\times\text{Dosage}_d) + \pi_2 \text{Exp}_{idb} \\ + \Pi C_{idb} + \rho_d + \rho_b + \zeta_{idb} \nonumber \end{multline} \]
Stage 2: \[ Y_{idb} = \delta \widehat{\ln(PPE_{5-17})_{idb}} + \Phi C_{idb} + \theta_d + \theta_b + \varepsilon_{idb} \]
Finding: a 10% increase in spending in each of 12 years leads to:
Much larger for children of poor families: 25% increase can eliminate educational attainment gap between poor and non-poor.