\[ \DeclareMathOperator{\E}{E} \DeclareMathOperator{\mean}{mean} \DeclareMathOperator{\Var}{Var} \DeclareMathOperator{\Cov}{Cov} \DeclareMathOperator{\Cor}{Cor} \DeclareMathOperator{\Bias}{Bias} \DeclareMathOperator{\MSE}{MSE} \DeclareMathOperator{\RMSE}{RMSE} \DeclareMathOperator{\sd}{sd} \DeclareMathOperator{\se}{se} \DeclareMathOperator{\rank}{rank} \DeclareMathOperator*{\argmin}{arg\,min} \DeclareMathOperator*{\argmax}{arg\,max} \newcommand{\Mat}[1]{\boldsymbol{#1}} \newcommand{\Vec}[1]{\boldsymbol{#1}} \newcommand{\T}{'} \newcommand{\distr}[1]{\mathcal{#1}} \newcommand{\dnorm}{\distr{N}} \newcommand{\dmvnorm}[1]{\distr{N}_{#1}} \newcommand{\dt}[1]{\distr{T}_{#1}} \newcommand{\cia}{\perp\!\!\!\perp} \DeclareMathOperator*{\plim}{plim} \]

Chapter 11 Regression Discontinuity

Summary: If there are thresholds whereby some observations receive the treatment above it, other those below it do not, and those immediately above or below that threshold are similar, we can use the difference of the outcome between those just above and those just below the threshold to estimate the causal effect of the treatment.

Suppose there is a running variable \(x\) such that any person receives the treatment, \(d\) if \(x \geq a\) and does not if \(x \leq a\), \[ d = \begin{cases} 1 & x \geq a \\ 0 & x < a \end{cases} \]

A simple regression discontinuity model is, \[ \begin{aligned}[t] y_i = \alpha + \beta x_i + \tau d_i + \gamma x_i d_i + \epsilon_i \end{aligned} \] The local causal effect of the treatment at the discontinuity is \(\tau\).

Fake Example of a Regression Discontinuity. The difference at the threshold (50) is the effect of the treatment.

Figure 11.1: Fake Example of a Regression Discontinuity. The difference at the threshold (50) is the effect of the treatment.

However, there are several choices

  • Functional form of the trends before and after the discontinuity
  • The size of the window of observations before and after the trend which to compare.

How to choose?

  • parametric: chooses specific functional forms
  • non-parametric: uses flexible forms, and chooses a bandwidth (Imbens and Kalyanaraman 2011)

Sharp vs. Fuzzy Discontinuity?

  • Sharp: the assignment of the treatment occurs with certainty at the threshold.
  • Fuzzy: the assignment of the treatment occurs only probabilistically at the threshold.

Suppose that the causal effect of treatment \(T \in \{0, 1\}\) on unit \(i\) is \(\tau_i = Y_i(1) - Y_i(0)\) where \(Y_i(1)\) is the potential outcome of \(i\) under the treatment and \(Y_i(0)\) is the potential outcome of \(i\) under the control. If potential outcomes are distributed smoothly at the cut-point \(c\), then the average causal effect of the treatment at the cut-point, \(Z_i = c\): \[ \tau_{RD} = \E[Y_{i}(1) - Y_i(0)| Z_i = c] = \lim_{Z_i \downarrow c}\E[Y_{i}(1) | Z_i = c] - \lim_{Z_i \uparrow c}\E[Y_i(0)| Z_i = c] \]

An advantage of RD designs is that unlike selection on observables or IV, its identifying assumptions are more observable and testable.

There are two basic tests (Lee and Lemieux (2010)):

  1. Continuity of pre-treatment covariates. E.g. density test of McCrary (2008). Whether the ratio of treated to control units departs from chance. A difficulty is that balance only holds in the limit, and covariance balance may still be present in finite samples.

  2. Irrelevance of covariates to the treatment-outcome relationship. There should be no systematic association between covariates and treatment, so controlling for them shouldn’t affect the estimates.

11.1 Examples

  • Thistlethwaite and Campbell (1960) was the first example of RD.

    • Outcome: Career choices in teaching
    • Running variable: PSAS scores
    • Cutoff: receiving National Merit Finalist
    • Discussed: Angrist and Pischke (2014 Ch 4)
  • Carpenter and Dobkin (2011), Carpenter and Dobkin (2009)

    • Running variable: age
    • Cutoff: ability to drink alcohol legally
    • Outcome: Death, accidents
    • Discussed: Angrist and Pischke (2014 Ch 4)
  • Abdulkadiroğlu, Angrist, and Pathak (2014)

    • Running variable: exam score
    • Cutoff: above threshold receive an offer from a school. This is fuzzy since not all those who receive the offer attend.
    • Outcome: Educational outcomes
    • Discussed: Angrist and Pischke (2014 Ch 4)
  • Eggers and Hainmueller (2009)

    • units: UK MPs
    • outcome: personal wealth
    • treatment: winning an election (holding office)
    • running variable: vote share
  • Litschig and Morrison (2013)

    • units: Brazilian municipalities
    • outcome: education, literacy, poverty rate
    • treatment: receiving a cash transfer from the central government (there are population cutoffs)
    • running variable: population
  • Gelman and Hill (2007, 213–17)

    • units: US Congressional members
    • outcome: ideology of representative
    • treatment: winning election
    • running variable: vote share
  • Gelman and Katz (2007), Gelman and Hill (2007, 232)

    • units: patients
    • outcome: length of hospital stay
    • treatment: new surgery method
    • cutoff: not performed on those over 80
    • running variable: age
    1. Also see derived examples in Bailey (2016 Ex. 6.3). See Button (2015) for a replication.
    • units: congressional districts
    • outcome: ideology of nominees
    • treatment: election
    • running variable: vote share
  • Jacob and Lefgren (2004)

    • units: students
    • outcome: education achievement
    • treatment: summer school, retention
    • running variable: standardized test

11.2 Example: Close Elections

A common use of RD in political science and econ is election outcomes. In this case the “treatment” is winning the election; it is applied to the candidate whose vote exceeds the threshold of 50%, but not to candidates arbitrarily below that threshold. Thus “close” elections are a common use of RD designs. This design was formalized in Lee (2008).

Several papers question whether close elections satisfy the assumptions of RD:

  • Caughey and Sekhon (2011) look at US House elections (1942-2008). They find that close elections are more imbalanced. They attribute this to national partisan waves.
  • Grimmer et al. (2011) look at all US House elections 1880-2008. They find that structurally advantaged candidates (strong party, incumbents) are more likely to win close elections.

The ways in which close elections can be non-random are lawsuit challenges and fraud.

Eggers et al. (2014) addresses these concerns with a systematic review of 40,000 close elections: “U.S. House in other time periods, statewide, state legislative, and mayoral races in the U.S. and national or local elections in nine other countries” Only the US House appears to have these issues.

11.3 Software

See the R packages

  • rddtools: a new and fairly complete package of regression discontinuity from primary data viz to other tests.
  • rdd
  • rdrobust: Tools for data-driven graphical and analytical statistical inference in RD.
  • rdpower: Calculate power for RD designs.
  • rdmulti: Analyze designs with multiple cutoffs.

See entries in the Econometrics task view.

11.4 References

Textbooks and Reviews:

  • Angrist and Pischke (2014 Ch. 4)
  • Gelman and Hill (2007 Sec. 10.4)
  • Bailey (2016 Ch. 11)
  • Linden, Adams, and Roberts (2006) for applications to medicine
  • Hahn, Todd, and Klaauw (2001) An early review of RD in economics

Methods:

  • Imbens and Kalyanaraman (2011) propose an optimal bandwidth selection method

References

Imbens, G., and K. Kalyanaraman. 2011. “Optimal Bandwidth Choice for the Regression Discontinuity Estimator.” The Review of Economic Studies 79 (3). Oxford University Press (OUP): 933–59. https://doi.org/10.1093/restud/rdr043.

Lee, David S, and Thomas Lemieux. 2010. “Regression Discontinuity Designs in Economics.” Journal of Economic Literature 48 (2). American Economic Association: 281–355. https://doi.org/10.1257/jel.48.2.281.

Thistlethwaite, Donald L., and Donald T. Campbell. 1960. “Regression-Discontinuity Analysis: An Alternative to the Ex Post Facto Experiment.” Journal of Educational Psychology 51 (6): 309–17. http://offcampus.lib.washington.edu/login?url=http://search.ebscohost.com/login.aspx?direct=true&db=pdh&AN=1962-00061-001&site=ehost-live.

Angrist, Joshua D., and Jörn-Steffen Pischke. 2009. Mostly Harmless Econometrics: An Empiricist’s Companion. Pr.

2014. Mastering ‘Metrics. Princeton UP.

Carpenter, Christopher, and Carlos Dobkin. 2011. “The Minimum Legal Drinking Age and Public Health.” Journal of Economic Perspectives 25 (2). American Economic Association: 133–56. https://doi.org/10.1257/jep.25.2.133.

Carpenter, Christopher, and Carlos Dobkin. 2009. “The Effect of Alcohol Consumption on Mortality: Regression Discontinuity Evidence from the Minimum Drinking Age.” American Economic Journal: Applied Economics 1 (1). American Economic Association: 164–82. https://doi.org/10.1257/app.1.1.164.

Abdulkadiroğlu, Atila, Joshua Angrist, and Parag Pathak. 2014. “The Elite Illusion: Achievement Effects at Boston and New York Exam Schools.” Econometrica 82 (1). The Econometric Society: 137–96. https://doi.org/10.3982/ecta10266.

Eggers, Andrew C., and Jens Hainmueller. 2009. “MPs for Sale? Returns to Office in Postwar British Politics.” American Political Science Review 103 (04). Cambridge University Press (CUP): 513–33. https://doi.org/10.1017/s0003055409990190.

Litschig, Stephan, and Kevin M. Morrison. 2013. “The Impact of Intergovernmental Transfers on Education Outcomes and Poverty Reduction.” American Economic Journal: Applied Economics 5 (4). American Economic Association: 206–40. https://doi.org/10.1257/app.5.4.206.

Gelman, Andrew, and Jennifer Hill. 2007. Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge.

Gelman, Andrew, and Jonathan M. Katz. 2007. “Moderation in the Pursuit of Moderation Is No Vice: The Clear but Limited Advantages to Being a Moderate for Congressional Elections.” http://www.stat.columbia.edu/~gelman/research/unpublished/moderation5.pdf.

Bailey, Michael A. 2016. Real Stats: Using Econometrics for Political Science and Public Policy. Oxford University Press.

Button, Patrick. 2015. “A Replication of ‘Do Voters Affect or Elect Policies? Evidence from the U.S. House’ (The Quarterly Journal of Economics, 2004).” Working Paper 1518. Tulane University Department of Economics. http://econ.tulane.edu/RePEc/pdf/tul1518.pdf.

Jacob, Brian A., and Lars Lefgren. 2004. “Remedial Education and Student Achievement: A Regression-Discontinuity Analysis.” Review of Economics and Statistics 86 (1). MIT Press - Journals: 226–44. https://doi.org/10.1162/003465304323023778.

Lee, David S. 2008. “Randomized Experiments from Non-Random Selection in U.S. House Elections.” Journal of Econometrics 142 (2). Elsevier BV: 675–97. https://doi.org/10.1016/j.jeconom.2007.05.004.

Caughey, Devin, and Jasjeet S. Sekhon. 2011. “Elections and the Regression Discontinuity Design: Lessons from Close U.S. House Races, 1942-2008.” Political Analysis 19 (4). [Oxford University Press, Society for Political Methodology]: 385–408. http://www.jstor.org/stable/41403727.

Grimmer, Justin, Eitan Hersh, Brian Feinstein, and Daniel Carpenter. 2011. “Are Close Elections Random?” http://web.stanford.edu/~jgrimmer/CEF.pdf.

Eggers, Andrew C., Anthony Fowler, Jens Hainmueller, Andrew B. Hall, and James M. Snyder. 2014. “On the Validity of the Regression Discontinuity Design for Estimating Electoral Effects: New Evidence from over 40,000 Close Races.” American Journal of Political Science 59 (1). Wiley-Blackwell: 259–74. https://doi.org/10.1111/ajps.12127.

Linden, Ariel, John L. Adams, and Nancy Roberts. 2006. “Evaluating Disease Management Programme Effectiveness: An Introduction to the Regression Discontinuity Design.” Journal of Evaluation in Clinical Practice 12 (2). Wiley-Blackwell: 124–31. https://doi.org/10.1111/j.1365-2753.2005.00573.x.

Hahn, Jinyong, Petra Todd, and Wilbert Van der Klaauw. 2001. “Identification and Estimation of Treatment Effects with a Regression-Discontinuity Design.” Econometrica 69 (1). [Wiley, Econometric Society]: 201–9. http://www.jstor.org/stable/2692190.