Chapter 11 Regression Discontinuity
Summary: If there are thresholds whereby some observations receive the treatment above it, other those below it do not, and those immediately above or below that threshold are similar, we can use the difference of the outcome between those just above and those just below the threshold to estimate the causal effect of the treatment.
Suppose there is a running variable \(x\) such that any person receives the treatment, \(d\) if \(x \geq a\) and does not if \(x \leq a\), \[ d = \begin{cases} 1 & x \geq a \\ 0 & x < a \end{cases} \]
A simple regression discontinuity model is, \[ \begin{aligned}[t] y_i = \alpha + \beta x_i + \tau d_i + \gamma x_i d_i + \epsilon_i \end{aligned} \] The local causal effect of the treatment at the discontinuity is \(\tau\).
However, there are several choices
 Functional form of the trends before and after the discontinuity
 The size of the window of observations before and after the trend which to compare.
How to choose?
 parametric: chooses specific functional forms
 nonparametric: uses flexible forms, and chooses a bandwidth (Imbens and Kalyanaraman 2011)
Sharp vs. Fuzzy Discontinuity?
 Sharp: the assignment of the treatment occurs with certainty at the threshold.
 Fuzzy: the assignment of the treatment occurs only probabilistically at the threshold.
Suppose that the causal effect of treatment \(T \in \{0, 1\}\) on unit \(i\) is \(\tau_i = Y_i(1)  Y_i(0)\) where \(Y_i(1)\) is the potential outcome of \(i\) under the treatment and \(Y_i(0)\) is the potential outcome of \(i\) under the control. If potential outcomes are distributed smoothly at the cutpoint \(c\), then the average causal effect of the treatment at the cutpoint, \(Z_i = c\): \[ \tau_{RD} = \E[Y_{i}(1)  Y_i(0) Z_i = c] = \lim_{Z_i \downarrow c}\E[Y_{i}(1)  Z_i = c]  \lim_{Z_i \uparrow c}\E[Y_i(0) Z_i = c] \]
An advantage of RD designs is that unlike selection on observables or IV, its identifying assumptions are more observable and testable.
There are two basic tests (Lee and Lemieux (2010)):
Continuity of pretreatment covariates. E.g. density test of McCrary (2008). Whether the ratio of treated to control units departs from chance. A difficulty is that balance only holds in the limit, and covariance balance may still be present in finite samples.
Irrelevance of covariates to the treatmentoutcome relationship. There should be no systematic association between covariates and treatment, so controlling for them shouldn’t affect the estimates.
11.1 Examples
Thistlethwaite and Campbell (1960) was the first example of RD.
 Outcome: Career choices in teaching
 Running variable: PSAS scores
 Cutoff: receiving National Merit Finalist
 Discussed: Angrist and Pischke (2014 Ch 4)
 Outcome: Career choices in teaching
Carpenter and Dobkin (2011), Carpenter and Dobkin (2009)
 Running variable: age
 Cutoff: ability to drink alcohol legally
 Outcome: Death, accidents
 Discussed: Angrist and Pischke (2014 Ch 4)
Abdulkadiroğlu, Angrist, and Pathak (2014)
 Running variable: exam score
 Cutoff: above threshold receive an offer from a school. This is fuzzy since not all those who receive the offer attend.
 Outcome: Educational outcomes
 Discussed: Angrist and Pischke (2014 Ch 4)
Eggers and Hainmueller (2009)
 units: UK MPs
 outcome: personal wealth
 treatment: winning an election (holding office)
 running variable: vote share
Litschig and Morrison (2013)
 units: Brazilian municipalities
 outcome: education, literacy, poverty rate
 treatment: receiving a cash transfer from the central government (there are population cutoffs)
 running variable: population
Gelman and Hill (2007, 213–17)
 units: US Congressional members
 outcome: ideology of representative
 treatment: winning election
 running variable: vote share
Gelman and Katz (2007), Gelman and Hill (2007, 232)
 units: patients
 outcome: length of hospital stay
 treatment: new surgery method
 cutoff: not performed on those over 80
 running variable: age

 units: congressional districts
 outcome: ideology of nominees
 treatment: election
 running variable: vote share
Jacob and Lefgren (2004)
 units: students
 outcome: education achievement
 treatment: summer school, retention
 running variable: standardized test
11.2 Example: Close Elections
A common use of RD in political science and econ is election outcomes. In this case the “treatment” is winning the election; it is applied to the candidate whose vote exceeds the threshold of 50%, but not to candidates arbitrarily below that threshold. Thus “close” elections are a common use of RD designs. This design was formalized in Lee (2008).
Several papers question whether close elections satisfy the assumptions of RD:
 Caughey and Sekhon (2011) look at US House elections (19422008). They find that close elections are more imbalanced. They attribute this to national partisan waves.
 Grimmer et al. (2011) look at all US House elections 18802008. They find that structurally advantaged candidates (strong party, incumbents) are more likely to win close elections.
The ways in which close elections can be nonrandom are lawsuit challenges and fraud.
Eggers et al. (2014) addresses these concerns with a systematic review of 40,000 close elections: “U.S. House in other time periods, statewide, state legislative, and mayoral races in the U.S. and national or local elections in nine other countries” Only the US House appears to have these issues.
11.3 Software
See the R packages
 rddtools: a new and fairly complete package of regression discontinuity from primary data viz to other tests.
 rdd
 rdrobust: Tools for datadriven graphical and analytical statistical inference in RD.
 rdpower: Calculate power for RD designs.
 rdmulti: Analyze designs with multiple cutoffs.
See entries in the Econometrics task view.
11.4 References
Textbooks and Reviews:
 Angrist and Pischke (2014 Ch. 4)
 Gelman and Hill (2007 Sec. 10.4)
 Bailey (2016 Ch. 11)
 Linden, Adams, and Roberts (2006) for applications to medicine
 Hahn, Todd, and Klaauw (2001) An early review of RD in economics
Methods:
 Imbens and Kalyanaraman (2011) propose an optimal bandwidth selection method
References
Imbens, G., and K. Kalyanaraman. 2011. “Optimal Bandwidth Choice for the Regression Discontinuity Estimator.” The Review of Economic Studies 79 (3). Oxford University Press (OUP): 933–59. https://doi.org/10.1093/restud/rdr043.
Lee, David S, and Thomas Lemieux. 2010. “Regression Discontinuity Designs in Economics.” Journal of Economic Literature 48 (2). American Economic Association: 281–355. https://doi.org/10.1257/jel.48.2.281.
Thistlethwaite, Donald L., and Donald T. Campbell. 1960. “RegressionDiscontinuity Analysis: An Alternative to the Ex Post Facto Experiment.” Journal of Educational Psychology 51 (6): 309–17. http://offcampus.lib.washington.edu/login?url=http://search.ebscohost.com/login.aspx?direct=true&db=pdh&AN=196200061001&site=ehostlive.
Angrist, Joshua D., and JörnSteffen Pischke. 2009. Mostly Harmless Econometrics: An Empiricist’s Companion. Pr.
2014. Mastering ‘Metrics. Princeton UP.Carpenter, Christopher, and Carlos Dobkin. 2011. “The Minimum Legal Drinking Age and Public Health.” Journal of Economic Perspectives 25 (2). American Economic Association: 133–56. https://doi.org/10.1257/jep.25.2.133.
Carpenter, Christopher, and Carlos Dobkin. 2009. “The Effect of Alcohol Consumption on Mortality: Regression Discontinuity Evidence from the Minimum Drinking Age.” American Economic Journal: Applied Economics 1 (1). American Economic Association: 164–82. https://doi.org/10.1257/app.1.1.164.
Abdulkadiroğlu, Atila, Joshua Angrist, and Parag Pathak. 2014. “The Elite Illusion: Achievement Effects at Boston and New York Exam Schools.” Econometrica 82 (1). The Econometric Society: 137–96. https://doi.org/10.3982/ecta10266.
Eggers, Andrew C., and Jens Hainmueller. 2009. “MPs for Sale? Returns to Office in Postwar British Politics.” American Political Science Review 103 (04). Cambridge University Press (CUP): 513–33. https://doi.org/10.1017/s0003055409990190.
Litschig, Stephan, and Kevin M. Morrison. 2013. “The Impact of Intergovernmental Transfers on Education Outcomes and Poverty Reduction.” American Economic Journal: Applied Economics 5 (4). American Economic Association: 206–40. https://doi.org/10.1257/app.5.4.206.
Gelman, Andrew, and Jennifer Hill. 2007. Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge.
Gelman, Andrew, and Jonathan M. Katz. 2007. “Moderation in the Pursuit of Moderation Is No Vice: The Clear but Limited Advantages to Being a Moderate for Congressional Elections.” http://www.stat.columbia.edu/~gelman/research/unpublished/moderation5.pdf.
Bailey, Michael A. 2016. Real Stats: Using Econometrics for Political Science and Public Policy. Oxford University Press.
Button, Patrick. 2015. “A Replication of ‘Do Voters Affect or Elect Policies? Evidence from the U.S. House’ (The Quarterly Journal of Economics, 2004).” Working Paper 1518. Tulane University Department of Economics. http://econ.tulane.edu/RePEc/pdf/tul1518.pdf.
Jacob, Brian A., and Lars Lefgren. 2004. “Remedial Education and Student Achievement: A RegressionDiscontinuity Analysis.” Review of Economics and Statistics 86 (1). MIT Press  Journals: 226–44. https://doi.org/10.1162/003465304323023778.
Lee, David S. 2008. “Randomized Experiments from NonRandom Selection in U.S. House Elections.” Journal of Econometrics 142 (2). Elsevier BV: 675–97. https://doi.org/10.1016/j.jeconom.2007.05.004.
Caughey, Devin, and Jasjeet S. Sekhon. 2011. “Elections and the Regression Discontinuity Design: Lessons from Close U.S. House Races, 19422008.” Political Analysis 19 (4). [Oxford University Press, Society for Political Methodology]: 385–408. http://www.jstor.org/stable/41403727.
Grimmer, Justin, Eitan Hersh, Brian Feinstein, and Daniel Carpenter. 2011. “Are Close Elections Random?” http://web.stanford.edu/~jgrimmer/CEF.pdf.
Eggers, Andrew C., Anthony Fowler, Jens Hainmueller, Andrew B. Hall, and James M. Snyder. 2014. “On the Validity of the Regression Discontinuity Design for Estimating Electoral Effects: New Evidence from over 40,000 Close Races.” American Journal of Political Science 59 (1). WileyBlackwell: 259–74. https://doi.org/10.1111/ajps.12127.
Linden, Ariel, John L. Adams, and Nancy Roberts. 2006. “Evaluating Disease Management Programme Effectiveness: An Introduction to the Regression Discontinuity Design.” Journal of Evaluation in Clinical Practice 12 (2). WileyBlackwell: 124–31. https://doi.org/10.1111/j.13652753.2005.00573.x.
Hahn, Jinyong, Petra Todd, and Wilbert Van der Klaauw. 2001. “Identification and Estimation of Treatment Effects with a RegressionDiscontinuity Design.” Econometrica 69 (1). [Wiley, Econometric Society]: 201–9. http://www.jstor.org/stable/2692190.