Skip to main content
European Commission logo
English English
CORDIS - EU research results
CORDIS
CORDIS Web 30th anniversary CORDIS Web 30th anniversary

Advancing Econometric Methods for Analyzing Data from Regression Discontinuity Designs

Periodic Reporting for period 3 - RD-ADVANCE (Advancing Econometric Methods for Analyzing Data from Regression Discontinuity Designs)

Reporting period: 2021-09-01 to 2023-02-28

A major concern in many social sciences is that it is often impossible to study the causal impact of a policy intervention or treatment through a randomized controlled trial. Instead, only observational are often available for this task. Over the past two decades, regression discontinuity (RD) designs have become widely used in empirical economics for estimating causal effects from observational data. In these designs, units are assigned to the treatment group based on whether a special covariate, called the running variable, exceeds a specific cutoff value. When certain conditions are met, units near the cutoff can be considered as if they were randomly assigned to receive the treatment. This simple approach helps identify the causal effect of the treatment.

The goal of this project is to expand the tools available to researchers working with RD designs. It involves both methodological research in econometrics and statistics and practical work with real data. The project has developed more accurate and easy-to-use methods for estimation and inference, addressing common issues that affect the validity of RD analysis.

Part I focuses on covariates and group structures in RD designs. The research has led to two new approaches that improve estimation precision by making use of covariates with many dimensions. This is achieved by combining classical nonparametric regression techniques commonly used in the RD literature with modern machine learning methods that help tracing out the most relevant covariate information.

Part II develops methods for RD designs when the running variable takes on only a moderate number of distinct values. The research shows that the commonly used method of clustering standard errors at the level of the running variable to account for its discreteness has very poor properties and should not be used. Instead, the project recommends using recently proposed "bias-aware" methods that are well-suited for dealing with discrete covariates. The project also extends the "bias-aware" methodology fuzzy RD designs with discrete covariates and other forms of irregular support.

Part III introduces methods that consider the manipulation of the running variable in RD designs. The project has obtained new partial identification results for general models of manipulation and proposed flexible methods for estimating these bounds. Additionally, the project has made progress in understanding Donut RD designs, a widely used approach that involves excluding units near the cutoff to address concerns about manipulation of the running variable.

Given the popularity of RD designs and the project's practical focus, it has the potential to make a significant impact on empirical economic research in various policy-relevant areas such as education and public finance. Its findings can also benefit other fields like sociology or epidemiology, where researchers must commonly work with observational data to infer causal relationships. By expanding the range of available methods, this project contributes to advancing the field of economics and promoting progress in other scientific disciplines, ultimately leading to positive outcomes for society.
The goal of this project is to expand the tools available to researchers working with RD designs. It is divided into three parts.

Part I focuses on covariates and group structures in RD designs. The research has led to two new approaches that improve estimation precision by making use of covariates with many dimensions. This is achieved in both cases by combining classical nonparametric regression techniques commonly used in the RD literature with modern machine learning methods that help tracing out the most relevant covariate information. Overall, Part I of the project has lead to to one published and one submitted paper by the end of the funding period. As covariate adjustments are widely used in empirical practice, the efficency gains of the new methods should have a meaningful impact on various applied fields.

Part II develops methods for RD designs when the running variable takes on only a moderate number of distinct values. The research shows that the commonly used method of clustering standard errors at the level of the running variable to account for its discreteness has very poor properties and should not be used. Instead, the project recommends using recently proposed "bias-aware" methods that are well-suited for dealing with discrete covariates. The project also extends the "bias-aware" methodology fuzzy RD designs with discrete covariates and other forms of irregular support. Overall, Part II of the project has lead to to one published and two submitted papers by the end of the funding period. The published work on the issues of the clustering approach is also already widely cited in the applied literature, and has changed how this type of empirical research is carried out.

Part III introduces methods that consider the manipulation of the running variable in RD designs. The project has obtained new partial identification results for general models of manipulation and proposed flexible methods for estimating these bounds. Additionally, the project has made progress in understanding Donut RD designs, a widely used approach that involves excluding units near the cutoff to address concerns about manipulation of the running variable. Overall, Part III of the project has lead to to one published and two submitted papers by the end of the funding period. This research has highlighted the differential impact of different assumptions about manipulation and the difficulties of "all purpose" models, and is thereby contributing to a growing theoretical literature that studies RD inference under manipulation of the running variable by more carefully modeling the manipulation process.
The project has substantially improved the scope of regression discontinuity analysis beyond the state of the art. By developing new methodology, the project's results allow applied researchers to conduct more precise and informative RD inference in some settings where such approaches were already used, but also made RD inference possible in some settings where it had so far been deemed not applicable. The project has combined different types of statistical techniques in new ways to achieve this progress, and thereby also made substantial contributions to the theory of RD inference.
outcome-1adjustmentterms.png