Periodic Reporting for period 3 - RD-ADVANCE (Advancing Econometric Methods for Analyzing Data from Regression Discontinuity Designs)
Período documentado: 2021-09-01 hasta 2023-02-28
The goal of this project is to expand the tools available to researchers working with RD designs. It involves both methodological research in econometrics and statistics and practical work with real data. The project has developed more accurate and easy-to-use methods for estimation and inference, addressing common issues that affect the validity of RD analysis.
Part I focuses on covariates and group structures in RD designs. The research has led to two new approaches that improve estimation precision by making use of covariates with many dimensions. This is achieved by combining classical nonparametric regression techniques commonly used in the RD literature with modern machine learning methods that help tracing out the most relevant covariate information.
Part II develops methods for RD designs when the running variable takes on only a moderate number of distinct values. The research shows that the commonly used method of clustering standard errors at the level of the running variable to account for its discreteness has very poor properties and should not be used. Instead, the project recommends using recently proposed "bias-aware" methods that are well-suited for dealing with discrete covariates. The project also extends the "bias-aware" methodology fuzzy RD designs with discrete covariates and other forms of irregular support.
Part III introduces methods that consider the manipulation of the running variable in RD designs. The project has obtained new partial identification results for general models of manipulation and proposed flexible methods for estimating these bounds. Additionally, the project has made progress in understanding Donut RD designs, a widely used approach that involves excluding units near the cutoff to address concerns about manipulation of the running variable.
Given the popularity of RD designs and the project's practical focus, it has the potential to make a significant impact on empirical economic research in various policy-relevant areas such as education and public finance. Its findings can also benefit other fields like sociology or epidemiology, where researchers must commonly work with observational data to infer causal relationships. By expanding the range of available methods, this project contributes to advancing the field of economics and promoting progress in other scientific disciplines, ultimately leading to positive outcomes for society.
Part I focuses on covariates and group structures in RD designs. The research has led to two new approaches that improve estimation precision by making use of covariates with many dimensions. This is achieved in both cases by combining classical nonparametric regression techniques commonly used in the RD literature with modern machine learning methods that help tracing out the most relevant covariate information. Overall, Part I of the project has lead to to one published and one submitted paper by the end of the funding period. As covariate adjustments are widely used in empirical practice, the efficency gains of the new methods should have a meaningful impact on various applied fields.
Part II develops methods for RD designs when the running variable takes on only a moderate number of distinct values. The research shows that the commonly used method of clustering standard errors at the level of the running variable to account for its discreteness has very poor properties and should not be used. Instead, the project recommends using recently proposed "bias-aware" methods that are well-suited for dealing with discrete covariates. The project also extends the "bias-aware" methodology fuzzy RD designs with discrete covariates and other forms of irregular support. Overall, Part II of the project has lead to to one published and two submitted papers by the end of the funding period. The published work on the issues of the clustering approach is also already widely cited in the applied literature, and has changed how this type of empirical research is carried out.
Part III introduces methods that consider the manipulation of the running variable in RD designs. The project has obtained new partial identification results for general models of manipulation and proposed flexible methods for estimating these bounds. Additionally, the project has made progress in understanding Donut RD designs, a widely used approach that involves excluding units near the cutoff to address concerns about manipulation of the running variable. Overall, Part III of the project has lead to to one published and two submitted papers by the end of the funding period. This research has highlighted the differential impact of different assumptions about manipulation and the difficulties of "all purpose" models, and is thereby contributing to a growing theoretical literature that studies RD inference under manipulation of the running variable by more carefully modeling the manipulation process.