Assessing External Validity Over Worst-case Subpopulations
Abstract
Study populations are typically sampled from limited points in space and time, and marginalized groups are underrepresented. To assess the external validity of randomized and observational studies, we propose and evaluate the worst-case treatment effect (WTE) across all subpopulations of a given size, which guarantees positive findings remain valid over subpopulations. We develop a semiparametrically efficient estimator for the WTE that analyzes the external validity of the augmented inverse propensity weighted estimator for the average treatment effect. Our cross-fitting procedure leverages flexible nonparametric and machine learning-based estimates of nuisance parameters and is a regular root-$n$ estimator even when nuisance estimates converge more slowly. On real examples where external validity is of core concern, our proposed framework guards against brittle findings that are invalidated by unanticipated population shifts.
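As a rough illustration of the "worst-case treatment effect across all subpopulations of a given size", the sketch below computes a simple plug-in worst-case average from per-unit effect estimates. It is not the semiparametrically efficient, cross-fitted estimator the abstract describes. It makes several assumptions not stated in the abstract: that the worst case over subpopulations of proportion at least `alpha` can be written in the dual form min over `eta` of `eta + E[(effect - eta)_+] / alpha`, that larger effects count as worse, and that conditional effect estimates are already available. The function name and the array `cate_hat` are hypothetical.

```python
import numpy as np

def worst_case_subpopulation_mean(effects, alpha):
    """Plug-in worst-case average of `effects` over subpopulations of proportion >= alpha.

    Uses the dual form  min_eta { eta + mean(max(effects - eta, 0)) / alpha }.
    Assumption: larger effects are "worse"; negate the input for the opposite convention.
    """
    effects = np.asarray(effects, dtype=float)
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must lie in (0, 1]")
    # The objective is convex and piecewise linear in eta, so a minimizer
    # can be found among the observed values themselves.
    # excess[i, j] = max(effects[j] - effects[i], 0), i.e. eta = effects[i].
    excess = np.maximum(effects[None, :] - effects[:, None], 0.0)
    objective = effects + excess.mean(axis=1) / alpha
    return float(objective.min())

cate_hat = np.array([1.0, 2.0, 3.0, 4.0])  # hypothetical per-unit effect estimates
print(worst_case_subpopulation_mean(cate_hat, alpha=0.5))
print(worst_case_subpopulation_mean(cate_hat, alpha=1.0))
```

With `alpha = 1` the value reduces to the ordinary average of the estimates, and smaller `alpha` focuses on the most adverse fraction of units. The pairwise comparison is quadratic in the sample size, which keeps the sketch short but would need a sort-based version for large samples.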
- Publication: arXiv e-prints
- Pub Date: July 2020
- DOI: 10.48550/arXiv.2007.02411
- arXiv: arXiv:2007.02411
- Bibcode: 2020arXiv200702411J
- Keywords: Statistics - Machine Learning; Computer Science - Machine Learning; Economics - Econometrics
- E-Print: A previous version of the paper, circulated under the title "Robust Causal Inference Under Covariate Shift via Worst-Case Subpopulation Treatment Effects", appeared in COLT 2020.