Assessing External Validity Over Worst-case Subpopulations

Study populations are typically sampled from limited points in space and time, and marginalized groups are underrepresented. To assess the external validity of randomized and observational studies, we propose and evaluate the worst-case treatment effect (WTE) across all subpopulations of a given size, which guarantees that positive findings remain valid over all such subpopulations. We develop a semiparametrically efficient estimator for the WTE that analyzes the external validity of the augmented inverse propensity weighted estimator for the average treatment effect. Our cross-fitting procedure leverages flexible nonparametric and machine learning-based estimates of nuisance parameters and yields a regular root-$n$ estimator even when the nuisance estimates converge more slowly. On real examples where external validity is of core concern, our proposed framework guards against brittle findings that are invalidated by unanticipated population shifts.
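To make the quantity in the abstract concrete, the sketch below computes a naive plug-in version of a worst-case subpopulation effect. It assumes that a subpopulation "of a given size" means any reweighting of the sample that keeps at least a fraction `alpha` of the total mass, in which case the worst case is the average of the lowest `alpha`-fraction of per-unit effects. The function name, the `effects` input, and the numbers are hypothetical illustrations. This is not the paper's semiparametrically efficient, cross-fitted estimator, and it takes the per-unit effect estimates as given.

```python
def worst_case_subpopulation_mean(effects, alpha):
    """Average of the lowest alpha-fraction of `effects`.

    Illustrative plug-in only (an assumption of this sketch, not the
    paper's estimator): `effects` are hypothetical per-unit treatment
    effect estimates, and `alpha` is the subpopulation proportion.
    """
    if not 0 < alpha <= 1:
        raise ValueError("alpha must lie in (0, 1]")
    if not effects:
        raise ValueError("effects must be non-empty")

    ordered = sorted(effects)        # smallest effects first
    budget = alpha * len(ordered)    # number of units' worth of mass to keep
    remaining = budget
    total = 0.0
    for effect in ordered:
        weight = min(1.0, remaining)  # boundary unit may get fractional weight
        total += weight * effect
        remaining -= weight
        if remaining <= 0:
            break
    return total / budget


# Hypothetical per-unit effect estimates.
effects = [2.0, -1.0, 0.5, 3.0]
print(worst_case_subpopulation_mean(effects, 1.0))  # 1.125 (the plain average)
print(worst_case_subpopulation_mean(effects, 0.5))  # -0.25 (worst half)
```

In this toy example the average effect is positive while the worst half of the sample has a negative average effect, which is the kind of brittleness the WTE is meant to flag.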

Publication:

arXiv e-prints

Pub Date:

July 2020

DOI:

10.48550/arXiv.2007.02411

arXiv:

arXiv:2007.02411

Bibcode:

2020arXiv200702411J

Keywords:

Statistics - Machine Learning;
Computer Science - Machine Learning;
Economics - Econometrics

E-Print:

A previous version of the paper, circulated under the title "Robust Causal Inference Under Covariate Shift via Worst-Case Subpopulation Treatment Effects", appeared in COLT 2020.
