Distribution-on-Distribution Regression via Optimal Transport Maps
Abstract
We present a framework for performing regression when both covariate and response are probability distributions on a compact interval $\Omega\subset\mathbb{R}$. Our regression model is based on the theory of optimal transportation and links the conditional Fréchet mean of the response distribution to the covariate distribution via an optimal transport map. We define a Fréchet-least-squares estimator of this regression map, and establish its consistency and rate of convergence to the true map, under both full and partial observation of the regression pairs. Computation of the estimator is shown to reduce to an isotonic regression problem, and thus our regression model can be implemented with ease. We illustrate our methodology using real and simulated data.
- Publication:
-
arXiv e-prints
- Pub Date:
- April 2021
- DOI:
- 10.48550/arXiv.2104.09418
- arXiv:
- arXiv:2104.09418
- Bibcode:
- 2021arXiv210409418G
- Keywords:
-
- Statistics - Methodology
- E-Print:
- to appear in Biometrika