Near-optimal Coresets For Least-Squares Regression
Abstract
We study (constrained) least-squares regression as well as multiple response least-squares regression and ask whether a subset of the data, a coreset, suffices to compute a good approximate solution to the regression. We give deterministic, low-order polynomial-time algorithms to construct such coresets with approximation guarantees, together with lower bounds indicating that there is not much room for improvement upon our results.
- Publication: arXiv e-prints
- Pub Date: February 2012
- DOI: 10.48550/arXiv.1202.3505
- arXiv: arXiv:1202.3505
- Bibcode: 2012arXiv1202.3505B
- Keywords: Computer Science - Data Structures and Algorithms; Computer Science - Machine Learning
- E-Print: To appear in IEEE Transactions on Information Theory