Slow Kill for Big Data Learning

doi:10.48550/arXiv.2305.01726

Slow Kill for Big Data Learning

Big-data applications often involve a vast number of observations and features, creating new challenges for variable selection and parameter estimation. This paper presents a novel technique called ``slow kill,'' which utilizes nonconvex constrained optimization, adaptive $\ell_2$-shrinkage, and increasing learning rates. The fact that the problem size can decrease during the slow kill iterations makes it particularly effective for large-scale variable screening. The interaction between statistics and optimization provides valuable insights into controlling quantiles, stepsize, and shrinkage parameters in order to relax the regularity conditions required to achieve the desired level of statistical accuracy. Experimental results on real and synthetic data show that slow kill outperforms state-of-the-art algorithms in various situations while being computationally efficient for large-scale data.

Publication:

arXiv e-prints

Pub Date:

May 2023

DOI:

10.48550/arXiv.2305.01726

arXiv:

arXiv:2305.01726

Bibcode:

2023arXiv230501726S

Keywords:

Statistics - Machine Learning;
Computer Science - Machine Learning;
Statistics - Computation;
Statistics - Methodology

NASA/ADS

Slow Kill for Big Data Learning

Abstract