On the matrix square root via geometric optimization

doi:10.48550/arXiv.1507.08366

On the matrix square root via geometric optimization

Sra, Suvrit

This paper is triggered by the preprint "\emph{Computing Matrix Squareroot via Non Convex Local Search}" by Jain et al. (\textit{\textcolor{blue}{arXiv:1507.05854}}), which analyzes gradient-descent for computing the square root of a positive definite matrix. Contrary to claims of~\citet{jain2015}, our experiments reveal that Newton-like methods compute matrix square roots rapidly and reliably, even for highly ill-conditioned matrices and without requiring commutativity. We observe that gradient-descent converges very slowly primarily due to tiny step-sizes and ill-conditioning. We derive an alternative first-order method based on geodesic convexity: our method admits a transparent convergence analysis ($< 1$ page), attains linear rate, and displays reliable convergence even for rank deficient problems. Though superior to gradient-descent, ultimately our method is also outperformed by a well-known scaled Newton method. Nevertheless, the primary value of our work is its conceptual value: it shows that for deriving gradient based methods for the matrix square root, \emph{the manifold geometric view of positive definite matrices can be much more advantageous than the Euclidean view}.

Publication:

arXiv e-prints

Pub Date:

July 2015

DOI:

10.48550/arXiv.1507.08366

arXiv:

arXiv:1507.08366

Bibcode:

2015arXiv150708366S

Keywords:

Mathematics - Numerical Analysis;
Mathematics - Optimization and Control

E-Print:

8 pages, 12 plots, this version contains several more references and more words about the rank-deficient case

NASA/ADS

On the matrix square root via geometric optimization

Abstract