1

[2312.08893] Solving Dense Linear Systems Faster than via Preconditioning

 1 month ago
source link: https://arxiv.org/abs/2312.08893
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

Computer Science > Data Structures and Algorithms

[Submitted on 14 Dec 2023]

Solving Dense Linear Systems Faster than via Preconditioning

View PDF HTML (experimental)

We give a stochastic optimization algorithm that solves a dense n×n real-valued linear system Ax=b, returning x~ such that ∥Ax~−b∥≤ϵ∥b∥ in time:
O~((n2+nkω−1)log1/ϵ),
where k is the number of singular values of A larger than O(1) times its smallest positive singular value, ω<2.372 is the matrix multiplication exponent, and O~ hides a poly-logarithmic in n factor. When k=O(n1−θ) (namely, A has a flat-tailed spectrum, e.g., due to noisy data or regularization), this improves on both the cost of solving the system directly, as well as on the cost of preconditioning an iterative method such as conjugate gradient. In particular, our algorithm has an O~(n2) runtime when k=O(n0.729). We further adapt this result to sparse positive semidefinite matrices and least squares regression.
Our main algorithm can be viewed as a randomized block coordinate descent method, where the key challenge is simultaneously ensuring good convergence and fast per-iteration time. In our analysis, we use theory of majorization for elementary symmetric polynomials to establish a sharp convergence guarantee when coordinate blocks are sampled using a determinantal point process. We then use a Markov chain coupling argument to show that similar convergence can be attained with a cheaper sampling scheme, and accelerate the block coordinate descent update via matrix sketching.
Subjects: Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Numerical Analysis (math.NA); Optimization and Control (math.OC)
Cite as: arXiv:2312.08893 [cs.DS]
  (or arXiv:2312.08893v1 [cs.DS] for this version)
  https://doi.org/10.48550/arXiv.2312.08893

Submission history

From: Michał Dereziński [view email]
[v1] Thu, 14 Dec 2023 12:53:34 UTC (58 KB)

About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK