Add fallback to gesvd if gedd LAPACK fails in numpy backend #962

gefux · 2022-05-10T11:22:52Z

Solves: #896

Hi TN people!
Thank you very much for your work so far!

Much like in #896 users of our project OQuPy experience sometimes non-reproducible "SVD did not converge" errors. Very unpleasantly they occur more often with large tensors that are often the result of hours of computation time. I believe most (if not all) of these errors could be avoided by falling back to the _gesvd LAPACK routine in scipy.

Numpy uses the _gesdd LAPACK routine for performing SVDs. In a nutshell: _gesdd routine is for large matrices much faster then the _gesvd routine, but _gesvd is more stabil for badly conditioned matrices.

If one searches for "scipy gesdd SVD did not converge" one finds a lot of issues on github and stackoverflow and 90% of the conversation can be summarised as "I cannot reproduce your/my issue."

In this Issue, @pv (Pauli Virtranen) is pointing out:

Floating-point reproducibility is generally not guaranteed (see e.g. intel compiler manuals) on any platform, even for code in the same program can give different results due to e.g. data alignment, unless you take special care with library and compiler options, as it depends on things such as compiler optimizations. This is not generally considered a bug, and correct code should not assume it. It is a speed/precision tradeoff, and official binaries provided by scipy project do not ensure 100% fp reproducibility.

However, when browsing through the dozens of issues, it seems that they are often resolve when using the _gesvd LAPACK routine (through scipy). Because _gesdd is mostly much faster I think it is reasonable to keep this as the default, but fall back to the _gesvd routine if the _gesdd fails. This is also done in the TeNPy package here.

Following the Zen of Python: "Errors should never pass silently.", I've added a warning to the fall back to gesdd.
Unfortunately, I don't know how to test it, because I don't know how to construct a matrix that will fail reliably with gesdd.

gefux mentioned this pull request May 10, 2022

numpy.linalg.LinAlgError: SVD did not converge #896

Closed

Add fallback to gesvd if gesdd LAPACK fails in numpy backend

6e77a2b

gefux force-pushed the pr/robust-lapack-svd branch from e1a5f83 to 6e77a2b Compare May 10, 2022 11:47

gefux mentioned this pull request May 13, 2022

Use a stable SVD function call tempoCollaboration/OQuPy#65

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add fallback to gesvd if gedd LAPACK fails in numpy backend #962

Add fallback to gesvd if gedd LAPACK fails in numpy backend #962

gefux commented May 10, 2022 •

edited

Loading

Add fallback to gesvd if gedd LAPACK fails in numpy backend #962

Are you sure you want to change the base?

Add fallback to gesvd if gedd LAPACK fails in numpy backend #962

Conversation

gefux commented May 10, 2022 • edited Loading

gefux commented May 10, 2022 •

edited

Loading