ENH: allow using symbol-suffixed 64-bit BLAS/LAPACK for numpy.dot and linalg #15012

pv · 2019-11-29T19:57:13Z

(This is partially for discussion, but should be functional:)

Add support for 64-bit BLAS/LAPACK with 64_ symbol suffix (henceforth "BLAS64_").

BLAS64_ is 64-bit BLAS with a 64-bit integer interface (ILP64) and 64_ appended to all symbol names. This is what you get with INTERFACE64=1 SYMBOLSUFFIX=64_ in OpenBLAS, and it is the approach used by Julia. Fedora also has 64-bit OpenBLAS packages with this configuration. The point is that you can safely link to both 32-bit and 64-bit BLAS at the same time, without having to coordinate all packages. In particular, numpy can link to BLAS64_ and you don't get segfaults if you import numpy from a program that also links to 32-bit BLAS. Segfaults are what you would get with 64-bit BLAS/LAPACK done in the usual way, without a suffix. For ELF this is mostly a problem for programs embedding Python, not so much for standalone Python, but Numpy probably cares about both use cases, so the 'usual' way is a no-go.
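To make the naming concrete, here is a minimal sketch of how the suffix changes symbol names. It assumes the common trailing-underscore convention for Fortran symbols; the helper names `fortran_symbol` and `cblas_symbol` are hypothetical and are not part of NumPy or OpenBLAS.

```python
SUFFIX = "64_"  # the symbol suffix discussed in this PR


def fortran_symbol(name, suffix=""):
    """Symbol for a Fortran BLAS/LAPACK routine.

    Assumes gfortran-style mangling with one trailing underscore
    (an assumption; other compilers may mangle differently).
    """
    return name + "_" + suffix


def cblas_symbol(name, suffix=""):
    """Symbol for a CBLAS routine; the suffix is appended directly."""
    return "cblas_" + name + suffix


print(fortran_symbol("dgemm"))          # dgemm_         (conventional BLAS)
print(fortran_symbol("dgemm", SUFFIX))  # dgemm_64_      (suffixed ILP64 build)
print(cblas_symbol("dgemm", SUFFIX))    # cblas_dgemm64_
```

Because the suffixed and unsuffixed names differ, both libraries can be loaded into one process without one shadowing the other.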

See OpenMathLib/OpenBLAS#646 and JuliaLang/julia#4923 (and also #13956, #5906)

This PR adds numpy.distutils support for detecting OpenBLAS64_, based on the library name and checking it actually contains the correct symbols.
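The idea of "checking it actually contains the correct symbols" can be illustrated at run time with `ctypes`. This is only a sketch of the concept and is not how numpy.distutils performs the check in this PR; the function names and the default symbol list are assumptions chosen for illustration.

```python
import ctypes


def has_symbols(handle, names):
    """Return True if every name resolves on an already-loaded library handle."""
    # ctypes raises AttributeError for a missing symbol, so hasattr() works.
    return all(hasattr(handle, name) for name in names)


def check_library(path, names=("dgemm_64_", "cblas_dgemm64_")):
    """Load `path` and report whether it exports the suffixed symbols.

    Returns False if the library cannot be loaded at all.
    """
    try:
        handle = ctypes.CDLL(path)
    except OSError:
        return False
    return has_symbols(handle, names)
```

For example, `check_library("libopenblas64_.so")` would be expected to return True only where such a library is installed and built with the 64_ suffix; the file name varies by platform.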

It adds a build-time environment variable NPY_USE_BLAS64_=1. If it's set, Numpy gets linked only with 64-bit BLAS. It's off by default for now, but it should be safe to turn it on by default later on.
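A build script could read such an opt-in switch roughly as follows. This is a minimal sketch: the helper `blas64_requested` is hypothetical, and the rule that only the exact value "1" enables the feature is an assumption, not necessarily how the PR parses the variable.

```python
import os


def blas64_requested(environ=None):
    """True when NPY_USE_BLAS64_ is set to "1"; anything else counts as off."""
    env = os.environ if environ is None else environ
    return env.get("NPY_USE_BLAS64_", "0") == "1"


# Off by default, on only when explicitly requested.
print(blas64_requested({}))                        # False
print(blas64_requested({"NPY_USE_BLAS64_": "1"}))  # True
```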

The 64-bit BLAS functions are then used to implement cblasfuncs.c, so that numpy.dot et al. get the 64-bit goodness. Moreover, the later commits also implement its usage in numpy.linalg.

Addresses: #5533 (if you have the right 64-bit BLAS installed). Moreover, the numpy.distutils support should be useful for all downstream libraries.

(I don't have a machine with enough memory to check this actually fixes the issues with n > 2**32-1 matrix products, but the call does end up in libopenblas64_.so so it should.)
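Some rough arithmetic shows why a large-memory machine is needed for such a check. The sketch below only estimates the size of a single array at the length mentioned above; a real matrix product also needs memory for the other operand and the result.

```python
def array_gib(n_elements, itemsize=8):
    """Memory in GiB for n_elements items of itemsize bytes (8 for float64)."""
    return n_elements * itemsize / 2**30


n = 2**32                        # just past the 2**32-1 figure mentioned above
print(array_gib(n))              # 32.0 GiB for one float64 vector of that length
print(array_gib(n, itemsize=4))  # 16.0 GiB for float32
```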

This is an emerging "standard" for 64-bit BLAS/LAPACK, avoiding symbol clashes with 32-bit BLAS/LAPACK, originally introduced for Julia. OpenBLAS can be compiled with 64-bit integer size and symbol suffix '64_' (INTERFACE64=1 SYMBOLSUFFIX=64_). OpenBLAS built with this setting is also provided by some Linux distributions (e.g. Fedora's 64-bit openblas packages).

This is enabled by setting NPY_USE_BLAS64_=1 environment variable at build time. If set, Numpy will be linked both against a 32-bit BLAS and a symbol-suffixed 64-bit BLAS. (This is safe and does not cause symbol clashes thanks to the suffixing.)


isuruf · 2019-12-01T00:42:25Z

I checked with OpenBLAS and this works. I also installed scipy with MKL and replaced libopenblas64_.so with a shared library built from the MKL static libs, exporting only the BLAS, CBLAS, and LAPACK symbols, each renamed with a 64_ suffix. There was no crash when importing scipy and numpy, and numpy called the 64_ variants.

mattip · 2019-12-01T05:46:07Z

numpy/core/src/common/cblasfuncs.c

@@ -10,10 +10,20 @@
 #include <assert.h>
 #include <numpy/arrayobject.h>
 #include "npy_cblas.h"
+#include "npy_cblas64_.h"


Why the trailing _ in the name?

To match the symbol name convention.

mattip · 2019-12-01T06:03:41Z

This is really useful for people with lots of memory. We should add checks so that calling BLAS with a large array does not segfault as in this example, maybe as a separate issue since this PR is quite large as it is.
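The kind of guard suggested here could look like the sketch below. It assumes the conventional interface passes sizes as signed 32-bit integers; `fits_blas_int` and `choose_path` are hypothetical names, and this does not describe any check NumPy already has or the one eventually added.

```python
INT32_MAX = 2**31 - 1  # largest value of a signed 32-bit integer


def fits_blas_int(*dims, int_max=INT32_MAX):
    """True if every dimension or stride can be passed as a BLAS integer."""
    return all(0 <= d <= int_max for d in dims)


def choose_path(m, n, k):
    """Use BLAS when the sizes fit, otherwise some non-BLAS fallback."""
    return "blas" if fits_blas_int(m, n, k) else "fallback"


print(choose_path(1000, 1000, 1000))  # blas
print(choose_path(2**31, 1, 1))       # fallback
```

With an ILP64 build the same check would use the 64-bit limit instead, for example `fits_blas_int(m, n, k, int_max=2**63 - 1)`.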

Name bikeshedding: is the 64_ name the standard?

pv · 2019-12-01T10:47:32Z

There's of course no real standard suffix, but 64_ is used by Julia and apparently by Sunperf, and Fedora already ships packages built with it. To avoid proliferation of incompatible conventions, I believe we should not deviate from this choice.

pv · 2019-12-01T12:22:34Z

I'll add the test in a separate PR, as we'll probably need some machinery to stop it from running on low-memory systems, which it would thrash to a standstill...

mattip · 2019-12-01T12:50:30Z

There is a test for large zip files that tries to allocate and skips if it cannot, with a slow marker as well so it is not typically run

pv · 2019-12-01T13:22:50Z

The "user experience" with not checking beforehand is sometimes bad (i.e. machine hangs). gh-15021 has a more elaborate suggestion for the low-memory check.
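A check done beforehand, rather than by attempting the allocation, could be sketched as follows. This is Linux-specific and is not the machinery proposed in gh-15021; the function names are hypothetical, and skipping when the amount of memory is unknown is a design choice made here to stay on the safe side.

```python
def parse_mem_available(meminfo_text):
    """Return MemAvailable in bytes from /proc/meminfo-style text, or None."""
    for line in meminfo_text.splitlines():
        if line.startswith("MemAvailable:"):
            kib = int(line.split()[1])  # the value is reported in kB
            return kib * 1024
    return None


def should_skip(required_bytes, meminfo_text):
    """Skip when memory is insufficient or cannot be determined."""
    available = parse_mem_available(meminfo_text)
    return available is None or available < required_bytes
```

On Linux the text would come from reading `/proc/meminfo`; a test needing, say, 32 GiB would then call `should_skip(32 * 2**30, text)` before allocating anything.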

pv · 2019-12-01T14:02:51Z

Azure failures seem spurious ("[NuGet] The operation has timed out" when trying to install mingw).

charris · 2019-12-02T17:55:01Z

Thanks Pauli. It will be good to get this out there. If nothing else it will help set the standard for the 64 bit move.

eric-wieser · 2019-12-02T17:56:32Z

Should this have a release note?

pv · 2019-12-02T19:40:30Z

Probably yes. I forgot numpy is using towncrier, sorry. I will have time for this only later this week, so if someone beats me to it, ok. I also looked at getting matthew's manylinux/etc openblas builds to produce the 64_ variant, cf. pv/openblas-libs@blas64_. It was quite fiddly to deal with the CIs, but that's basically finished now, even if I didn't have time to actually test the product vs numpy. Anyway, getting that merged to production is probably required before e.g. adding a blas64 travis target.


pv · 2019-12-02T19:43:28Z

The big picture being that 64bit pypi wheels could be eventually switched to use 64bit blas, of course. But I guess this should cook for some time in master first and maybe see some more serious use.


refraction-ray · 2019-12-06T07:02:24Z

Really nice to see numpy is becoming more 64bit and all the great efforts toward this!
I'm curious about the symbol conventions for the ILP64 interface of MKL, and how to link this 64bit-aware numpy to the MKL ILP64 interface instead of openblas. Any ideas?

isuruf · 2019-12-06T07:25:21Z

@refraction-ray, to do that, you'll have to create a shared library from the MKL static library that exports only the BLAS and LAPACK symbols, and then rename those symbols.

refraction-ray · 2019-12-06T07:41:43Z

@isuruf by renaming, you mean using GNU objcopy to rename all symbol names following the new 64bit suffix convention that openblas and numpy use? And I am not sure about the part about exporting only the BLAS and LAPACK symbols. What is the danger if all symbols are exported to the new .so file, and how can I quickly specify all BLAS and LAPACK symbols to be exported?

isuruf · 2019-12-06T07:49:49Z

> @isuruf by renaming, you mean using GNU objcopy to rename all symbol names following the new 64bit suffix convention openblas and numpy utilized?

yes.

> What is the danger if all symbols are exported to the new .so file

There are symbols already in the MKL libs with and without 64_ suffix.

> how to fast specify all BLAS and LAPACK symbols to be exported?

Use a linker script.
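For the renaming step, GNU objcopy can read "old new" symbol pairs from a file via its `--redefine-syms` option. The sketch below generates such a file's contents; the helper name and the short symbol list are hypothetical, and this is not necessarily the exact procedure used above.

```python
def redefine_syms_lines(symbols, suffix="64_"):
    """Build "old new" lines in the format read by objcopy --redefine-syms."""
    return [f"{name} {name}{suffix}" for name in symbols]


# Hypothetical input: a few exported BLAS/LAPACK symbol names.
exported = ["dgemm_", "dgesv_", "cblas_dgemm"]
print("\n".join(redefine_syms_lines(exported)))
# dgemm_ dgemm_64_
# dgesv_ dgesv_64_
# cblas_dgemm cblas_dgemm64_
```

The printed lines would be saved to a file and passed to objcopy. In practice the full symbol list would have to be extracted from the library itself, restricted to the BLAS, CBLAS, and LAPACK entry points.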


pv added 4 commits November 29, 2019 20:36

ENH: core: add 64_ suffixed cblas header

2d75dec

ENH: core: use symbol-suffixed 64-bit CBLAS in cblasfuncs, when avail…

acf8bcb

…able When Numpy is compiled with BLAS64_ enabled, use the 64-bit CBLAS routines to provide cblasfuncs, and dot() et al.

pv changed the title ~~ENH: allow using symbol-suffixed 64-bit BLAS for numpy.dot~~ ENH: allow using symbol-suffixed 64-bit BLAS/LAPACK for numpy.dot and linalg Nov 29, 2019

pv force-pushed the blas64_ branch from 1daabc7 to eb216aa Compare November 29, 2019 21:13

pv added 2 commits November 29, 2019 23:20

ENH: core: add LAPACK64_ support in numpy.linalg

52ce77f

ENH: core: link only against blas64_/lapack64_ when BLAS64_ set

33a2fcb

pv force-pushed the blas64_ branch from eb216aa to 33a2fcb Compare November 29, 2019 21:21

pv mentioned this pull request Nov 29, 2019

ENH: use OpenBLAS64 bit interfaces #13956

Closed

pv marked this pull request as ready for review November 30, 2019 15:31

mattip reviewed Dec 1, 2019


pv mentioned this pull request Dec 1, 2019

TST: machinery for tests requiring large memory + lapack64 smoketest #15021

Merged

DOC: document NPY_USE_BLAS64_ environment variable

3892cad

charris added 01 - Enhancement component: numpy.linalg component: numpy._core component: build component: numpy.distutils labels Dec 2, 2019

charris merged commit 31e8b55 into numpy:master Dec 2, 2019

blechta mentioned this pull request Dec 4, 2019

Matlab/NumPy BLAS incompatibility #15049

Closed

pv mentioned this pull request Dec 4, 2019

MAINT: follow-up cleanup for blas64 PR #15052

Merged

This was referenced Dec 7, 2019

ENH: add support for ILP64 OpenBLAS (without symbol suffix) #15069

Merged

ENH: build also openblas64_ 64-bit symbol-suffixed library MacPython/openblas-libs#8

Merged

pv added a commit to pv/numpy that referenced this pull request Dec 14, 2019

BUG: core: use blas_ilp64 also for *_matmul, *_dot, and *_vdot

de8a10d

Changing these to support ILP64 blas was missed in gh-15012

charris mentioned this pull request Dec 15, 2019

MAINT: follow-up cleanup for blas64 PR #15112

Merged

charris pushed a commit to charris/numpy that referenced this pull request Dec 15, 2019

BUG: core: use blas_ilp64 also for *_matmul, *_dot, and *_vdot

9466d31

Changing these to support ILP64 blas was missed in gh-15012

seberg mentioned this pull request Apr 16, 2020

IPCA did not converge, numpy.linalg.LinAlgError: SVD did not converge #15996

Closed
