
ENH: stats: allow bootstrap to use CuPy #63

Closed
wants to merge 2 commits

Conversation

mdhaber (Owner) commented Sep 7, 2021

Experimental use of CuPy in scipy.stats.bootstrap

rgommers commented:

This is an interesting experiment. I believe we will not want an explicit import of CuPy within SciPy; rather, we will want to use the array API standard and __array_namespace__. But that should not be hard to change in the near future (once support in CuPy and NumPy is complete). The interesting part is that the benefit is large.
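
As a rough illustration (not part of this PR; the NumPy fallback below is an assumption, since plain ndarrays did not yet expose the protocol), dispatching on the array's namespace rather than importing CuPy directly could look something like:

import numpy as np

def array_namespace(x):
    # Arrays that implement the array API standard expose __array_namespace__;
    # fall back to NumPy for plain ndarrays (illustrative fallback only).
    if hasattr(x, "__array_namespace__"):
        return x.__array_namespace__()
    return np

def std(x):
    # The same code then works with NumPy, CuPy, or any other conforming library.
    xp = array_namespace(x)
    return xp.std(x)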

mdhaber (Owner, Author) commented Dec 6, 2021

I'm seeing a ~10x speedup on this. (Update: ~50x using rand and casting instead of randint.)

import numpy as np
import cupy as cp
from scipy import stats

# CPU baseline: bootstrap the standard deviation with NumPy arrays
data_np = np.random.rand(10000)
rng_np = np.random.RandomState(0)
res_np = stats.bootstrap((data_np,), np.std, batch=1000,
                         random_state=rng_np, xp=np)  # 2.8 s ± 9.26 ms per loop 

# Same computation on the GPU with CuPy arrays
data_cp = cp.array(data_np)
rng_cp = cp.random.RandomState(0)
res_cp = stats.bootstrap((data_cp,), cp.std, batch=1000, 
                         random_state=rng_cp, xp=cp)  # 240 ms ± 518 µs per loop  
  • Computing the statistic is ~90x faster, which is more in line with what I was expecting (~1% of the total time now)
  • Generating the resamples is only ~4x faster, which is the bottleneck (~80% of the total time). Looking deeper, 97.2% of that time is the random number generation; only 2.8% is the indexing. This is a known issue: cupy.random.randint is slow (cupy/cupy#4120). Generating uniform floats with rand and multiplying/casting instead is much faster, bringing the total execution time from 240 ms to 53 ms (see the sketch after this list).
  • In this case, calculating the BCa interval is ~10x faster, but it still accounts for ~18% of the total time. Again, actually computing the statistic is very fast. There is some overhead in using the GPU for the tiny calculations, like calculating the statistic for the observed data (only) and the call to cupyx.scipy.special.ndtr. The slow part is again generating the resamples - specifically, a reshape operation in _jackknife_resample. I could probably find a more efficient way to generate the jackknife resamples on the GPU.
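
A minimal sketch of the rand-and-cast trick described above (the helper name and signature are illustrative, not SciPy internals):

import cupy as cp

def resample_indices(xp, rng, n, n_resamples):
    # Same distribution as rng.randint(0, n, size=(n_resamples, n)), but
    # drawing uniform floats and truncating to integers avoids the slow
    # CuPy randint kernel.
    return (rng.rand(n_resamples, n) * n).astype(xp.intp)

rng = cp.random.RandomState(0)
indices = resample_indices(cp, rng, n=10000, n_resamples=1000)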

No need to leave this PR open. Concept has been demonstrated. Actual implementation will depend on how SciPy decides to handle other array backends in general.
