feat: use a very large coil dimension for 2D stacked NUFFT #39
Conversation
Alright, some simple timing (H2D2H) reports the following on a Quadro P5000 with the data from the
Overall, that's a 2x speedup on the forward + adjoint step (i.e. one data-consistency step). It can be made faster by avoiding the round trip through host memory in the middle, and faster still once cufinufft supports asynchronous copies and computation. The stacked 2D NUFFT on GPU also has a lower memory footprint (the oversampled grid is only 4 times bigger, not 8 times), which could make it possible to run multiple operators together (think fMRI).
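The 4x vs. 8x figure follows from the usual NUFFT oversampling factor of 2 applied per gridded axis: a stacked 2D operator only oversamples the two in-plane axes, while a full 3D operator oversamples all three. A quick sketch of that arithmetic (assuming sigma = 2, the common default):

```python
# Oversampled-grid growth: 2D stacked vs. full 3D NUFFT.
# With oversampling factor sigma, each gridded axis is enlarged by sigma,
# so the grid grows by sigma**ndim relative to the image volume.
sigma = 2  # assumed default oversampling factor

grid_growth_2d = sigma ** 2  # stacked 2D: only x and y are oversampled
grid_growth_3d = sigma ** 3  # full 3D: x, y and z are all oversampled

print(grid_growth_2d)  # 4
print(grid_growth_3d)  # 8
```

The z axis of a stacked trajectory is Cartesian, so it needs no oversampled grid at all, which is where the factor-of-2 memory saving comes from.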
This PR improves the stacked NUFFT by leveraging the coil dimension of the 2D operator.
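The idea can be sketched as folding the z-slices into the coil/batch axis, so the 2D backend sees one very large batched transform instead of a Python loop over slices. This is a minimal NumPy sketch of the data movement only; `nufft2d_batched` is a hypothetical stand-in for the backend call, not an actual API:

```python
import numpy as np

# Assumed shapes: (n_coils, n_z, nx, ny) image-domain data for a
# stack-of-2D trajectory (Cartesian along z, non-Cartesian in-plane).
n_coils, n_z, nx, ny = 4, 16, 32, 32
volume = np.random.randn(n_coils, n_z, nx, ny).astype(np.complex64)

# 1) The z axis is Cartesian, so a plain FFT handles it.
kz = np.fft.fft(volume, axis=1, norm="ortho")

# 2) Fold coils and slices into one large "coil" batch, so the 2D
#    NUFFT backend processes n_coils * n_z transforms in a single call.
batched = kz.reshape(n_coils * n_z, nx, ny)

# (hypothetical) ksp = nufft2d_batched(batched, samples)
# 3) Afterwards, unfold back to (n_coils, n_z, n_samples).

print(batched.shape)  # (64, 32, 32)
```

One large batched call amortizes plan setup and host/device transfers across all slices, which is where the speedup over a per-slice loop comes from.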
For GPU backends that perform asynchronous copies of coil data (only gpuNUFFT for now; cufinufft needs a few upstream PRs to be merged), this is great news and potentially leads to further speedups.
I still need to set up a rudimentary benchmark to compare the two.
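A rudimentary benchmark could look like the harness below. It assumes the operators expose `op` / `adj_op` methods as mri-nufft Fourier operators do; the `FFTOp` class is a runnable stand-in, not one of the operators under test. Note that on GPU you must synchronize the device before stopping the timer, or the measured time only covers kernel launch:

```python
import time
import numpy as np

def time_fwd_adj(operator, image, n_runs=10):
    """Median wall-clock time of one forward + adjoint application
    (one data-consistency step). Assumes a .op / .adj_op interface."""
    times = []
    for _ in range(n_runs):
        start = time.perf_counter()
        ksp = operator.op(image)       # forward: image -> k-space
        _ = operator.adj_op(ksp)       # adjoint: k-space -> image
        times.append(time.perf_counter() - start)
    return sorted(times)[len(times) // 2]

class FFTOp:
    """Stand-in operator so the sketch runs without a NUFFT backend."""
    def op(self, x):
        return np.fft.fftn(x, norm="ortho")
    def adj_op(self, y):
        return np.fft.ifftn(y, norm="ortho")

image = np.random.randn(64, 64).astype(np.complex64)
t = time_fwd_adj(FFTOp(), image)
print(t > 0)  # True
```

Running the same harness on the old and new stacked operators with identical input data would give the forward + adjoint comparison directly.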