Trying to speed up STIR acquisition data algebra by avoiding unnecessary 0-fill of newly created data. #1549

evgueni-ovtchinnikov · 2024-12-10T19:10:19Z

Changes in this pull request

Unnecessary 0-fill of newly created acquisition data reduced.

Testing performed

The following SIRF Python script (employing STIR via SIRF Python interface) was used for comparing STIR's ProjDataInMemory algebra performance with numpy:

import sirf.STIR as pet
pet.AcquisitionData.set_storage_scheme('memory')

acq = pet.AcquisitionData('prompts.hs')
x = acq.clone()
y = x*2

start = timeit.default_timer()
z = x + y
elapsed = timeit.default_timer() - start
print(f'z = x + y time {elapsed}')

start = timeit.default_timer()
x_arr = x.as_array()
y_arr = y.as_array()
elapsed = timeit.default_timer() - start
print(f'as_array time {elapsed}')
numpy_time = elapsed

start = timeit.default_timer()
z_arr = x_arr + y_arr
elapsed = timeit.default_timer() - start
print(f'numpy z = x + y time {elapsed}')
numpy_time += elapsed

start = timeit.default_timer()
z.fill(z_arr)
elapsed = timeit.default_timer() - start
print(f'fill time {elapsed}')
numpy_time += elapsed
print(f'total numpy z = x + y time {numpy_time}')

With a recent STIR, the output looks like

z = x + y time 0.5256878180002786
as_array time 0.23524911599997722
numpy z = x + y time 0.1330225659999087
fill time 0.07107986500022889
total numpy z = x + y time 0.4393515470001148

demonstrating that STIR's ProjDataInMemory algebra is much less efficient than numpy algebra applied to ProjDataInMemory data copied into numpy array. The investigation of this issue revealed that a considerable chunk of the computation time was wasted on zero-filling the newly created ProjDataInMemory object where x + y data is to end up.

This PR succeeded in reducing such waste, so that the timings have become

z = x + y time 0.24671401699015405
as_array time 0.23049703100696206
numpy z = x + y time 0.13044680299935862
fill time 0.0659959859913215
total numpy z = x + y time 0.4269398199976422

Related issues

Checklist before requesting a review

[] I have performed a self-review of my code
[] I have added docstrings/doxygen in line with the guidance in the developer guide
[] I have implemented unit tests that cover any new or modified functionality (if applicable)
The code builds and runs on my machine
[] documentation/release_XXX.md has been updated with any functionality change (if applicable)

KrisThielemans · 2024-12-11T08:48:49Z

Ubuntu jobs fail, see #1550

MacOS is unaffected by that, but has

ERROR: NumericVectorWithOffset::xapyb: index ranges don't match

(note that this will be seen in Debug mode only. Otherwise it will likely lead to a segfault)

KrisThielemans

While this would work, it could be a bit surprising to the user, as the effect of set_initialise_with_zeros() would be permanent. In the use-case that you have in mind, that'd be fine, but it'd have to come with serious warnings in the doxygen.

I'm rather in favour of having an explicit bool argument to grow and resize (defaulting to true). We could then have that for the constructor as well.

src/include/stir/Array.h

src/buildblock/ProjDataInMemory.cxx

KrisThielemans · 2024-12-11T12:10:31Z

Please also use pre-commit, see https://github.com/UCL/STIR/blob/master/documentation/devel/git-hooks.md

KrisThielemans · 2024-12-11T12:46:39Z

No point in pushing until I've sorted out #1552, which you'll have to merge here.

KrisThielemans · 2024-12-12T11:43:02Z

#1552 is now merged, so merge master back on this branch.

KrisThielemans · 2024-12-19T18:20:43Z

I'm rather in favour of having an explicit bool argument to grow and resize (defaulting to true). We could then have that for the constructor as well.

I still think this would be a better option. @danieldeidda @markus-jehl what do you think?

markus-jehl · 2024-12-20T09:00:36Z

I tend to agree with @KrisThielemans, that explicitly stating it may be cleaner rather than relying on an internal setting on individual instances of the Array class. Just because I can imagine running into weird issues and struggling for a while before noticing what is going on 😅

evgueni-ovtchinnikov added 2 commits December 9, 2024 19:28

employed initialise_with_0 flag to reduce unnecessary 0-fill

3b2b84c

corrected the default value of init_with_zeros_ in Array.h

377020b

KrisThielemans mentioned this pull request Dec 11, 2024

GitHub Actions fail, probably due to new Ubuntu 24.04 #1550

Closed

KrisThielemans requested changes Dec 11, 2024

View reviewed changes

src/include/stir/Array.h Outdated Show resolved Hide resolved

src/buildblock/ProjDataInMemory.cxx Outdated Show resolved Hide resolved

corrected ProjDataInMemory::create_buffer (size->max_index)

7b416af

evgueni-ovtchinnikov added 3 commits December 12, 2024 12:17

Merge branch 'master' into try-fix-algebra

ac31b6a

followed pre-commit suggestions

c3b25d6

adopted the reviewer's suggestions

3ff116d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Trying to speed up STIR acquisition data algebra by avoiding unnecessary 0-fill of newly created data. #1549

Trying to speed up STIR acquisition data algebra by avoiding unnecessary 0-fill of newly created data. #1549

evgueni-ovtchinnikov commented Dec 10, 2024

KrisThielemans commented Dec 11, 2024

KrisThielemans left a comment

KrisThielemans commented Dec 11, 2024

KrisThielemans commented Dec 11, 2024

KrisThielemans commented Dec 12, 2024

KrisThielemans commented Dec 19, 2024

markus-jehl commented Dec 20, 2024

Trying to speed up STIR acquisition data algebra by avoiding unnecessary 0-fill of newly created data. #1549

Are you sure you want to change the base?

Trying to speed up STIR acquisition data algebra by avoiding unnecessary 0-fill of newly created data. #1549

Conversation

evgueni-ovtchinnikov commented Dec 10, 2024

Changes in this pull request

Testing performed

Related issues

Checklist before requesting a review

KrisThielemans commented Dec 11, 2024

KrisThielemans left a comment

Choose a reason for hiding this comment

KrisThielemans commented Dec 11, 2024

KrisThielemans commented Dec 11, 2024

KrisThielemans commented Dec 12, 2024

KrisThielemans commented Dec 19, 2024

markus-jehl commented Dec 20, 2024