Improved sequence unrolling performance #79

Toreil · 2025-02-24T12:12:11Z

Rewrite of the way the sequence is unrolled with a substantial improvement in performance. Also appears to have addressed an issue where. depending on resolution/field of view, the entire sequence would be executed instantaneously.

…o be tested in imaging to ensure there's no unexpected changes.

…ility

…rors.

…er of block samples to avoid timing errors

Fixes deblanking IO signal not being transmitted

Fixes improper handling of rf ring down time

Shims were added to the gradient waveform but also set as offsets on the card level, meaning shim levels were double what they should be. Shims are no longer added to the gradient waveform and now use the card offsets exclusively. This has the added benefit of hugely increasing sequence unrolling speed.

Removes unnecessary usage of SimpleNamespace for information about ADC events, just returns the ADC gate signal and reference waveform now. Also addressed some linting issues

Fixed gradient limit calculations. Now correctly throw error if gradient waveform exceeds limits and a seperate error if the combined waveform and shim offset exceeds limits. Also fixed linting errors

Address linting errors

github-actions · 2025-02-24T12:13:21Z

Coverage Report

File	Stmts	Miss	Cover	Missing
src/console/interfaces
acquisition_data.py	120	24	80%	127–132, 141–142, 148–149, 152–153, 157–161, 176–177, 188–193, 199, 245
acquisition_parameter.py	63	4	94%	77, 92, 115, 143
src/console/pulseq_interpreter
sequence_provider.py	256	44	83%	89, 91, 93, 95–97, 113, 143, 148, 151–153, 185–188, 216–219, 233–234, 274, 298, 322, 331–334, 342–344, 451, 457, 462–466, 492, 522–526, 546, 616–617
src/console/spcm_control
abstract_device.py	63	63	0%	2–119
acquisition_control.py	160	160	0%	3–373
rx_device.py	200	200	0%	2–474
tx_device.py	212	212	0%	2–509
src/console/spcm_control/spcm
pyspcm.py	143	143	0%	3–276
tools.py	52	52	0%	3–122
src/console/utilities
json_encoder.py	8	3	62%	22–24
load_config.py	27	27	0%	2–119
plot.py	17	17	0%	2–26
src/console/utilities/sequences/calibration
fid_tx_adjust.py	18	1	94%	54
se_tx_adjust.py	25	2	92%	56–57
src/console/utilities/sequences/spectrometry
fid.py	15	15	0%	2–61
se_spectrum.py	24	2	92%	42, 65
src/console/utilities/sequences/tse
tse_3d.py	243	82	66%	109–110, 112–113, 115–116, 123–124, 126–128, 130–133, 136–147, 152–155, 162–173, 190, 212–219, 225, 254, 317–334, 341–342, 407, 410–422, 440, 442, 531–541
TOTAL	4891	1051	79%

Tests	Skipped	Failures	Errors	Time
209	0 💤	0 ❌	0 🔥	9.806s ⏱️

schote

Thank you for the PR @Toreil! I added some comments to the code, have you been able to verify the sequence calculation updates? Maybe a direct comparison of the unrolled sequence waveforms from the old implementation vs. the new implementation is a good test.

src/console/pulseq_interpreter/sequence_provider.py

schote · 2025-02-24T16:46:56Z

Were you able to find out what caused the problem you are describing?

Toreil · 2025-02-25T08:03:19Z

Were you able to find out what caused the problem you are describing?

Not specifically. I saw the same thing happen when I did the bit shift for the acquisition gate signals in the wrong direction which essentially caused the gate to be high continuously, so whatever the bug was in the old code it must've been causing that gate signal to be high.

The other time I saw incorrect acquisition gating was with the fix I pushed previously which was when trying to replicate the control signals on the extra GPIO cards when those wernt installed. It would cause the TX card (process?) to crash but it would have jitter on the output of the GPIO pins on the card so the Rx card would just randomly acquire a few samples here and there. Very different behaviour, so I think it was related to the gate signal, not something else.

Toreil · 2025-02-26T11:26:50Z

Thank you for the PR @Toreil! I added some comments to the code, have you been able to verify the sequence calculation updates? Maybe a direct comparison of the unrolled sequence waveforms from the old implementation vs. the new implementation is a good test.

Yeah I did a direct comparison both on the bench and with imaging experiments and performance was identical. As far as I can tell merge is ready to go (I don't understand the failed tests, I ran them locally and there was no problem). I also tested seperately on Python 3.12 and confirm it's all working so the dependencies can be updated to reflect that.

schote · 2025-02-26T12:09:14Z

Thank you for the PR @Toreil! I added some comments to the code, have you been able to verify the sequence calculation updates? Maybe a direct comparison of the unrolled sequence waveforms from the old implementation vs. the new implementation is a good test.

Yeah I did a direct comparison both on the bench and with imaging experiments and performance was identical. As far as I can tell merge is ready to go (I don't understand the failed tests, I ran them locally and there was no problem). I also tested seperately on Python 3.12 and confirm it's all working so the dependencies can be updated to reflect that.

That is great, but I think it would be good to have an objective measure. For example. by unrolling a reference sequence with the current and the updated version of the sequence provider and reporting the sum of absolute difference between the resulting arrays (should be 0). I know this causes some extra work, but we have had some bugs in the past because we did not check changes carefully. I will have a look at the pytest error.
Regarding the static tests, mypy throws some errors. You should be able to see them by running mypy src/ in the project root. The CI which executes the test always uses a fresh install of the package. It could be, that there is a version missmatch. Let me know, if you need further support with this.

The phase of RF pulses is set relative to time since the first RF pulse of the sequence. This commit fixes an issue where that time was not being considered properly.

Toreil · 2025-03-04T12:24:38Z

That is great, but I think it would be good to have an objective measure. For example. by unrolling a reference sequence with the current and the updated version of the sequence provider and reporting the sum of absolute difference between the resulting arrays (should be 0). I know this causes some extra work, but we have had some bugs in the past because we did not check changes carefully. I will have a look at the pytest error. Regarding the static tests, mypy throws some errors. You should be able to see them by running mypy src/ in the project root. The CI which executes the test always uses a fresh install of the package. It could be, that there is a version missmatch. Let me know, if you need further support with this.

I ran a set of comparisons using the example sequences in the examples folder since they cover all basic functions of Pulseq. There are/were small differences but they're all attributable to rounding of the number of samples (which for shaped RF pulses has an outsized visual effect due to using resample which depends on the number of samples). The fast-unroll uses round to cast to int, whereas the main branch casts to int which always rounds down. Typically this manifests as a 20 us delay being 400 samples on the fast-unroll and 399 on the main branch of the sequence unroll for instance. I propose that since the small changes are due to round being used and round gives a more accurate number of samples the small differences are not detrimental (perhaps even an improvement) and this branch is ready to merge.

schote · 2025-03-05T08:35:14Z

That is great, but I think it would be good to have an objective measure. For example. by unrolling a reference sequence with the current and the updated version of the sequence provider and reporting the sum of absolute difference between the resulting arrays (should be 0). I know this causes some extra work, but we have had some bugs in the past because we did not check changes carefully. I will have a look at the pytest error. Regarding the static tests, mypy throws some errors. You should be able to see them by running mypy src/ in the project root. The CI which executes the test always uses a fresh install of the package. It could be, that there is a version missmatch. Let me know, if you need further support with this.

I ran a set of comparisons using the example sequences in the examples folder since they cover all basic functions of Pulseq. There are/were small differences but they're all attributable to rounding of the number of samples (which for shaped RF pulses has an outsized visual effect due to using resample which depends on the number of samples). The fast-unroll uses round to cast to int, whereas the main branch casts to int which always rounds down. Typically this manifests as a 20 us delay being 400 samples on the fast-unroll and 399 on the main branch of the sequence unroll for instance. I propose that since the small changes are due to round being used and round gives a more accurate number of samples the small differences are not detrimental (perhaps even an improvement) and this branch is ready to merge.

Could you share some of these results then? Maybe just post some magnitude/phase images of the different states here for documentation, that would be great!
I'll have a look at the remaining errors in pytest and the static tests.

berksilemek · 2025-03-05T23:37:31Z

Hi all,

Do not have the full story here. The conversation triggered me: if int or rounding errors causes some phase issues that I have not been able to solve at issue #66 , which was apparent at 3T experiments as well. Have you tested if Decimal works (worse calculation performance compared to round or int but maybe the most stable for imaging) ? I implemented in Rx device as follows:

# Calculate gate duration
gate_length = Decimal(str(timestamp_1)) - Decimal(str(timestamp_0))

# Calculate the number of adc gate sample points (per channel)
gate_sample = int(round(gate_length * (Decimal(str(self.sample_rate)) * Decimal("1e6"))))

This fixes small rounding errors for delay and phase instabilities as mentioned above.

schote · 2025-03-06T08:47:01Z

Hi all,

Do not have the full story here. The conversation triggered me: if int or rounding errors causes some phase issues that I have not been able to solve at issue #66 , which was apparent at 3T experiments as well. Have you tested if Decimal works (worse calculation performance compared to round or int but maybe the most stable for imaging) ? I implemented in Rx device as follows:
# Calculate gate duration
gate_length = Decimal(str(timestamp_1)) - Decimal(str(timestamp_0))

# Calculate the number of adc gate sample points (per channel)
gate_sample = int(round(gate_length * (Decimal(str(self.sample_rate)) * Decimal("1e6"))))   
This fixes small rounding errors for delay and phase instabilities as mentioned above.

Hi Berk,

thanks for your input on this, I think this is more about convention when unrolling the sequence.
Instead of truncating the calculated number of sample points by int(...), the use of round yields an extra sample point for some cases. However, because the waveforms are interpolated anyway, this does not matter. This is about a one data point on a 50 ns sampling grid.

Thank you for the readme update! Please use a seperate PR (even for such small changes) in the future.

…tions

Toreil · 2025-03-06T13:27:25Z

Comparison of gradient waveforms with the latest fix showing (essentially) identical waveforms, small difference are caused by different rounding of gradient event durations.

LUMC-LowFieldMRI and others added 13 commits December 10, 2024 12:30

Updated description of possible k trajectories in the 3D TSE

5b92c76

Substantially improved performance of sequence unrolling, still has t…

e49247c

…o be tested in imaging to ensure there's no unexpected changes.

Switched to checking block events in unrolling to improve code readab…

6aa74b7

…ility

Shim offsets now included in gradient waveforms to reduce rounding er…

5fb5d2b

…rors.

Added check to see if the number of sequence samples matches the numb…

bb2a41b

…er of block samples to avoid timing errors

Update sequence_provider.py

c076248

Fixes deblanking IO signal not being transmitted

Update sequence_provider.py

5eb02df

Fixes improper handling of rf ring down time

Removes use of SimpleNamespacefor ADC event list

5bca1a3

Removes unnecessary usage of SimpleNamespace for information about ADC events, just returns the ADC gate signal and reference waveform now. Also addressed some linting issues

Fixed gradient limit calculations with shim offsets

7f78b15

Fixed gradient limit calculations. Now correctly throw error if gradient waveform exceeds limits and a seperate error if the combined waveform and shim offset exceeds limits. Also fixed linting errors

Address linting errors

aecc1e8

Address linting errors

Address linting errors

a66e64f

Addresses linting issues

5a0750d

Toreil added the priority: medium label Feb 24, 2025

Toreil assigned schote Feb 24, 2025

schote assigned Toreil and unassigned schote Feb 24, 2025

schote added this to the System Stability, Robustness and Performance milestone Feb 24, 2025

schote requested changes Feb 24, 2025

View reviewed changes

Toreil added 2 commits February 25, 2025 09:59

Added extra checks and comments

0494711

Addresses linting and test findings

75a9593

Toreil requested a review from schote February 25, 2025 10:46

schote linked an issue Mar 3, 2025 that may be closed by this pull request

Memory usage increases substantially after unrolling the sequence, when starting the acquisition. #64

Open

Fixed phase offsets for RF pulses

d860e31

The phase of RF pulses is set relative to time since the first RF pulse of the sequence. This commit fixes an issue where that time was not being considered properly.

schote added 4 commits March 5, 2025 12:49

Fixed linter findings, added variables for indexing

bb20b3d

Fixed concat block_pos

ee8ceb4

Fixed check for cached sequence

86858a7

Fixed tests with latest ruff/mypy versions

239da5e

schote approved these changes Mar 5, 2025

View reviewed changes

berksilemek added 3 commits March 5, 2025 16:13

Added publication reference and acknowladgements to the readme file

7467090

fixed typo at the latest commit in readme file

9d4db7e

Fixed to full author list and added link to the article.

0c5968d

Toreil added 2 commits March 6, 2025 14:15

Fixes error when two simultanious gradient events have different dura…

c97590c

…tions

Address linting errors

0ce0284

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improved sequence unrolling performance #79

Improved sequence unrolling performance #79

Toreil commented Feb 24, 2025

github-actions bot commented Feb 24, 2025 •

edited

Loading

schote left a comment

schote commented Feb 24, 2025

Toreil commented Feb 25, 2025

Toreil commented Feb 26, 2025

schote commented Feb 26, 2025 •

edited

Loading

Toreil commented Mar 4, 2025

schote commented Mar 5, 2025

berksilemek commented Mar 5, 2025 •

edited

Loading

schote commented Mar 6, 2025

Toreil commented Mar 6, 2025

Improved sequence unrolling performance #79

Are you sure you want to change the base?

Improved sequence unrolling performance #79

Conversation

Toreil commented Feb 24, 2025

github-actions bot commented Feb 24, 2025 • edited Loading

schote left a comment

Choose a reason for hiding this comment

schote commented Feb 24, 2025

Toreil commented Feb 25, 2025

Toreil commented Feb 26, 2025

schote commented Feb 26, 2025 • edited Loading

Toreil commented Mar 4, 2025

schote commented Mar 5, 2025

berksilemek commented Mar 5, 2025 • edited Loading

schote commented Mar 6, 2025

Toreil commented Mar 6, 2025

github-actions bot commented Feb 24, 2025 •

edited

Loading

schote commented Feb 26, 2025 •

edited

Loading

berksilemek commented Mar 5, 2025 •

edited

Loading