Add option to specify maximum number of substeps in field propagator #1236

amandalund · 2024-05-14T03:32:34Z

This allows the user to set the maximum number of field propagator substeps, which can have a significant impact on performance due to improved load balancing (see the plot below for the 32 ttbar CMS run3 results with three different max_substep values: 10, our default of 100, and Geant4's default of 1000).
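To make the knob concrete, here is a minimal sketch of a capped substep loop. It is not the Celeritas propagator: `PropagationResult`, `propagate`, and the fixed `substep_len` (standing in for an adaptive integrator's accepted substep) are hypothetical names, and it assumes a track that exhausts its substeps before covering the full step is the one reported as looping.

```cpp
#include <algorithm>
#include <cassert>

// Hypothetical result of one propagation call (not a Celeritas type).
struct PropagationResult
{
    double distance;  // Path length actually traveled
    int substeps;     // Number of substeps taken
    bool looping;     // True if the cap was hit with distance remaining
};

// Advance up to `step`, taking at most `max_substeps` substeps of length
// `substep_len` each. A real integrator would choose each substep
// adaptively; a fixed length keeps the sketch short.
inline PropagationResult
propagate(double step, double substep_len, int max_substeps)
{
    assert(step > 0 && substep_len > 0 && max_substeps > 0);

    PropagationResult result{0.0, 0, false};
    while (result.distance < step && result.substeps < max_substeps)
    {
        // Never overshoot the requested step length
        result.distance += std::min(substep_len, step - result.distance);
        ++result.substeps;
    }
    // Assumption: running out of substeps early marks the track as looping
    result.looping = result.distance < step;
    return result;
}
```

With a small cap, a tightly curling track gives up after a bounded amount of work per step, so the other tracks sharing that step iteration are not kept waiting on it.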

I'm not really sure how best to safely access the user-specified max_substeps in the SimParams, which needs it to calculate the looping threshold values. In particular, through accel apps other than celer-g4 I'm not sure how to validate that the max_substeps in the setup options matches what's in the along-step factory. I'm also not entirely sure if changing this parameter has any consequence on the physics (msc?).

sethrj

I think this is fine for now, thanks @amandalund . I think to overcome this issue, and solve #538 and #983, we should have a "propagation params" that manages the various possible along-step kernels and would handle the looping parameter, which currently is used only in global/alongstep/detail/PropagationApplier.hh.
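One way to picture the "propagation params" idea is a single immutable object that every consumer reads the substep limit from, so the values cannot drift apart. The sketch below is only an illustration under that assumption: `PropagationParams`, `make_propagation_params`, `AlongStepConsumer`, and `LoopingConsumer` are hypothetical names, not existing Celeritas classes, and it says nothing about how the looping thresholds themselves are computed.

```cpp
#include <cassert>
#include <memory>
#include <utility>

// Hypothetical shared parameters: the single source of truth for the limit.
struct PropagationParams
{
    int max_substeps;
};

using SPConstPropagation = std::shared_ptr<PropagationParams const>;

// Validate once, at construction, instead of in every consumer
inline SPConstPropagation make_propagation_params(int max_substeps)
{
    assert(max_substeps > 0);
    return std::make_shared<PropagationParams>(PropagationParams{max_substeps});
}

// Stand-in for an along-step kernel launcher
class AlongStepConsumer
{
  public:
    explicit AlongStepConsumer(SPConstPropagation p) : params_(std::move(p)) {}
    int max_substeps() const { return params_->max_substeps; }

  private:
    SPConstPropagation params_;
};

// Stand-in for whatever needs the same limit to set looping thresholds
class LoopingConsumer
{
  public:
    explicit LoopingConsumer(SPConstPropagation p) : params_(std::move(p)) {}
    int max_substeps() const { return params_->max_substeps; }

  private:
    SPConstPropagation params_;
};
```

Because both consumers hold the same const object, there is nothing to cross-check between the setup options and the along-step factory.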

sethrj · 2024-05-15T12:25:23Z

Also, good work with figuring out this speedup! I wonder if we should adjust the default parameter based on this finding. I want to see whether the spikes in the time-per-step plot for CMS 2018+field disappear with this... which we have always suspected is due to looping.

amandalund · 2024-05-15T14:55:47Z

I think adding a propagation params is a good idea. It would definitely be interesting to see how changing the max substeps affects that plot... I also need to experiment with adjusting this parameter in some of our other test problems.

amandalund · 2024-05-20T22:42:36Z

@sethrj a different plot, but you can see how changing the max substeps affects the step times/variation in times. It definitely looks like those spikes are from looping tracks.

sethrj · 2024-05-20T23:19:38Z

Whoa, why does the max-substeps 10 run cut off at 6000 step iterations while the 100 run lives much longer? Is it because we're clearing track slots for the looping tracks earlier?

amandalund · 2024-05-20T23:47:36Z

It might just be a statistical fluctuation (though I was also surprised that the number of step iterations was so much higher than with 10 or 1000)... I tried running the same problem with a different seed and it only took 5900 steps.

amandalund · 2024-05-22T16:12:02Z

@sethrj a couple more observations:

I ran 100 simulations with different seeds for each of these max substep values. Most of them finish in ~5500 step iterations, but there are some outliers that take many more. This is more of a problem when the substep limit is high: with 1000 max substeps there was one instance where the total number of step iterations was over 100k, with >95% of the steps having only one active track.
It looks like this is because occasional low-energy e-/e+ tracks in low-density materials take many (thousands of) small steps but are never flagged as looping. Even though this might be relatively uncommon, especially with a lower max substeps, we may still want a strategy for killing (or un-sticking?) these tracks.

The last 20 steps of one of these tracks:

track 16419589, vol 1985, mat 406, particle 1, energy 3.9923808482957686e-02, step 6949, looping 0, step length 1.3858658842098418e-02
track 16419589, vol 2751, mat 381, particle 1, energy 3.9850266767637074e-02, step 6950, looping 0, step length 4.7244629498538651e-02
track 16419589, vol 1985, mat 406, particle 1, energy 3.9814529158514730e-02, step 6951, looping 0, step length 1.8629256656536680e-01
track 16419589, vol 2751, mat 381, particle 1, energy 3.8257980851032464e-02, step 6952, looping 0, step length 4.9934856957369343e-02
track 16419589, vol 1985, mat 406, particle 1, energy 3.7893562258098423e-02, step 6953, looping 0, step length 7.3445125203530637e-01
track 16419589, vol 2751, mat 381, particle 1, energy 3.1588001694221136e-02, step 6954, looping 0, step length 3.0593442567106464e-02
track 16419589, vol 1985, mat 406, particle 1, energy 3.1159181280340318e-02, step 6955, looping 0, step length 3.0731719856365525e-01
track 16419589, vol 2751, mat 381, particle 1, energy 2.8875578978385202e-02, step 6956, looping 0, step length 9.8504012780081442e-03
track 16419589, vol 1985, mat 406, particle 1, energy 2.8764161298133990e-02, step 6957, looping 0, step length 3.1567462632067628e-01
track 16419589, vol 2752, mat 381, particle 1, energy 2.6652691197892056e-02, step 6958, looping 0, step length 1.0000004250248819e-06
track 16419589, vol 2752, mat 381, particle 1, energy 2.6652682076712491e-02, step 6959, looping 0, step length 2.1192793848139056e-13
track 16419589, vol 1985, mat 406, particle 1, energy 2.6652682076710559e-02, step 6960, looping 0, step length 2.0257033488857515e-01
track 16419589, vol 2751, mat 381, particle 1, energy 2.3507798832861856e-02, step 6961, looping 0, step length 1.4031100172018471e-01
track 16419589, vol 2751, mat 381, particle 1, energy 2.2150049606347002e-02, step 6962, looping 0, step length 1.7791966459317970e-01
track 16419589, vol 2751, mat 381, particle 1, energy 2.0742165110699450e-02, step 6963, looping 0, step length 2.1894447090401701e-02
track 16419589, vol 2751, mat 381, particle 1, energy 1.8300777647732802e-02, step 6964, looping 0, step length 5.7666532505831737e-02
track 16419589, vol 2751, mat 381, particle 1, energy 1.6257806981944168e-02, step 6965, looping 0, step length 1.7311275991309827e-01
track 16419589, vol 2751, mat 381, particle 1, energy 1.3513392719547898e-02, step 6966, looping 0, step length 1.3732918277251066e-01
track 16419589, vol 2751, mat 381, particle 1, energy 9.6172212560103210e-03, step 6967, looping 0, step length 1.6222676351235218e-01
track 16419589, vol 2751, mat 381, particle 1, energy 3.9073150446839004e-03, step 6968, looping 0, step length 4.9231860431891476e-02

sethrj · 2024-05-23T06:30:54Z

Damn, it's looping between multiple volumes... maybe there's some heuristic with average step length/average energy loss per step that could be used...
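A rough sketch of what such a heuristic could look like: keep running sums per track and flag it once it has taken many steps while both the mean step length and the mean energy loss per step stay small. Everything here is an assumption for illustration: `StuckTrackMonitor` and its thresholds are hypothetical and untuned, and whether killing a flagged track is acceptable for the physics is not addressed.

```cpp
#include <cassert>

// Hypothetical per-track monitor; not part of Celeritas.
class StuckTrackMonitor
{
  public:
    // min_steps: steps to accumulate before a track may be flagged
    // min_mean_length: mean step length below which the track is suspect
    // min_mean_eloss: mean energy loss per step below which it is suspect
    StuckTrackMonitor(int min_steps, double min_mean_length, double min_mean_eloss)
        : min_steps_(min_steps)
        , min_mean_length_(min_mean_length)
        , min_mean_eloss_(min_mean_eloss)
    {
        assert(min_steps_ > 0);
    }

    // Accumulate one completed step
    void record(double step_length, double energy_loss)
    {
        assert(step_length >= 0 && energy_loss >= 0);
        ++num_steps_;
        sum_length_ += step_length;
        sum_eloss_ += energy_loss;
    }

    // Flag only when the track is both barely moving and barely losing energy
    bool is_stuck() const
    {
        if (num_steps_ < min_steps_)
        {
            return false;
        }
        double mean_length = sum_length_ / num_steps_;
        double mean_eloss = sum_eloss_ / num_steps_;
        return mean_length < min_mean_length_ && mean_eloss < min_mean_eloss_;
    }

  private:
    int min_steps_;
    double min_mean_length_;
    double min_mean_eloss_;
    int num_steps_{0};
    double sum_length_{0};
    double sum_eloss_{0};
};
```

Requiring both conditions is a deliberately conservative choice: a track taking short steps while steadily losing energy will stop on its own, so only the combination of little progress and little energy loss is treated as suspicious.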

Add option to specify max substeps in field propagator

d80dcf9

amandalund added the enhancement, field, and performance labels May 14, 2024

amandalund requested a review from sethrj May 14, 2024 03:32

sethrj approved these changes May 15, 2024

Merge branch 'develop' into field-max-substeps

02d6fc8

sethrj merged commit 567576d into celeritas-project:develop May 15, 2024
27 checks passed

amandalund deleted the field-max-substeps branch May 15, 2024 15:40

amandalund mentioned this pull request Aug 30, 2024

Reduce default maximum substeps in field propagator #1384

Merged
