Attempt cache thrashing fix for low synapse count vector access #3113

otcathatsya · 2024-02-24T01:15:33Z

This is an attempt to fix an observed niche issue where, on a C++ level, connection creation will be slowed down by the num_connections_ vector accessing inner indices on the same cache line and therefore invalidating memory across threads, usually when there is only a few synapse models (e.g. when running with -Dwith-modelset=iaf_minimal)
Occurs mostly commonly with jemalloc:

Allocations are packed tightly together, which can be an issue for multi-threaded applications. If you need to assure that allocations do not suffer from cacheline sharing, round your allocation requests up to the nearest multiple of the cacheline size, or specify cacheline alignment when allocating.

Very much experimental, this fixes the time_construction_connect difference in SLI tests but makes no difference on a Python level (neither did the problem occur when timing from Python to begin with. Too insignificant?)
This does increase memory consumption up to a certain point (the L1 cache line usually is 64 bytes), but stays below full modelset levels; it is actually lower when running the full modelset with this alignment.
Maybe the destructive interference constexprs can be useful for something else further down the line
If this does get considered it should probably be amended to include -Wno-interference-size to suppress the compile warnings about this value varying between platforms, or be an optional CMake flag altogether
To-do: find a way to apply the destructive interference offset to the outer vector for more memory savings

github-actions · 2024-04-26T08:33:59Z

Pull request automatically marked stale!

terhorstd · 2024-09-09T10:31:23Z

@suku248 and @heplesser want to be in the loop for this.

Attempt cache thrashing fix

d55d1aa

otcathatsya requested review from heplesser and terhorstd February 24, 2024 01:15

otcathatsya added T: Discussion Still searching for the right way to proceed / suggestions welcome S: Normal Handle this with default priority I: No breaking change Previously written code will work as before, no one should note anything changing (aside the fix) labels Feb 24, 2024

otcathatsya added 2 commits February 24, 2024 19:41

Run clang-format

720e9d9

Add feature version test

46d21cf

github-actions bot added the stale Automatic marker for inactivity, please have another look here label Apr 26, 2024

terhorstd requested a review from gtrensch September 9, 2024 10:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Attempt cache thrashing fix for low synapse count vector access #3113

Attempt cache thrashing fix for low synapse count vector access #3113

otcathatsya commented Feb 24, 2024 •

edited

Loading

github-actions bot commented Apr 26, 2024

terhorstd commented Sep 9, 2024

Attempt cache thrashing fix for low synapse count vector access #3113

Are you sure you want to change the base?

Attempt cache thrashing fix for low synapse count vector access #3113

Conversation

otcathatsya commented Feb 24, 2024 • edited Loading

github-actions bot commented Apr 26, 2024

terhorstd commented Sep 9, 2024

otcathatsya commented Feb 24, 2024 •

edited

Loading