Parallelisation #204

ThomasASmith · 2018-07-11T12:31:44Z

Parallelisation of C++ code is more straightforward than it used to be, and Monica plans to address this at some point.

dhardy · 2018-07-11T15:01:40Z

We had quite a bit of discussion on various ways of having parallel RNGs and keeping results reproducible in Rust: rust-random/rand#399

Yes, parallelisation ~~is not~~ does not have to be especially hard (with the right tools).

For OpenMalaria I would suggest replacing the random number generator first (another topic we discussed for the Rust project) and then considering the parallelisation model. There are reasonable-quality PRNGs requiring only 8-16 bytes of state (whereas Mersenne Twister uses roughly 2500 bytes). This would allow storing a PRNG state per human agent, seeded at birth, so that human updates can run in parallel while producing exactly the same results as a serial run (i.e. this can be used for verification).
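To illustrate the per-agent idea, here is a minimal sketch, not OpenMalaria code. It assumes SplitMix64 as a stand-in for "a PRNG with 8 bytes of state"; `Human`, `make_population`, `update_all` and `run` are hypothetical names. Because each agent only ever draws from its own generator, the result does not depend on the order (or thread) in which agents are updated. The OpenMP pragma takes effect when compiling with OpenMP enabled (e.g. `-fopenmp` on GCC or Clang); otherwise it is ignored and the loop runs serially.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// SplitMix64: a small PRNG whose entire state is one 64-bit word.
// Used here only as a stand-in; the choice of generator is an assumption.
struct SplitMix64 {
    std::uint64_t state;
    std::uint64_t next() {
        std::uint64_t z = (state += 0x9E3779B97F4A7C15ULL);
        z = (z ^ (z >> 30)) * 0xBF58476D1CE4E5B9ULL;
        z = (z ^ (z >> 27)) * 0x94D049BB133111EBULL;
        return z ^ (z >> 31);
    }
};
static_assert(sizeof(SplitMix64) == 8, "generator state fits in 8 bytes");

// Hypothetical agent: owns its generator, which is seeded once at "birth".
struct Human {
    SplitMix64 rng;
    std::uint64_t value;  // stand-in for the agent's model state
    void update() { value ^= rng.next(); }  // touches only this agent's data
};

// Seed every agent's generator from a single master generator.
std::vector<Human> make_population(std::size_t n, std::uint64_t master_seed) {
    SplitMix64 master{master_seed};
    std::vector<Human> pop;
    pop.reserve(n);
    for (std::size_t i = 0; i < n; ++i)
        pop.push_back(Human{SplitMix64{master.next()}, 0});
    return pop;
}

// One time step. Iterations are independent, so they may run in any order.
void update_all(std::vector<Human>& pop) {
    const long n = static_cast<long>(pop.size());
    #pragma omp parallel for
    for (long i = 0; i < n; ++i)
        pop[static_cast<std::size_t>(i)].update();
}

void run(std::vector<Human>& pop, int steps) {
    assert(steps >= 0);
    for (int s = 0; s < steps; ++s) update_all(pop);
}
```

Two populations built from the same master seed should end in identical states whether updated serially, in reverse order, or across threads, which is what makes this usable as a verification check.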

dhardy · 2018-12-10T17:02:41Z

We discussed this today. There is some interest in getting this working, but it is not considered high-priority since experiments can already be parallelised by running multiple scenarios simultaneously.

Potentially we could forfeit reproducibility when running in parallel, though I do not think we need to.

@Monica-Golumbeanu doesn't currently have time to work on this but suggests using OpenMP.

dhardy · 2019-10-10T16:14:56Z

PR #255 has solved the RNG issues (allocating each human agent its own generator seeded from a high-quality master with no observable performance impact), but other issues remain.

Most notably, the monitoring code works by reporting events into a global table, which requires some form of thread protection. Unfortunately, each option has a cost:

- Use atomic variables in the table. Unfortunately `vector<atomic<..>>` is awkward to use (atomics are neither copyable nor movable, so such a vector cannot be resized), and atomic arithmetic on floating-point types is not available until C++20, which isn't even standardised yet. The performance impact is unknown.
- Use a mutex to lock the entire table when reporting. This likely has a significant performance cost (though it can be minimised when running single-threaded).
- Use thread-local tables and accumulate them at the end of each step or of the simulation. This has a memory cost, a run-time cost and a code complexity cost.
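As a rough illustration of the third option (thread-local tables), here is a minimal sketch using OpenMP. It is not the OpenMalaria monitoring code: `Table`, `report_all` and the idea of an event being just a cell index are assumptions made for the example. Each thread fills a private copy of the table, and only the final merge is serialised. Without OpenMP enabled the pragmas are ignored and the function runs serially with the same result.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the global monitoring table:
// one running total per reported measure.
using Table = std::vector<double>;

// Report every event into `table`. Each event is reduced here to the index
// of the cell it increments (an assumption made for this sketch).
void report_all(Table& table, const std::vector<std::size_t>& event_cell) {
    const long n = static_cast<long>(event_cell.size());
    #pragma omp parallel
    {
        // Private table per thread: no locking needed while reporting.
        Table local(table.size(), 0.0);

        #pragma omp for nowait
        for (long i = 0; i < n; ++i) {
            const std::size_t cell = event_cell[static_cast<std::size_t>(i)];
            assert(cell < local.size());
            local[cell] += 1.0;
        }

        // Merge step: one thread at a time adds its totals to the shared table.
        #pragma omp critical
        for (std::size_t k = 0; k < table.size(); ++k)
            table[k] += local[k];
    }
}
```

The memory cost is one extra table per thread, and the merge is the added run-time cost. One caveat with this sketch: threads merge in an unspecified order, so for non-integer floating-point values the rounding of the totals may vary between runs; merging the per-thread tables in a fixed order would avoid that.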

At this point I question the value of further work towards parallelisation given that most users are more interested in throughput performance (simulating many scenarios simultaneously).

ThomasASmith added the 2: medium label Jul 11, 2018

Monica-Golumbeanu self-assigned this Jul 19, 2018

dhardy self-assigned this Dec 14, 2018

This was referenced Dec 14, 2018

Replace random number generator? #194

Closed

parallelise-OM #106

Closed

ThomasASmith mentioned this issue Sep 5, 2019

Patch model #249

Closed

dhardy mentioned this issue Oct 10, 2019

Parallel #255

Merged

ThomasASmith closed this as completed Feb 7, 2020
