Parallel Simulations #576
Comments
I have a strong feeling I know what would cause this since it's occurring in [...]. There is a special frame called the [...]. I think the solution will be to simply alter the behavior of the [...].
I think it would also be good to have some regression tests in DART for parallel simulations like this.
I agree in principle, but it's worth noting that unit tests for race conditions are unreliable: whether a race is triggered depends on the operating system's scheduling, which we don't control. We can (and probably should) still have unit tests for concurrency, but we need to recognize that a passing test does not prove the race is absent; it may simply not have been triggered on that run.
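To make the scheduling point concrete, here is a minimal sketch of a stress-style harness, not DART's actual test code. The helper name `repeatConcurrently` and its parameters are hypothetical. It releases all threads at the same moment and repeats the batch many times, which raises the odds of hitting a bad interleaving but cannot guarantee it.

```cpp
#include <atomic>
#include <cstddef>
#include <functional>
#include <thread>
#include <vector>

// Hypothetical helper (not part of DART): runs `task` on `numThreads` threads
// at once and repeats the whole batch `rounds` times to widen the window for
// problematic interleavings.
void repeatConcurrently(
    std::size_t numThreads,
    std::size_t rounds,
    const std::function<void()>& task)
{
  for (std::size_t r = 0; r < rounds; ++r)
  {
    std::atomic<bool> go{false};
    std::vector<std::thread> threads;
    threads.reserve(numThreads);

    for (std::size_t t = 0; t < numThreads; ++t)
    {
      threads.emplace_back([&go, &task]() {
        // Spin until released so that all threads start the task together.
        while (!go.load(std::memory_order_acquire))
          std::this_thread::yield();
        task();
      });
    }

    go.store(true, std::memory_order_release);
    for (auto& thread : threads)
      thread.join();
  }
}
```

In a regression test, `task` would build and tear down one simulation. A clean run only means no failure was observed, so pairing a harness like this with a race detector such as ThreadSanitizer is probably worthwhile.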
I agree with you on that point; reproducing a race condition is probably not always possible. But it would be better than nothing. 😄 We might want to start with Konstantinos's case.
Thanks for the prompt answer! When you have the patch, I'll gladly check whether it solves the problem. I could also write a small test for parallel simulations (not a unit test; an evolutionary algorithm seems like a better fit).
I managed to create a unit test which reliably reproduced a race condition that I believe is analogous to the one you were experiencing. You can find it in pull request #577 on branch grey/fix_world_concurrency. Whenever you get a chance to test it out, please let me know if it resolved your issue.
It seems to be working. I will leave a big job running so that we can be sure, and I will let you know whether it works in around 2 days. Thanks!
Happy New Year to all of you! It seems to work quite well!
Fixed in #577
Hello,
Merry Christmas to everyone.
I am running multiple simulations in parallel (around 24) and I am getting segfaults. This is the backtrace:
The code I am using is extremely simple:
This runs in parallel. The code can be found here (not the parallel version; I tried both plain pthreads and TBB and I get the same segfault with each).
They are obviously sharing something in the Frame, but I cannot find exactly what. Also, the segfaults only appear when deleting the objects. Any ideas?
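The symptom described above (crashes only on deletion, across otherwise independent simulations) is what one would expect if every object registers with some process-wide parent and unregisters in its destructor. The sketch below illustrates that failure mode and one way to guard it. It is an assumption about the general pattern, not DART's `Frame` implementation or the fix in #577; `RootFrame`, `ChildFrame`, and `runParallelLifetimes` are hypothetical names.

```cpp
#include <cstddef>
#include <mutex>
#include <set>
#include <thread>
#include <vector>

class ChildFrame;

// Hypothetical process-wide root that every child frame attaches to.
class RootFrame
{
public:
  static RootFrame& instance()
  {
    static RootFrame root; // initialization is thread-safe since C++11
    return root;
  }

  void attach(const ChildFrame* child)
  {
    std::lock_guard<std::mutex> lock(mMutex);
    mChildren.insert(child);
  }

  void detach(const ChildFrame* child)
  {
    // Without this lock, concurrent erase() calls from different threads
    // would corrupt the set, typically crashing during destruction.
    std::lock_guard<std::mutex> lock(mMutex);
    mChildren.erase(child);
  }

  std::size_t numChildren() const
  {
    std::lock_guard<std::mutex> lock(mMutex);
    return mChildren.size();
  }

private:
  RootFrame() = default;

  mutable std::mutex mMutex;
  std::set<const ChildFrame*> mChildren;
};

// Hypothetical frame that registers itself with the shared root.
class ChildFrame
{
public:
  ChildFrame() { RootFrame::instance().attach(this); }
  ~ChildFrame() { RootFrame::instance().detach(this); }

  ChildFrame(const ChildFrame&) = delete;
  ChildFrame& operator=(const ChildFrame&) = delete;
};

// Creates and destroys frames on several threads at once and returns how many
// children the root still tracks afterwards.
std::size_t runParallelLifetimes(
    std::size_t numThreads, std::size_t framesPerThread)
{
  std::vector<std::thread> threads;
  threads.reserve(numThreads);

  for (std::size_t t = 0; t < numThreads; ++t)
  {
    threads.emplace_back([framesPerThread]() {
      for (std::size_t i = 0; i < framesPerThread; ++i)
      {
        ChildFrame frame; // attached here, detached at end of scope
      }
    });
  }

  for (auto& thread : threads)
    thread.join();

  return RootFrame::instance().numChildren();
}
```

Locking inside the shared parent is only one option. Another is to avoid tracking children in the shared parent at all, which removes the shared mutable state entirely.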