-
-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ccall
tests take a very long time to run on Buildkite, but only when running under rr
#42242
Comments
Looking at
I don't see anything that interesting in the
And the following for a "fast" trace:
So it looks like something is causing the "slow" version to have a huge number of [0] https://buildkite.com/julialang/julia-master/builds/3631#bd3067d3-e10e-4f46-bb55-8d7d25604ffc |
What is |
This is all being measured on AMDCI machines, and it is indeed set to |
@Keno your guess about scheduling problems seems to be on the right track. Do you know what a huge number of In case you want to look at this yourself, you can check out |
|
The bizarre thing is that it doesn't seem to happen when not running on CI. I haven't been able to identify what might be different between the two environments. :( |
Is there a filter running that blocks SYS_rrcall_check_presence, which is one of the high integer syscalls? |
I don't think so; we don't filter syscalls in the sandbox, I don't think. And I've run it inside of the same rootfs on the same machine (even with a nested sandbox, like we do on CI) and it stays "fast". |
Is buildkite still misconfigured to run all of the tests with massive numbers of threads? |
Buildkite is configured to set |
For So on the |
We could also decrease the maximum possible value of Right now, when running the tests with If we want, we could further decrease that upper bound. |
Okay; so I have confirmed that setting The bad news is that, as far as I can tell, setting |
This is good news! |
The weird part is that we also set |
It was single threaded, until you forced it to launch 16x the number of threads intended and turned it into a parallelism benchmark instead. |
What is the recommended way to ensure that Julia never launches more than |
So, in order to ensure it doesn't sometimes launch more than 128 threads sometimes, you force it to instead launch 256 always? I don't get this logic. |
Closing as this was fixed in buildkite |
rr
?ccall
testsrr
trace filesamdci5
, manualamdci5
, manualamdci5
, Buildkiteamdci5
, Buildkiteccall
)amdci5
, Buildkiteccall
test set)Notes:
ccall
tests pass.manual
means that we are not using any kind of sandboxing.The text was updated successfully, but these errors were encountered: