Skip to content

Commit

Permalink
Improve threads performance when running under rr (#35527)
Browse files Browse the repository at this point in the history
Partr does a user space busy loop when no events are found (until
some timeout expires), in the expectation that another thread might push some.
When running under rr, all execution is serialized, so there's no other
thread running simultaneously, and rr will happily let the thread busy loop until
its timeslice (50ms or so) is expired. The result is a dramatic
performance decrease in the threads test that is entirely unnecessary
(even during regular execution it's not clear that busy waiting
makes much sense on low-core CPUs, since we're potentially starving
another thread of the execution time it needs to actually schedule
some work). For now, just detect that we're running under rr and stop
doing any sort of busy waiting in that case.
  • Loading branch information
Keno authored Apr 22, 2020
1 parent ad87100 commit f7e5f3e
Showing 1 changed file with 35 additions and 0 deletions.
35 changes: 35 additions & 0 deletions src/partr.c
Original file line number Diff line number Diff line change
Expand Up @@ -275,10 +275,45 @@ JL_DLLEXPORT int jl_enqueue_task(jl_task_t *task)
}


static int running_under_rr(void)
{
#ifdef _OS_LINUX_
#define RR_CALL_BASE 1000
#define SYS_rrcall_check_presence (RR_CALL_BASE + 8)
static int checked_running_under_rr = 0;
static int is_running_under_rr = 0;
if (!checked_running_under_rr) {
int ret = syscall(SYS_rrcall_check_presence, 0, 0, 0, 0, 0, 0);
if (ret == -1) {
// Should always be ENOSYS, but who knows what people do for
// unknown syscalls with their seccomp filters, so just say
// that we don't have rr.
is_running_under_rr = 0;
}
else {
is_running_under_rr = 1;
}
checked_running_under_rr = 1;
}
return is_running_under_rr;
#else
return 0;
#endif
}


// sleep_check_after_threshold() -- if sleep_threshold ns have passed, return 1
static int sleep_check_after_threshold(uint64_t *start_cycles)
{
JULIA_DEBUG_SLEEPWAKE( return 1 ); // hammer on the sleep/wake logic much harder
/**
* This wait loop is a bit of a worst case for rr - it needs timer access,
* which are slow and it busy loops in user space, which prevents the
* scheduling logic from switching to other threads. Just don't bother
* trying to wait here
*/
if (running_under_rr())
return 1;
if (!(*start_cycles)) {
*start_cycles = jl_hrtime();
return 0;
Expand Down

2 comments on commit f7e5f3e

@nanosoldier
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Executing the daily benchmark build, I will reply here when finished:

@nanosoldier runbenchmarks(ALL, isdaily = true)

@nanosoldier
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @ararslan

Please sign in to comment.