Best practice for using max 2 cores on CRAN? #5658
I don't think it is that recent. Here is what the package does now: in essence, in every example file or unit test file (i.e. files that CRAN runs) I explicitly call a function that throttles cores down to 2.

So this gets us the max of 2 in everything that CRAN runs, yet it can be overridden (if, say, your CI has more cores). Importantly, it leaves general startup alone, so normal users are not affected and get full speed, while CRAN gets its limit. I can detail that some more if you care; it is in the package sources.
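In code form, the pattern described above might look like the sketch below. This is illustrative only: `throttle_cores()` is a hypothetical name, not the actual function in Dirk's package, and the override mechanism is an assumption based on his description.

```r
# Illustrative sketch: cap data.table at 2 threads for CRAN runs,
# while allowing an explicit override (e.g. from CI) via OMP_NUM_THREADS.
throttle_cores <- function(default = 2L) {
  n <- default
  omp <- Sys.getenv("OMP_NUM_THREADS", unset = "")
  if (nzchar(omp)) n <- as.integer(omp)   # honour an explicit override
  data.table::setDTthreads(n)
  invisible(n)
}
```

Called at the top of every example and test file, this throttles only what CRAN runs and leaves interactive sessions untouched.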
Also, reading the DT NEWS file can point to the commits that set that up for DT.
@eddelbuettel thanks for the quick feedback; that approach seems similar to what I did, but more flexible. You wrote that CRAN sets option Ncpu -- do you know where that is documented? On https://cran.r-project.org/web/packages/policies.html I see "If running a package uses multiple threads/cores it must never use more than two simultaneously: the check farm is a shared resource and will typically be running many checks simultaneously." but no mention of option Ncpu. Also, I was thinking that if CRAN indeed sets option Ncpu, then the data.table default number of threads should respect that (this avoids having to change example/test code in 1000+ packages with a hard dependency on data.table). @jangorecki I checked https://github.com/Rdatatable/data.table/blob/master/NEWS.md but I did not see any mention of commits; can you please clarify?
An alternative to option Ncpu would be to just detect whether we are on CRAN, for example using the code below, and then throttle to two cores for the data.table default.

```r
> testthat:::on_cran
function ()
{
    !env_var_is_true("NOT_CRAN")
}
```

I do not see any mention of the NOT_CRAN env var in the CRAN repository policy either; do you know where that is documented?
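For illustration, a throttle conditional on that convention might look like this. It is a sketch: it assumes the testthat/devtools convention of setting NOT_CRAN to "true" in local and CI runs, so its absence is treated as "on CRAN".

```r
# Sketch: assume we are on CRAN unless NOT_CRAN is set to "true"
# (the convention testthat and devtools use), then throttle data.table.
if (!identical(Sys.getenv("NOT_CRAN"), "true")) {
  data.table::setDTthreads(2L)
}
```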
Relying on the absence of NOT_CRAN seems fragile. I do not recall where they document it, but it is documented somewhere that they allow two threads, and AFAICR the Ncpu option reflects that.
There is a related R-devel thread, https://stat.ethz.ch/pipermail/r-devel/2021-November/081289.html, that explains the env var in question.
`?tools::check_packages_in_dir` says the `Ncpus` option is used to specify how many packages are checked in parallel.
But note that what you quoted does not limit it to package checks, as your statement implies, but to general 'checking'. Which is where we started: how to play nice at CRAN and not exceed limits.
This check is implemented via this code: https://github.com/wch/r-source/blob/1c0545ba7c6c07e8c358eda552b875b1e4d6826d/src/library/tools/R/check.R#L4123, which gets the 2.5 ratio from an environment variable.
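Conceptually, the check flags an example whenever its total CPU time exceeds the threshold multiple of its elapsed (wall-clock) time. A rough sketch of the logic, not the actual tools internals:

```r
# Rough sketch of the timing check, not the actual check.R code.
# An example is flagged when total CPU time > 2.5x elapsed time,
# which indicates more than ~2 cores were busy on average.
flagged <- function(user, system, elapsed, threshold = 2.5) {
  (user + system) / elapsed > threshold
}
flagged(1.579, 0.11, 0.273)  # the textrecipes case below: TRUE (ratio ~6.2)
```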
Also, from the data.table sources: line 95 at commit 8803918.

Recall that my approach was about taking the lower limit from Ncpu and OMP_NUM_THREADS (and I no longer recall why I do not / did not also set OMP_THREAD_LIMIT). Also: data.table/man/openmp-utils.Rd, line 27 at commit 8803918.
I also ran into this on a submission this week, and so did others recently (see the r-devel thread where Dirk gives similar guidance). I was confused, as I assumed this would always be handled on the data.table side. I wonder if the CRAN test machine is no longer setting the OMP_THREAD_LIMIT environment variable.
I haven't found anything useful in the NEWS file either.
Pondering this a little more, I think this may not really belong here. What may help @tdhock, @TimTaylor, and likely others is a new helper function to throttle cores when running tests or examples in their packages. And those packages may well use data.table.
#3300 quotes an email from Brian Ripley saying that CRAN sets OMP_THREAD_LIMIT=2, but that was back in 2019, so I suspect that @TimTaylor is right that CRAN is no longer doing that, but they are clearly setting the environment variable that enforces the 2.5 CPU-to-elapsed ratio.
#3300 (comment) is an issue originally opened in 2019, with a new comment in July 2023 quoting Uwe Ligges saying that CRAN no longer sets OMP_THREAD_LIMIT.
Your call between a second-hand quote (!!) in a 2019 ticket (!!) on one hand and the CRAN Repository Policy on the other. The latter still says:

> If running a package uses multiple threads/cores it must never use more than two simultaneously: the check farm is a shared resource and will typically be running many checks simultaneously.
Is there something against placing the following in data.table's startup code?

```r
if (any(grepl("_R_CHECK", names(Sys.getenv()), fixed = TRUE))) {
  setDTthreads(2)
}
```

It would save all dependent package maintainers from having to do that down the road. But maybe I'm missing something!
The impression I get is that CRAN is nudging packages to default to being single (or very low) threaded unless the package user explicitly says otherwise. There are related discussions as to whether users are even aware of the default behaviour in #5620 (comment). EDIT: adding a link to the most recent R-package-devel thread, with associated comments from Uwe, Dirk, et al.: https://stat.ethz.ch/pipermail/r-package-devel/2023q3/009454.html
@lrberge That is pretty clever, but it would bite folks like me who have values in a dotfile for such variables:

```r
> grep("_R_", names(Sys.getenv()), value = TRUE)
[1] "_R_CHECK_COMPILATION_FLAGS_KNOWN_" "_R_CHECK_TESTS_NLINES_"
```

It would be better if we could nudge Uwe towards setting the OMP variable on his machine! He is the one who wants the lower core and process count there!
@eddelbuettel how come you have "_R_CHECK" env vars lying around?! :-)

@TimTaylor: the current default for many packages with multithreading is usually to max out resources (or close to it), which in many cases isn't the best.

```r
# A)
library(data.table)
setDTthreads(percent = 0.5)

# B)
library(data.table)
```

Running A) instead of B) is really a pain and nobody will do it: it follows that R functions with multithreading disabled will look slow on large tasks compared to other, non-R software. So in my view: a) using all available (or many) threads as a default isn't great; b) using single-threaded mode as the default and nudging users to change it has big caveats and isn't really user-friendly. So maybe the way is c)? c) "smart" thread setting, with the number of threads decided using the parameters of the task at hand (a bit like the …).
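As an illustration of option c), the thread count could be derived from the size of the task. A hypothetical sketch: `smart_threads()` and the rows-per-thread heuristic are invented for illustration, not an existing data.table mechanism.

```r
# Hypothetical sketch of "smart" thread selection: use more threads only
# when the task is large enough to amortise the parallel overhead.
smart_threads <- function(nrows, rows_per_thread = 1e6) {
  wanted <- ceiling(nrows / rows_per_thread)
  max(1L, min(data.table::getDTthreads(), as.integer(wanted)))
}
smart_threads(1e5)   # small task: 1 thread
smart_threads(1e8)   # large task: capped at the current data.table limit
```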
A) is already the default; there is no need to specify that value.
With my "work" hat on: I disagree. We build software for 'big enough' problems, and e.g. our underlying library maxes out by default. I strongly believe that is the right thing. (And in our package I put a throttler in, as described above.) So I actually hold both views: I like what data.table does by default. Back to: how do we get Uwe to change his mind? 😁
The most critical OpenMP pieces of DT do not speed up linearly with the number of cores. We observed that the difference between 50% and 100% was not very big, and decided to stay with the 50% default to avoid problems on shared machines, which have been reported a couple of times already. There are other pieces of DT which scale more linearly, like froll, and for those 100% would be a better default.
In addition to what @jangorecki says: it's best to minimise changes to global state from the examples. Start your examples with …
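One plausible shape for that advice (a sketch, not a quote from the thread): it relies on setDTthreads() returning the previous setting, so an example can throttle itself and then restore the user's state.

```r
#' \dontshow{.old_threads <- data.table::setDTthreads(2)}
#' # ... example code ...
#' \dontshow{data.table::setDTthreads(.old_threads)}
```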
@jangorecki, your suggested solution does not work in my use case. With tidymodels/textrecipes#251 I still get the CRAN submission NOTE:

```
* checking examples ... [17s/12s] NOTE
Examples with CPU time > 2.5 times elapsed time
                 user system elapsed ratio
step_dummy_hash 1.579   0.11   0.273 6.187
* checking for unstated dependencies in ‘tests’ ... OK
* checking tests ... [24s/19s] OK
  Running ‘testthat.R’ [24s/18s]
```

I don't know if it is because I don't use {data.table} directly but instead use a package, {text2vec}, that uses {data.table}. Last release I ended up simply deleting the documentation for the affected functions, but that is not a long-term solution.
@EmilHvitfeldt it is hard for me to imagine why that could still cause a problem, unless there is some nested parallelism, like mclapply + data.table; in that case, the topmost parallel code should contain a call that sets any nested code to be single-threaded, not just for data.table but for any multithreaded computations. Are you able to narrow down where the parallel processing comes from?
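A sketch of what that looks like in practice: when forking at the top level, pin the nested libraries to one thread inside each worker (the worker body here is illustrative).

```r
library(parallel)
library(data.table)

# Top-level parallelism via mclapply; each forked worker disables
# data.table's own threading so the two levels do not multiply.
res <- mclapply(1:2, function(i) {
  setDTthreads(1L)            # no nested threading inside the worker
  dt <- data.table(x = 1:10)  # ... worker code using data.table ...
  dt[, sum(x)] + i
}, mc.cores = 2L)
```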
@EmilHvitfeldt - Are you seeing this on all platforms? I have not dug further, but text2vec has this in its startup code:

```r
.onLoad = function(libname, pkgname) {
  n_cores = 1L
  if (.Platform$OS.type == "unix")
    n_cores = parallel::detectCores(logical = FALSE)
  options("text2vec.mc.cores" = n_cores)

  logger = lgr::get_logger('text2vec')
  logger$set_threshold('info')
  assign('logger', logger, envir = parent.env(environment()))
}
```

As @jangorecki alludes, could there be more going on than just data.table?
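For comparison, a more conservative default in that spot could consult the same limits discussed above. A hedged sketch, not a change actually adopted by text2vec:

```r
# Sketch of a CRAN-friendlier default for the snippet above: respect the
# Ncpus option when present and guard against detectCores() returning NA.
n_cores <- 1L
if (.Platform$OS.type == "unix") {
  detected <- parallel::detectCores(logical = FALSE)
  if (is.na(detected)) detected <- 1L
  n_cores <- getOption("Ncpus", detected)
}
options("text2vec.mc.cores" = n_cores)
```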
It again demonstrates that CRAN was not really helpful here, insisting that each and every package fix this on its own.
Given that they are considering a general solution to the problem, waiting for that before rigidly enforcing the 2-core thing would have seemed sensible to me...
@cdriveraus News to me. Can you supply a reference to back up that claim? The last we all saw was Uwe Ligges telling everybody to set up 'max 2 cores' in each package needing it.
@eddelbuettel I thought it was mentioned somewhere on the pkg-devel list at one point, but I'm guessing you're more familiar with it than me, so I'm assuming I misinterpreted / imagined one of the posts I read...
One of the frustrating parts of this problem is that I'm not able to reproduce it locally. Unless broken, the {text2vec} parallelism is only used for certain functions which I'm not using (as far as I know). I am only seeing it on the Debian incoming check machine. Right now my examples have the following, to no avail:

```r
#' \dontshow{library(data.table)}
#' \dontshow{data.table::setDTthreads(2)}
#' \dontshow{Sys.setenv("OMP_THREAD_LIMIT" = 2)}
#' \dontshow{library(text2vec)}
#' \dontshow{options("text2vec.mc.cores" = 1)}
```

Also, I very much appreciate all the help that is coming in this thread!
I think there is something else going on on CRAN than just the …
Yes, the problem is not with data.table but with OpenMP (or other multithreading code). It is interesting that even in your case, where it uses a single thread by default, the problem still occurs. In that case I don't think there is any reliable solution that package maintainers can employ.
Not sure this is a proper solution, but this is how I handle the issue within the webseq package. The package is not yet on CRAN, but it does pass checks.

In each of my functions that use parallelisation, I set the default value for the number of cores from the "Ncpu" option. Within a regular R session I do not set "Ncpu", so R will not find it and will fall back to using many cores by default. However, I start each of my test files by setting "Ncpu" to 2.

Do you folks think this will work on CRAN? Thanks.
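As described, the pattern presumably looks something like the following. The function name and body are hypothetical reconstructions; webseq's actual code may differ.

```r
# Hypothetical reconstruction of the described pattern: default the core
# count to the "Ncpu" option, falling back to all detected cores.
my_parallel_fun <- function(x,
                            cores = getOption("Ncpu", parallel::detectCores())) {
  parallel::mclapply(x, function(el) el, mc.cores = cores)
}

# At the top of each test file:
options(Ncpu = 2L)
```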
DT uses setDTthreads() rather than global options. Please read the manual.
R CMD check also runs example() for all your help pages, so any examples that use parallel processing must pass mc_cores = at most 2. Do not set options(Ncpu=2L) in examples, because the user may have set it to a different value they are more comfortable with, and running such an example would change the user's observable global state.

What will your code do if parallel::detectCores() returns NA?
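On the last point, a defensive guard is straightforward; a sketch using only base R:

```r
# parallel::detectCores() is documented to possibly return NA;
# fall back to a single core in that case.
n_cores <- parallel::detectCores(logical = FALSE)
if (is.na(n_cores)) n_cores <- 1L
```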
Thanks @aitap! Oh, some of my examples were wrapped in …
You may want to look into the parallelly package's availableCores() function.
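For reference, parallelly::availableCores() always returns at least 1, never NA, which makes it a drop-in for the guard sketched above:

```r
# availableCores() respects settings such as options(mc.cores = ...),
# CRAN check limits, and various CI environment variables.
n_cores <- parallelly::availableCores()
```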
CRAN has a policy that limits the number of cores that can be used during package checks, and they have recently implemented a mechanism for checking compliance. Upon a recent CRAN submission of one of my R packages which imports data.table, I got the following rejection message from CRAN:

> These messages can be suppressed by adding data.table::setDTthreads(1) at the start of each example.

Is there another/recommended way to avoid using too many cores when checking a CRAN package which uses data.table? The default number of threads in data.table is half of the available cores, which is over the CRAN limit (max 2 cores, I believe).

Also, is there documentation about this somewhere? It would probably be good to mention this in the datatable-importing vignette.