Thread oversubscription on multicore systems #296

Open
headmeister opened this issue Oct 17, 2022 · 1 comment

Comments

@headmeister

Hello, I encountered this error on our machine (BART v0.8, compiled from source) when running ecalib through the Python wrapper. Our machine has 128 cores, and the run failed with:

BLAS : Program is Terminated. Because you tried to allocate too many memory regions.
Segmentation Fault.

This problem is most likely related to this OpenBLAS issue: autogluon/autogluon#1020
When I limited the number of threads for OpenBLAS and OpenMP to 32 via environment variables, the error went away. This might affect others too when BART is run on larger machines. I know this might not be the final solution, but it can at least help in getting BART to run.
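
For reference, a minimal sketch of the workaround in Python. The variable names `OPENBLAS_NUM_THREADS` and `OMP_NUM_THREADS` are the standard OpenBLAS and OpenMP settings; the post above does not spell out which variables were used, so treat them as an assumption. The helper `thread_limit_env` is a hypothetical name for illustration, not part of BART. The sketch also assumes the limits are set before the wrapper starts the `bart` binary, so that the child process inherits them.

```python
import os


def thread_limit_env(max_threads=32, base_env=None):
    """Return a copy of the environment with OpenBLAS/OpenMP thread caps set.

    Hypothetical helper for illustration; not part of BART.
    """
    env = dict(os.environ if base_env is None else base_env)
    limit = str(int(max_threads))
    # Assumed variable names: the standard OpenBLAS and OpenMP thread settings.
    env["OPENBLAS_NUM_THREADS"] = limit
    env["OMP_NUM_THREADS"] = limit
    return env


# Apply the caps to the current process so that any child process started
# afterwards (for example the bart binary) inherits them.
os.environ.update(thread_limit_env(32))
```

The same effect can be had by exporting both variables in the shell before starting Python. The value 32 is simply the one that worked on our 128-core machine; other systems may need a different limit.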

Best Regards,
Jiri

@uecker
Member

uecker commented Oct 22, 2022

Thanks for pointing this out!
