StackOverflowError when creating RemoteChannels #30679
Comments
I've run into the same issue when using
The way it crashes seems to vary depending on the platform I'm running it on. On Linux (Ubuntu 16.04.4) it does not crash, but hangs on the RemoteChannel line while slowly ramping memory use up to 100%. On macOS I either get a similar error:
or a segmentation fault. |
I just tested this in 1.1.0-rc2.0 and on macOS the StackOverflowError I was getting in 1.0.3 has changed to
|
1bd2334 is the first bad commit according to git bisect; reverting that commit on top of v1.1.0 fixes the issue for me. |
Thanks for doing the bisect. I don't know how that could have caused this but I'll look into it. |
I can reproduce this on 1.0.3 but not on master. |
Ok, progress. This is hitting this case (line 53 in e87b19b), which causes the deserialize method to call itself instead of the intended invoke target. @vtjnash, changing it to just unconditionally call fptr fixes the problem. I see from the comment that you may have intended a different fix, but could we change this for now?
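For readers unfamiliar with invoke, here is a small sketch of the pattern being discussed. It is not the actual deserialize code; the function name describe is hypothetical and only illustrates how a specialized method hands off to a more general one. If that call resolved back to the specialized method instead of the intended target, it would recurse without end, which matches the StackOverflowError reported above.

```julia
# Hypothetical function, used only to illustrate `invoke`; not from Julia's serializer.

# General fallback method.
describe(x) = "generic"

# Specialized method that does some extra work, then delegates to the
# fallback. `invoke` selects the method matching Tuple{Any} explicitly,
# so dispatch does not pick describe(::Int) again.
function describe(x::Int)
    return "Int, then " * invoke(describe, Tuple{Any}, x)
end

println(describe(1))
println(describe("a"))
```

The first call goes through the specialized method and then the fallback; the second goes straight to the fallback.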
|
Thanks for looking at this, @JeffBezanson. In case it's useful information, I can reproduce this on master. I built a fresh clone from source yesterday (commit e87b19b) on Ubuntu 16.04. Running @dwfmarchant's example from earlier in this thread, I get:
Like @martinbiel, I've found that this error occurs when I have some packages loaded but not others. I can't find any common denominator among the packages that cause this crash. It's been cropping up in some of my company's parallel Julia code when trying to compute values on remote Julia worker processes and store them to
As @GAIKA found, this problem goes away if I revert commit 1bd2334, starting from yesterday's master in my case. |
I have noticed some strange behavior with RemoteChannel in 1.0.3 that did not occur for me in 1.0.2. I think I have identified an MWE. The following works:
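The original snippet is not preserved in this copy of the thread. A minimal sketch of a setup along these lines, assuming a single added worker and an empty struct A (the struct body and the variable names w, ch and result are assumptions, not the author's code), might look like this:

```julia
using Distributed

# Add one worker; in a fresh session it gets id 2.
w = first(addprocs(1))

# Reported trigger (assumption about placement): importing a large package
# such as Distributions or Plots on all processes before the struct definition.
# @everywhere using Distributions   # uncomment if the package is installed

# Define the element type on every process so the worker can build Channel{A}.
@everywhere struct A end

# Create a channel that lives on the worker, then round-trip one value.
ch = RemoteChannel(() -> Channel{A}(1), w)
put!(ch, A())
result = take!(ch)

# Shut the worker down again.
rmprocs(w)
```

With the import left commented out, this corresponds to the working case; uncommenting it corresponds to the failing case on the affected versions.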
However, if I import some module before the struct definition, the following happens:
The error repeats consistently on consecutive attempts of RemoteChannel(()->Channel{A}(1), 2), but if I run some code snippet involving the A type on the second node, I can suddenly create channels again:
The error does not occur after every module import. For example, importing MacroTools, Statistics, or seemingly any standard library does not lead to this error. Other large modules that I have tried that do lead to the error are Plots and Distributions. I have not found any common denominator between the module imports that cause this error. My version info: