-
Notifications
You must be signed in to change notification settings - Fork 231
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Mainnet crashed with the message SIGSEGV: Illegal storage access. #2134
Comments
Can confirm my validator node crashed with the same issue |
I also had same issue (2 mainnet crashes so far) |
Can confirm too
(graffiti masked)
|
Can confirm another crash with this issue. The same OS and architecture as the above comment. It is also the same machine which crashed with a RangeError. Full versioning info:
Logs:
|
Just happened again with v1.0.3 INF 2020-12-18 03:08:19.001+08:00 Slot end topics="beacnde" tid=1722 file=nimbus_beacon_node.nim:647 slot=117339 nextSlot=117340 head=d3abe649:117339 headEpoch=3666 finalizedHead=78112719:117247 finalizedEpoch=3663 |
Reproduced the issue again with v1.0.4 |
I just reproduced the issue with v1.0.6 with the following log: 29980 head=945cf5a9:329979 headEpoch=10311 finalizedHead=a60c25e3:329888 finalizedEpoch=10309 |
@kdeme I think this might be related to the sigsegv we fixed. Although logs are not so helpful. Do you think that's a possibility? |
My validator crashed with v1.0.7 The error message is different though. 484896 head=2d1fddd2:484895 headEpoch=15152 finalizedHead=68713699:484800 finalizedEpoch=15150 |
@hylin911 please open a new issue. That's not a SIGSEGV at all, and the causes are very different. That said, that assertion isn't in v1.0.7 at all, according to https://github.com/status-im/nimbus-eth2/blob/v1.0.7/beacon_chain/spec/network.nim. 55ecb61#diff-0858753285dc19b7770d771e748b77d477ba6a312ce4ed36df2603d242cd4b88 / #2240 removed it, three weeks ago. Specifically what commit are you on, and how did you update to v1.0.7? |
Yup, I just found that I am on the tag v1.0.6. I used the command "git pull && make update" and thought I moved to v1.0.7... I will make another build and test it. |
See what branch you're on, with |
Crashed on v1.0.7 now hy@hyeth-NUC8i7BEH:~/nimbus-eth2$ git log
490979 head=aa656436:490978 headEpoch=15343 finalizedHead=4d044a6c:490912 finalizedEpoch=15341 |
Just another crash 494472 head=b8f943ae:494471 headEpoch=15452 finalizedHead=d755916f:494400 finalizedEpoch=15450 |
You need to make sure you're either on the |
We could use some GDB backtraces. If you're using a wrapper script, edit "scripts/run-beacon-node.sh" and change line 78 from When you're dropped in the GDB prompt, run the |
Are you all using Geth? |
Yes, I am running Geth on the same machine. |
Hi @stefantalpalaru , I just have time to test v1.0.7 again. It crashed after running about 3 hours. Below is the bt Thread 1 "nimbus_beacon_n" received signal SIGSEGV, Segmentation fault.
Quit anyway? (y or n) y |
Hmm.. I think I made another build while the problematic one is running. Let me test again to see if I can recover the stack. |
Thanks! Please try a |
This should practically solve the segfaults we have been seeing in issue #2134
This should practically solve the segfaults we have been seeing in issue #2134
No more crashes reported since the Nim GC fixes were applied so I'll close this. |
Describe the bug
My validator crashed with the log.
To Reproduce
Steps to reproduce the behavior:
Platform details (OS, architecture):
Ubuntu, intel NUC, I run my own geth on the same machine.
Branch/commit used:
N/A, I git pull just before mainnet launch
Commands being executed:
./run-mainnet-beacon-node.sh
Relevant log lines:
INF 2020-12-02 11:00:50.662+08:00 Slot end topics="beacnde" tid=19715 file=nimbus_beacon_node.nim:593 slot=4502 nextSlot=4503 head=2de96dac:4502 headEpoch=140 finalizedHead=a30f7f9f:4416 finalizedEpoch=138
INF 2020-12-02 11:00:59.047+08:00 Slot start topics="beacnde" tid=19715 file=nimbus_beacon_node.nim:505 lastSlot=4502 scheduledSlot=4503 beaconTime=15h36s47ms933us254ns peers=160 head=2de96dac:4502 headEpoch=140 finalized=a30f7f9f:4416 finalizedEpoch=138
INF 2020-12-02 11:01:01.446+08:00 Slot end topics="beacnde" tid=19715 file=nimbus_beacon_node.nim:593 slot=4503 nextSlot=4504 head=999cda45:4503 headEpoch=140 finalizedHead=a30f7f9f:4416 finalizedEpoch=138
INF 2020-12-02 11:01:11.040+08:00 Slot start topics="beacnde" tid=19715 file=nimbus_beacon_node.nim:505 lastSlot=4503 scheduledSlot=4504 beaconTime=15h48s40ms196us286ns peers=160 head=999cda45:4503 headEpoch=140 finalized=a30f7f9f:4416 finalizedEpoch=138
INF 2020-12-02 11:01:12.679+08:00 Slot end topics="beacnde" tid=19715 file=nimbus_beacon_node.nim:593 slot=4504 nextSlot=4505 head=6dce0a30:4504 headEpoch=140 finalizedHead=a30f7f9f:4416 finalizedEpoch=138
INF 2020-12-02 11:01:23.044+08:00 Slot start topics="beacnde" tid=19715 file=nimbus_beacon_node.nim:505 lastSlot=4504 scheduledSlot=4505 beaconTime=15h1m44ms529us58ns peers=160 head=6dce0a30:4504 headEpoch=140 finalized=a30f7f9f:4416 finalizedEpoch=138
INF 2020-12-02 11:01:23.542+08:00 Slot end topics="beacnde" tid=19715 file=nimbus_beacon_node.nim:593 slot=4505 nextSlot=4506 head=88ad56cb:4505 headEpoch=140 finalizedHead=a30f7f9f:4416 finalizedEpoch=138
peers: 160 ❯ finalized: a30f7f9f:138 ❯ head: 88ad56cb:140:25 ❯ time: 140:25 (4505) ❯ sync: synced ETH: 96.31415611 Traceback (most recent call last, using override)
/home/hy/nimbus-eth2/vendor/nimbus-build-system/vendor/Nim/lib/system/excpt.nim(614) signalHandler
SIGSEGV: Illegal storage access. (Attempt to read from nil?)
Screenshots
If applicable, add screenshots to help explain your problem.
Additional context
I had seen the same crash before mainnet launches, thus I git pull and built the latest. So this is the second time I am seeing this. One before mainnet launch, one after.
The text was updated successfully, but these errors were encountered: