-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Segfault in HGCalCLUEAlgoT<...>::makeClusters() #42025
Comments
A new Issue was created by @makortel Matti Kortelainen. @Dr15Jones, @perrotta, @dpiparo, @rappoccio, @makortel, @smuzaffar can you please review it and eventually sign/assign? Thanks. cms-bot commands are listed here |
assign reconstruction, upgrade FYI @cms-sw/hgcal-dpg-l2 |
New categories assigned: upgrade,reconstruction @AdrianoDee,@mandrenguyen,@clacaputo,@srimanob you have been requested to review this Pull request/Issue and eventually sign? Thanks |
@makortel thanks for reporting on this. We will follow up on this ASAP. |
Seen again in WF25234.911 step3 on slc7_amd64_gcc11 CMSSW_13_3_X_2023-09-13-2300. Apparently very rare, but this at least lets us eliminate some possible thread interactions.
|
Happened again: link
|
|
New stack trace, somewhat different from the old ones. WF 25234.911, el9_amd64_gcc12, CMSSW_14_0_X_2023-11-19-2300
|
@dan131riley can we have the input file so we can reproduce the issue? |
Probably the same issue is seed in UBSANR630_X IB. Also I found an old issue that looks similar: #41731. |
Any update on this? |
Occurred in CMSSW_14_1_X_2024-02-23-2300 on el8_ppc64le_gcc12
|
Now also a floating point exception in CMSSW_14_1_NONLTO_X_2024-03-24-0000 (although I'm puzzled what enabled those)
|
we have seen external packages messing with the FPE state, that's why we added it to the signals we trap in #39474 |
Thanks Dan. |
Occured in CMSSW_14_1_X_2024-05-05-2300 for slc7_amd64_gcc12:
|
Occured in CMSSW_14_1_DBG_X_2024-05-09-2300 for el8_amd64_gcc12:
(stacktrace is incomplete: |
Occured in CMSSW_14_1_ROOT6_X_2024-06-05-2300 for el8_aarch64_gcc12:
|
type hgcal |
this should be fixed now by #45178 |
Worklofw 25234.911 step 2 failed in CMSSW_13_2_ROOT628_X_2023-06-19-2300 on el8_amd64_gcc11
https://cmssdt.cern.ch/SDT/cgi-bin/logreader/el8_amd64_gcc11/CMSSW_13_2_ROOT628_X_2023-06-19-2300/pyRelValMatrixLogs/run/25234.911_TTbar_14TeV+2026D99_DD4hep/step2_TTbar_14TeV+2026D99_DD4hep.log#/
The text was updated successfully, but these errors were encountered: