-
Notifications
You must be signed in to change notification settings - Fork 4.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
# Fatal signal 11 (SIGSEGV), code 1 (SEGV_MAPERR), fault addr ************ in tid **** (SGen worker) #105389
Comments
@dcrespo-isaac thanks for the logs! They universally show that the issue is somewhere in the MonoVM's GC (SGEN), so I think the issue belongs in |
Tagging subscribers to this area: @BrzVlad |
@BrzVlad looks like a GC issue |
@dcrespo-isaac Scan the logs for messages containing strings similar to |
Thank you for the triage. I've found the logcat I had analyzed for the SIGSEGV errors. In the logcat-Assertion.txt, you'll find two samples of SIGABRT preceded by :
Additional notes:
I will continue to update the issue if I encounter any other assertion errors. |
@dcrespo-isaac I don't have any insight for the SIGSEGV, but for SIGABRT - does your app have a |
Hi @lambdageek, First and foremost, thank you for your response. It is greatly appreciated. We have identified the origin of the SIGABRT:
Regarding this open issue, we continue to experience problems with SIGSEGV, which is the primary subject of this issue and, regrettably, the cause of the majority of our crashes. We are currently deploying our reverted version to Xamarin, as it has proven to be significantly more stable. We will remain vigilant for any responses, fixes, or requests for additional information regarding this issue. |
Hi Team, @mangod9 , We noticed a few weeks ago that the issue was moved to the “Future” milestones. While “No due date” is quite clear, it leaves us with a significant span of uncertainty, especially since Xamarin is no longer supported. We would like to manage our expectations regarding this issue. Will it be fixed or at least investigated by your team of experts? Currently, we have had to completely rollback all our projects to the old .NET Framework/Xamarin to reduce crashes due to SIGSEGV. Please let us know if there are any troubleshooting techniques or tools we can use to provide you with more information, or ideally, find a fix ourselves. We look forward to hearing more on this thread, |
Hello @dcrespo-isaac, you mention that the issue is reduced in 8, but still occurs occasionally? |
Hi @mangod9, It appears there may have been a misunderstanding regarding the issue. Our workaround was to revert to Xamarin because it:
|
Hi @vitek-karas, since this is on Android can you please provide guidance on diagnostic steps. Is there a way to collect some dumps when the failure occurs? |
This issue looks like duplicate of #100311. Did disabling concurrent GC lead to fixing of the crashes ? This issue seems somewhat impossible to investigate without a repro. However, I think we can give an attempt to investigate whether something is wrong with the GC by trying some debug flags like |
Hi @BrzVlad, I reviewed issue #100311, which pertains to SIGABRT. We no longer face issues with SIGABRT; this thread is actually about SIGSEGV. However, @pijnappel has created an issue that seems similar to ours: dotnet/maui#21479. We plan to beta test the version without concurrent GC in a few days and will keep everyone updated. Thank you for the MONO_GC_DEBUG options; I was unaware of some of them, and they could be helpful. What exactly would you need to run everything on your end? |
When there are GC problems, the application can crash in a variety of ways, they are non-deterministic by nature. I am almost certain that the underlying issue is the same and based on comments on other issues it seems that the culprit is somewhere in the concurrent collector. Note that I'm getting crashes with some of these debug options locally on some of our tests suites that are somewhat heavier on the GC. I will have to do a little bit of investigation to see whether they are false positives or not and see if these constraints from the debug options are respected in our own tests suites, before investigating anything on a mobile app. Assuming we can't reproduce this issue, the second approach I would take is to understand more or less what the application was doing when it crashed and then attempt to do more or less the same thing locally, with some different gc params / debug options, in the hope of catching some inconsistencies in the GC. I think an apk with the application with some instructions on what to do in the app could be useful. @grendello I am assuming one could easily unzip an apk, patch |
@BrzVlad Replacing a shared library in the apk requires that you re-align and re-sign the archive, or it won't install and run on the device/emulator. Signing process records sha256 hashes of the shared libraries. When you need to do that, ping us on Discord in the #android channel, we'll help you out :) |
@dcrespo-isaac In addition to testing with concurrent gc disabled, I think it would be worthwhile to keep concurrent gc enabled but use the flags |
Thank you. We will develop a plan to test the different options and will get back to you with the data. At the very least, we will have two versions:
Additionally, I am compiling information to create a brief documentation on how you can test our .apk on your end. |
Don't use the So some explicit configurations could be
|
Hi, Quick update, We're sorry for the delay, we should be able to test and give you more data by next week. |
Hi, @BrzVlad, Sorry this took so long; we had a lot of work to do to release our new beta version. Quick reminder:
About the .apk:
What we have found:
Do you suggest we investigate the last two options, or should we stick with disabling concurrency for now? |
My current understanding is that concurrent GC was enabled by default with the new .NET versions (5+). Concurrent GC was supported in legacy Xamarin but you needed to explicitly add it in the csproj (or click the option in the project editor). This would mean that using this configuration for .NET8 is not really a change in behavior and you shouldn't encounter any problems with pause times. The |
@BrzVlad How are you building with the concurrent GC disabled? Are you just doing it in the csproj configuration with
Or is there other build params/steps? |
I'm encountering an issue that seems specific to Xiaomi devices. The error details are as follows:
This error does not occur on other Android devices or on iOS, only on Xiaomi devices. This config: |
@IainS1986 That is correct. @Th3L0x That is a completely unrelated issue. See #106410 for workaround until we are able to investigate it. |
Would this comment be worth investigating? #106410 (comment) |
Android framework version
net8.0-android
Affected platform version
.NET 8.0.100, .NET 8.0.303
Description
Our Android application started to crash randomly 3 months ago when we upgraded it to .net6.
We tried to add more logging, analyze the logcat, reproduce... but all in vain.
In an attempt to stabilize the app, we have upgraded to .net8.0-android and the number of crash was significantly reduced.
This problem happens on different kind of tablets, and we excluded the idea that it could come from firmware or hardware
because they did not change and with .net framework (Xamarin) everything was running smoothly (1 crash rarely vs hundreds per day now).
Our dev and testing team (qa) are not able to reproduce the problem. Looking at the logs (SIGSEGV_Samples_logcat-with-symbols.txt), we see that all of the SIGSEGV (SEGV_MAPERR) happens on the SGen worker thread but give different backtraces.
At the end of the log file, you’ll find a SIGABRT from SGen. We’ve kept it there for reference, in case it’s related to the SIGSEGV issues.
We used
dotnet-symbol
to retrieve a debug version of libmonosgen-2-0.so (with symbols). Then we runndk-stack
on ourlogcat to symbolize the backtraces.
Application catches nothing. So we don't have any other traces to give from it. The logcat contains the most detailled
ones we have.
FYI, we did split our crashes into 3 categories, we're not sure that they related to the SIGSEGV...
but to give you all data, the other 2 are :
What we tried :
Reproducing by running our acceptance tests on the tablet
-> Nothing happens unless we run the day long tests and it's very rare. It does not give more information than what we have in production.
Upgrading .net version (from 8.0.100 to 8.0.303)
-> Nothing changed.
Revert to .Net Framework (Xamarin)
-> We did it in order to identify if it's related to our changes or the framework. And it worked !!!
The number of crash were back to normal (before .net6 and .net8).
*** EDIT *** We also tried to disable trimming by settings
<PublishTrimmed>false</PublishTrimmed>
in all of our .csproj.-> Our APK was a little bit heavier (of course) and no changes in number of crashes..
What we are still probing :
-> We’ve seen numerous issues and discussions about the stability of SGen. We are hoping that this change can make a difference, as the backtraces seem to point to a problem that could be within it. If this works, it could allow us to stay with .net8 until a fix is found.
Thank you in advance for your help. We will remain vigilant if you need more details.
Steps to Reproduce
Unfortunetaly we are unable to reproduce the problem in step by step as it appears randomly on production.
Did you find any workaround?
Currently, reverting to the .Net Framework appears to be effective for us.
Relevant log output
The text was updated successfully, but these errors were encountered: