-
Notifications
You must be signed in to change notification settings - Fork 0
FW16 Freeze then Reboot (FTR) S5_RESET_STATUS = 0x08000800 <- Sync Flood. #41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
There has been an interest in understanding the port80 codes. I attach some files, that contain the PORT80 codes output during a normal reboot and a normal cold-boot. They have been sorted and de-duped to shorten the list of codes. It would be helpful if FW support could identify what each code means. |
People are seeing 3 different scenarios:
In both cases, no logs or crash dumps to help diagnose the problem.
All three are difficult to reproduce. This issue is only for "Freeze then Reboot" issues. Not "Instant power off". |
As an aid to people trying to understand the EC console output. Please see attached file for the definition of each HC (Host command to EC) |
When comparing multiple port80 traces, from say, repeating a "reboot", and then comparing the sequence of port80 values. It appears to be that the capture of port80 values by the EC is not perfect. There a bit errors and missed values. I will see if there is some bug in the EC code or something else causing the problem. I have now confirmed there is a problem. Further investigation: Side affects: |
During a normal boot (I.e. not a FTR): 00000145 | 5f535452 | _STR |
The following file contains the Port80 codes during a reboot of the Linux operating system normally. I.e. User clicked on "reboot". The purpose of this is to
|
The changes I have made to the EC and ectool that work with a FW16 AMD are here: To get longer port80 history: I have found that 4096 is about the largest value you can give it, otherwise the EC runs out of RAM and behaves badly. |
My updates to the ECTool.efi reflash program, to make it slightly safer to use. I only ever write to the RW section, never to the RO section, so that if something goes wrong, it falls back to the FW provided EC firmware. |
Hi, I'm using Framework 16 AMD Ryzen 7 7840HS with Radeon 780M ( no dGPU )
I originally thought this issue was with amdgpu/drm as I don't use stable kernels anymore ( not that I intend to continue but I do it to grab amdgpu/drm issues ASAP ), since kernel 6.12-rc1 I've only used mainline ( Linus Torvalds branch ) This FTR is something I've been experiencing for sometime and since BIOS 3.05 it's increased ( I believe it happened 3 times when using BIOS 3.04 but I just assumed it was amdgpu/drm at the time ) what @jcdutton says here, point 2 is the one I'm experiencing
The FTR issue for me triggers more frequently ( from multiple times a day down to 1-3 days ) when the Framework 16 display panel refresh rate is at 165HZ ( with Panel Self Refresh/Panel Replay enabled ), also using an external display ( 3840x2160@60HZ ) and in DC power mode ( Framework battery ) I've also noticed FTR happens ( maybe coincidence ) when watching a video ( youtube on firefox for example ) on primary or secondary display but then again the issues happens when I'm just programming ( RustRover mostly ) or browsing the web but it's random like others have said and when in AC power mode ( Framework 180W power adapter ), it could be a week or so before it FTR. This is all while the power profile ( set via KDE power devil ) is set to powersave whether I'm in AC or DC power mode. With the display panel refresh rate is set to 60HZ, it still FTR's but not as frequently ( that's how it feels ). For the past week-ish though with power profile set to balanced ( sometimes performance when compiling software ) and using AC power mode ( Framework 180W power adapter ), display panel refresh rate set to 165HZ, it hasn't FTR yet. From my perspective, it's looking like something to do with when running in powersave and it's worst when in DC power mode Since day one I've had the charge limit on the battery set to 80% and most of the time I'm using powersave. What I'm using to change the power profiles ( GUI component ) with KDE, I think it's called power devil that's communicating with power profile daemon Because I set the power profile via KDE power devil and not directly in the pseudo/proc files, the terms between KDE and AMD not fully expressed KDE -> AMD powersave -> powersave also ( may not have anything to do with it ) when using power profile powersave, CPB ( Core Performance Boost ) is turned off and the iGPU performance level is set to low, in other power profiles CPB is enabled and iGPU performance level set to auto I've only just learnt of this issue here today but been experiencing this FTR since last year edit1: I forgot to add. Just like others after reboot, I check the kernel logs for warnings/errors and there is none edit2: power profile daemon also adjust AMD's ABM ( Adaptive Backlight Management ) if you don't have the kernel parameter amdgpu.abmlevel set. Mine is set as amdgpu.abmlevel=2 |
@sinatosk just to confirm, you DON'T HAVE the AMD RX7700S in the extension port thiny? This is all occurring on your iGPU? |
correct |
Another observation. |
@sinatosk |
try it in DC power mode ( battery ) too please |
I have some more info that might help track this down. The quote is from someone at AMD: https://marc.info/?l=linux-i2c&m=168089982408414 There should be some public documentation on interpreting So, If anyone wishes to compile their own kernels and help with this problem, its very worth while applying the above patch. Normal values: So, we are looking for values that are different than those three values. |
I'll try that on Saturday, busy tomorrow and Friday |
I have a 13" FW AMD and a longstanding issue with similar symptoms (20s freeze -> reboot, no logs), although mostly after resume from suspend. I now have 6.12.13 compiled with the proposed patch and after a reboot ( and an unrelated hard reset in between :), i find:
Would this (output after of the next freeze/reboot cycle, oc) be of any help, albeit a FW 13 AMD? |
Some BIOS Port codes can be found here: For example, IPL POST codes are prefixed with 0xee00...., and ABL POST codes are prefixed with 0xea00.... |
FWIW, perhaps this can help narrow down the cause or help others being plagued by this issue. I experienced this (FTR) issue since i received the laptop (dec 23) every 3-5 days and around September 2024, i contacted FW support, who suggested to perform a "mainboard reset" by the instructions following below. After this "mainboard reset", which i suppose resets some kind of nvram on the board/ec (obviously, i don't understand all this low level stuff well enough), but since this reset to this day (around 4-5 months), i had around 3-4 FTR cycles so far (which could as well be an whole different issue), but the regular, every few days, FTR issue was/is, apparently/hopefully gone. Considering this, it may take a while for this to happen again here, but if it does, i'll remember and post the value in this issue.
It's a 7840u here. I'd consider this to be a different layout of the same architecture. Quote Instructions from Framework Support: Please perform a full mainboard reset. This will clear up any states that are saved. Kindly follow the instructions below and see how it goes. Plug in the system to AC. |
@sydneymeyer So do you suggest people experiencing this issue to try this as well ? It will only reset the BIOS settings ? |
@YummYume This is what Framework Support is suggesting people experiencing this issue to do and what had the effect for me i described above.
Well, it resets the BIOS to factory defaults, e.g. if you have set up Secure Boot with your own keys, you might have to re-enroll them. Perhaps you also might have to restore the UEFI-Boot Entries, can't remember. But other than that, yes. |
@sydneymeyer Thanks, I'll probably try doing that tomorrow and see if it fixes anything. It's weird because I haven't touched my BIOS much anyway other than changing the max charging power for the battery. |
Just a comment. This issue ticket is aimed at finding the cause of the problem. So hiding it with a main board reset etc. is not the aim of this issue ticket. I.e. we are looking for a root cause. |
Hi, didn't get a chance to do it yesterday as I said I would. I'll apply the patch tomorrow ( after linux 6.14-rc3 is released ) I'll then try to trigger the issue ( if possible ) over the week or two |
Just had a screen freeze didn't go black but it also didn't restart so I held power button to force reboot code is 0x00000800 but @jcdutton already showed that took 3 days for that to happen - linux 6.14-rc3, DC power mode ( battery ), external monitor enabled, framework display at 165Hz, power profile set to powersave, CPB off and iGPU low edit1: still continuing to test this issue |
But, just to be clear. We now only have an error sample size of 1. |
@sydneymeyer |
The issue occurs exclusively after resume from suspend. The Laptop is mostly connected to a USB-C powered dock with HDMI, Ethernet and a Apple Magic Trackpad 2 attached. FTR cycles have occured while connected to a dock (connected before suspend, while sleeping and when resuming), after disconnecting the dock while sleeping and resuming standalone, and completely standalone (i.e. suspend/resume cycles without attached dock). Next boot, the last logged line in the system logs is usually from suspending the laptop (via systemd), nothing else. "ectool panicinfo" gives no "no panic data". OS is NixOS with respective stable kernel. Programs open are usually sway (vulkan), Firefox (w/ vaapi enabled), Thunderbird, mpv (vulkan), foot, vim, cmus with pipewire, networkd, iwd.. More or less. Unfortunately, i have no way of reproducing the issue or have recognized a pattern other than what's described above.
|
I just had a FTR. 2025-02-22 S5_RESET_STATUS = 0x08000800 <- Sync Flood. Here is the port80 output: Looking at the port80 output. It looks to me that the FTR happened here: and then after the reboot, the boot up entered / started the Linux kernel here: Note: the aaaa0001 is added by me, so not a port80 code from the BIOS. I have added a few aaaaXXXX port80 codes written during the kernel panic and emergency reboot parts of the Linux kernel, and none of those aaaaXXXX port80 codes are present in the output, so this did not reboot due to a panic or anything like that. So, we now have reports from 2 people with the same S5_RESET_STATUS = 0x08000800. |
Mar 02 15:45:20 fw kernel: S5_RESET_STATUS = 0x08000800, like usual, i.e. while resuming from suspend, no logs at all. |
@jcdutton Do you need anything else from me or the machine in this state, because i'm considering resetting and selling the laptop. I'm sick of all the unresolved issues this computer has. |
I was linked here from the forums (https://community.frame.work/t/sudden-reboot-on-wake-with-no-logs/65434 ) and I'm experiencing a similar issue with my Framwork 13 / R5 7640U. I applied the S5_RESET_STATUS patch (with light modification), and got a seemingly-new status code |
I have found a message that contains something helpful. In that message, there is a link to this URL: In that zip file there are some PDFs. 13:10: Reserved. |
So, S5_RESET_STATUS = 0x00800800 means: So, this implies the CPU caused the shutdown. |
Also from the PDF document linked above: Shutdown (message from CPU): SYNC_FLOOD (message) |
Does this (warm reset) mean the system, i.e. the EC/Kernel/CPU (obviously, i don't get it), can/could recover from this happening? |
Sometimes, this S5_RESET_STATUS patch might work better: |
This S5_RESET_STATUS patch should apply to kernel 6.13.5 kernel: |
It probably refers to https://en.wikipedia.org/wiki/Reboot#Cold_versus_warm_reboot , the difference seeming to be whether the system performs a complete POST sequence or just a small part. Doesn't sound recoverable unfortunately . |
This seems vaguely familiar ... Since I am running primarily on Windows, extensive Linux testing is unlikely, but I have a Linux install (coincidentally also seem to have sleep issues). However, if there are things I can do to help either of those issues. I would gladly help. Is there a way for me to like, tap into the bus? There is a EC header, and I have logic analyzers. I can solder wires to onboard headers and monitor it externally. Might be a case where a tiny second laptop like my GPD would come in handy. |
@CDRXavier
We are only focusing on (1) in this ISSUE ticket. |
@jcdutton Would you like me to make a new ticket for (2) because that was the cause of me creating the conversation in the support forums which has had a lot of convo in it. I felt like (2) is more common pain point for everyone. EDIT: To Be Honest, i thought this ticket was all about (2) really |
@PureKrome I would join you in that ticket. I am almost certain that my issue is (2), but since installing 6.13.5 with the recommended patch yesterday I haven't been able to repro. I've seen it a lot between 6.12 and 6.13.4. Anyone seen it on 6.13.5 yet? In case I haven't mentioned it yet I have a FW 13 Ryzen 7 7840U with these cards:
|
Hi All,
We are only focusing on (1) in this ISSUE ticket. Please do not report here if you have (2). Report it in a new issue. |
@jcdutton That issue has been reported here in the official FW Community Forums. Do I need to also create a ticket here in GH? (also here's another link in the FW CF for another potentially related one) |
@PureKrome @jcdutton I've done it: #50 Can we work on centralizing folks around this issue? Please spread the word. I would really like to hear back from a Framework engineer because I cannot use my laptop because of this. |
FWIW, i'm now running 6.14-rc6 with this patch. |
Framework 16 AMD Ryzen 7 7840HS using Radeon 780M I use KDE Plamsa powerdevil to change the power profiles Framework port usage:
I just had a FTR
No external display was in use, none plugged in and the main display ( Framework ) refresh rate set too 165Hz with ABM ( Active Backlight Management ) set to 2 via kernel command line parameter
The last FTR I had was with kernel 6.14-rc1 or rc2 |
Should we continue to report every instance of this (S5_RESET_STATUS = 0x08000800) happening, or is there anything else we can do regarding this issue? To be frank, i don't see Framework taking any interest in this at all (except from what appears to be "managing the community"), and i have no desire to compile the kernel from scratch everytime i'm doing updates forever until Framework feels like it.. or perhaps not. Who would know. Besides, there appears to be nothing new to report, always the same, already reported many times, circumstances. |
@sydneymeyer I think it is fair to say that the problem is not reproducible on demand. Its occurrence is apparently random. I have been asked by FW support to do: You might wish to try it also, to see if it helps or not. |
Device Information
System Model or SKU
[ ] Framework Laptop 16 (AMD Ryzen™ 7040 Series)
No dGPU.
BIOS VERSION
3.0.5
Windows:
N/A
Linux:
Open a terminal and run the following command
sudo dmidecode --string bios-version
03.05
DIY Edition information
Memory: Manufacture and SKU
Kingston Fury Impact: Part Number: KF556S40-32
2x making 64GB total.
Storage: Manufacture and SKU
Model Number: WD_BLACK SN850X 1000GB
Firmware Version: 620361WD
Port/Peripheral information
Standalone Operation
Are you running your mainboard as a standalone device. Is standalone mode enabled in the BIOS?
Describe the bug
S5_RESET_STATUS = 0x08000800 <- Sync Flood.
Occasionally, about once a month I get a random crash/freeze then about 20 seconds later a reboot.
There is some details in this community thread:
https://community.frame.work/t/frwk16-random-crash-then-reboots/62411/31
This issue is only for "Freeze then Reboot" issues. Not "Freeze then power off".
Steps To Reproduce
Steps to reproduce the behavior:
Note: I generally have the power plugged in most of the time. For all the FTR I have seen, the power was plugged in at the time. The PSU used is the FW provided one that comes with the FW16.
Note: The FW16 was not under high load. the cpu fans were not audibly running. I.e. I could not hear them above the netflix / youtube video playing.
Expected behavior
It should not randomly freeze then reboot. (FTR)
Screenshots
N/A
Operating System (please complete the following information):
uname -a
6.12.7 <- Mainline compiled kernel.Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: