-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dist-upgrade process stuck - migration not progressing #384
Comments
Hello. |
Hi, sorry, I thought I attached the archive already to my initial post. Let me attach it to this comment here. Thank you for the swift reply, its really appreciated! PS: The archive is too big to be attached to this comment directly, I uploaded it to WeTransfer - the link expires in 7 days: https://we.tl/t-tgcHyvxp5d |
I noticed unusual time behavior on the server on finishing stage:
As you can see, during the final stage of the conversion, the time switched from 20 to 21, 21 to 22, and even 22 back to 21. It seems you might have a remote NTP device providing the current time, or something similar. Is it possible the connection to the device is unstable? Or perhaps you have two different NTP servers giving different time values? The only issue this causes is with the progress bar of the centos2alma conversion. We currently rely on the server's time, which leads to these issues. It might be a good idea to calculate time manually or perhaps disable any NTP synchronization during the conversion. I will have to consider this further. |
Hi, following your latest response I reran the migration after restoring an earlier snapshot of my server (VPS). This time I had to again manually reboot the server. After doing the preparation checks from centos2alma the migration started as usual and all steps/actions were listed and progressing nicely. After the conversion step the server rebooted automatically but did not boot up properly and I was again not able to SSH into my server. So after giving it about 30 minutes of time I decided to reboot the server manually, after which I was able to SSH back into my server. A quick
So I did as you said and ran a
Also Then I ran some further commands to check if the migration went successfully and all of them returned that I am still on CentOS 7.9. Also the Plesk Panel still shows CentOS 7.9 as my current OS.
So I am a bit confused now. Apparently the tool just ran as intended but my OS is not being recognized as AlmaLinux. Any ideas what might cause this? I have attached a new feedback archive from the now apparently successful run. Thank you so much for your support! PS: I just saw that the log file seems to append every call / run. The one that finished this time started at 11:46 and finishes at 12:33 in the logs. |
This seems awkward. I can only assume that the temporary container did not have enough time to install the el8 packages instead of the el7 packages before you restarted the server |
Hey, no, I don't have a console connected to it. It's a virtual private server hosted by a hosting company. I can only SSH or VNC into the server. I will start over again and give it more time. It is just weird because after it reboots for the first time automatically, it looks like the server isn't getting up at all. I cannot ping its IP, SSH into it, or anything. I left it in that state for about 30 minutes the last time before I decided to give it a manual reboot, but I will give it more time now and see what happens... |
Yes, unfortunately, this is the expected behavior of the temporary container that reinstalls packages. I hope to find a way to start the network interface for it in the future or encourage the AlmaLinux/leapp framework developers to address it. |
But is it also normal for it to never come back by itself? I had that situation already yesterday where I left it running overnight and still had to reboot in the morning. Or is the normal behavior that it reboots automatically into an SSH-able state after the temporary container OS did its work? Because then that's a state my server never seems to reach by itself due to some reason. |
OK, I re-did it now again from a fresh snapshot and it went exactly like in this comment here: [https://github.com//issues/384#issuecomment-2449675216](earlier comment) The only difference is, that I left it in the dist-upgrade stage for 80 minutes before forcing the reboot, so it had plenty of time to finish all tasks in the temporary OS container. The results are still the same, the OS still comes back as CentOS 7.9, even after centos2alma reported it finished. The feedback report archive from this run is again attached to this ticket. I am really clueless. Any ideas what I could do here? Thank you for your ongoing support! |
Normally, the temporary container should reboot into SSH-able state. I believe, To move forward, we need to determine the exact reason why the temporary container is stuck. Could you please execute
We might be able to identify something unusual in these reports. At the moment, I have no other suggestions on how to proceed. |
Hi,
I have the following issue:
I am migrating from CentOS 7.9 to Alma Linux using the latest (as of Oct. 30, 2024) centos2alma. After cleaning up a few issues during the prepare stage I was able to get the tool running and the preparation stage looked fine, it also moved into the conversion stage fairly quickly and did the first reboot.
But then it got stuck. I could not SSH into the server for over an hour, so I decided to give the server a manual reboot. After the reboot I was able to SSH into the server again and saw the following welcome message:
When I ran
./centos2alma --status
I got the following message:So I checked with
ps -aux | grep centos2alma
and found a process, which I killed withkill -9 <PID>
as the message above clearly said it would be OK to interrupt the process and the log file did not update for a while.After that I did a reboot of the server.
When I SSH'ed back into the server I saw that no process of centos2alma was running anymore, but still, when I ran
./centos2alma --status
I got the same message with thedist-upgrade is taking too long. It may be stuck...
, even thoughps -aux | grep centos2alma
returns no running processes anymore.Restarting the tool via
./centos2alma &
does not work as well as it only gives the following message:even though no process is running anymore.
Any idea what could have happened and how I can fix the stuck-while-not-running process?
I attached the feedback archive as well.
Thank you for your support - I am running out of ideas what to do now.
The text was updated successfully, but these errors were encountered: