Check PSU on MOC-R4PAC21U23-S3 #1442

Open
naved001 opened this issue Dec 9, 2024 · 4 comments
Assignees
Labels
mghpcc MGHPCC related tasks techsquare

Comments

@naved001

naved001 commented Dec 9, 2024

Service tag: GTJV4Z2
Host: MOC-R4PAC21U23-S3-OBM
Location: Row4 PodA Cage 21 U23 Slot 3

One of the power supplies has failed on this host. The node does not power on even though the other PSU reports being up.


@naved001 naved001 added mghpcc MGHPCC related tasks techsquare labels Dec 9, 2024
@msdisme

msdisme commented Dec 10, 2024

_email to moca-contact@techquare:

(cced Scott because I was not sure moca-contact is still active 😊)

Hi,

Could you please take a look at this issue: #1442? If it is as simple as a blown power supply, you can grab one from the spare parts rack. Although one PSU is still working, the host will not power on when we try remotely.

If you do not think you will be able to get to this before December 13th, please let us know in the ticket so we can arrange a different system for the researchers to work on.

Ideally, we would like the machine back in service. If that is not possible, we would like to keep the disk state, as one of the researchers has some uncommitted changes on it ☹.

Thanks!

Michael Daitzman_

@Jperez3313

Hi all,

I initially replaced it with one from the spare parts rack, but the replacement didn’t work. I then swapped the power cable to rule out a faulty cable, but that wasn’t the issue either. To double-check, I swapped the two power supplies within the server itself. Seems like the issue isn’t with the PSU but rather with the PSU socket in the server.

Let me know if I can be of any further assistance.

Cheers,
Jackson, TechSquare for MOC

@naved001
Author

Hi @Jperez3313

Could you please move the 2 drives from the dead server (MOC-R4PAC21U23-S3-OBM, GTJV4Z2) to another server (MOC-R4PAC24U17-S1)? I think the destination server should have enough slots.

Destination Server details are:
Location: Row 4 Pod A Cage 24 Unit 17 Slot 1
Host: MOC-R4PAC24U17-S1
Service Tag: DBGHBZ2
IPMI IP: 10.2.11.91

Thank you!

@Jperez3313

@naved001 All set!
Cheers,
Jackson
