Skip to content

Gb/aws #339

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 22 commits into from
Dec 21, 2021
Merged

Gb/aws #339

merged 22 commits into from
Dec 21, 2021

Conversation

grantbuster
Copy link
Member

No description provided.

@grantbuster grantbuster mentioned this pull request Dec 21, 2021
Copy link
Collaborator

@MRossol MRossol left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is SO COOL!

@grantbuster
Copy link
Member Author

Haha hey there! Yes we did it! Did you check out the aws_pcluster readme? We tried a few things and i figured out how to get HSDS local servers set up with minimal effort. It runs!

@grantbuster
Copy link
Member Author

And i should note that we started scoping this out and then realized "hey we can already do all of this thanks to michael and john readey!"

@MRossol
Copy link
Collaborator

MRossol commented Dec 21, 2021

Well done, glad it wasn't to "scary"! So by local HSDS cluster are you just running a single docker container on each "node" or are you running a HSDS cluster in parallel to the "nodes".

@grantbuster
Copy link
Member Author

grantbuster commented Dec 21, 2021

A little scary but I ended up getting hooked and needed to figure it out haha.

Yeah so we got lambda working but the problem with that is reV has really small regular requests and each lambda call needs to spin up a service which has a lot of overhead.

The current solution is to submit reV jobs on the AWS parallel cluster via slurm (just like the NREL HPC). When a reV job gets an EC2 node it runs a shell script first that checks to see if there is an HSDS server running on that node. If the node just spun up, the shell script starts the local HSDS server on that node with N parallel docker instances. If the node already has an HSDS server running, it just moves on. It works pretty well!!

Kubernetes for the HSDS service works too but its more setup and requires more EC2 instances so i think we'll just put this aside for now.

@MRossol
Copy link
Collaborator

MRossol commented Dec 21, 2021

Thats awesome! Well done and yay for HSDS!

@grantbuster
Copy link
Member Author

yay indeed! you and John kicked ass.

@grantbuster grantbuster merged commit 0a0ee79 into main Dec 21, 2021
@grantbuster grantbuster deleted the gb/aws branch December 21, 2021 17:32
github-actions bot pushed a commit that referenced this pull request Dec 21, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants