Adding VMware platforms support such as vSphere to Ray Autoscaler #37815
Signed-off-by: Shubham Urkade <surkade@vmware.com>
dc571fc to 322a15e
Hi @Shubhamurkade, thanks for the work you've put into the PR. The PR is quite large and that makes it hard to review thoroughly. It would be really helpful if you could split the PR into a series of smaller PRs. Do you see a good way to split it up?
.. tab-item:: vSphere

    .. code-block:: shell

        $ pip install vsphere-automation-sdk-python
- .. tab-item:: vSphere
-     .. code-block:: shell
-         $ pip install vsphere-automation-sdk-python
+ .. tab-item:: vSphere (Experimental)
+     .. code-block:: shell
+         $ pip install -U "ray[default]" vsphere-automation-sdk-python
Would opt for labeling this as experimental for now until the feature sees more users and becomes more stable.
$ export VSPHERE_SERVER=192.168.0.1 # Enter your vSphere IP
$ export VSPHERE_USER=user # Enter your user name
$ export VSPHERE_PASSWORD=password # Enter your password
Might be good to link to relevant vSphere documentation here, similar to how we link to GCP docs for credentials above.
Hi @architkulkarni, there's no standard way to provide credentials to vSphere. Our code just expects the user to provide these credentials as environment variables.
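For readers following the thread, here is a minimal sketch of what reading those credentials from the environment could look like. The variable names come from the documentation snippet under review; the helper name `load_vsphere_credentials`, the returned dictionary shape, and the error handling are assumptions for illustration and are not taken from the PR's code.

```python
import os
from typing import Dict, Mapping, Optional

# Environment variable names as they appear in the docs snippet under review.
# The dictionary keys ("server", "user", "password") are an assumed layout.
REQUIRED_VARS = {
    "server": "VSPHERE_SERVER",
    "user": "VSPHERE_USER",
    "password": "VSPHERE_PASSWORD",
}


def load_vsphere_credentials(
    env: Optional[Mapping[str, str]] = None,
) -> Dict[str, str]:
    """Hypothetical helper: collect vSphere credentials from environment variables."""
    env = os.environ if env is None else env
    # Treat unset and empty variables the same way, and report all of them at once.
    missing = [name for name in REQUIRED_VARS.values() if not env.get(name)]
    if missing:
        raise RuntimeError(
            "Missing vSphere environment variables: " + ", ".join(missing)
        )
    return {key: env[name] for key, name in REQUIRED_VARS.items()}
```

Calling `load_vsphere_credentials()` with no argument reads from `os.environ`; passing a plain dictionary instead makes the helper easy to exercise in a unit test without touching the real environment.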
Hi @architkulkarni The majority of the changes are in
Please let me know if you would need more support.
I see. In that case, I like your idea 3--would you mind adding some documentation, perhaps in an No need to split the docs and sample YAML into another PR. I'll review this PR piece by piece but may not have time to review all of it at once.
I've reviewed up to create_node in node_provider.py, only minor comments so far.
vsphere_credentials["user"],
vsphere_credentials["password"],
VmwSdkClient.SessionType.UNVERIFIED,
VmwSdkClient.ClientType.PYVMOMI_SDK,
What's the difference between automation and pyvmomi?
The Automation SDK uses the REST interface, while pyVmomi is backed by the VMware SOAP APIs.
We are currently using APIs from both. We plan to move to just one of them once we identify the necessary support.
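To make the REST versus SOAP distinction concrete, the sketch below shows how a session is typically opened with each SDK on its own. It is not the PR's `VmwSdkClient`; the function names are hypothetical, and disabling certificate verification is an assumption made only to mirror the "unverified" session type mentioned above. Both SDKs are third-party packages, so they are imported lazily inside the functions.

```python
import ssl


def connect_pyvmomi(server: str, user: str, password: str):
    """Hypothetical helper: open a SOAP session through pyVmomi."""
    # Lazy import so this module loads even when pyvmomi is not installed.
    from pyVim.connect import SmartConnect

    # Assumption: skip certificate checks to mimic an unverified session.
    # Use a verifying SSL context outside of test environments.
    context = ssl.create_default_context()
    context.check_hostname = False
    context.verify_mode = ssl.CERT_NONE
    return SmartConnect(host=server, user=user, pwd=password, sslContext=context)


def connect_automation_sdk(server: str, user: str, password: str):
    """Hypothetical helper: open a REST session through the vSphere Automation SDK."""
    # Lazy imports so this module loads even when the SDK is not installed.
    import requests
    from vmware.vapi.vsphere.client import create_vsphere_client

    session = requests.Session()
    session.verify = False  # Assumption: unverified TLS, as in the SOAP helper.
    return create_vsphere_client(
        server=server, username=user, password=password, session=session
    )
```

Each helper returns that SDK's own client object, so code that needs calls from both SDKs has to hold two separate sessions, which is one reason to converge on a single SDK later.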
Signed-off-by: Chen Hui <huchen@vmware.com>
Signed-off-by: Chen Hui <huchen@vmware.com>
update the test-script to support test ray-on-vsphere
instant clone only, and address review comments
Thanks for adding the doc, will continue the review shortly!
Added, thanks for pointing that out.
It looks like this got triggered from the force-push. It doesn't look like the file was actually changed, so from an external code perspective, we can go ahead.
Any ideas about this build failure? Not sure what could be causing it off the top of my head. https://buildkite.com/ray-project/oss-ci-build-pr/builds/31726#0189f389-1c82-452a-a638-1c0b7a963aec/5460-5485 Happens to several (all?) python tests: https://buildkite.com/ray-project/oss-ci-build-pr/builds/31726#0189f389-1c6b-4ef2-98e4-a1a14c9876e8/1741-1781
python/ray/tests/BUILD
Outdated
py_test(
    name = "test_vsphere_node_provider",
    size = "small",
    srcs = ["gcp/test_vsphere_node_provider.py"],
-     srcs = ["gcp/test_vsphere_node_provider.py"],
+     srcs = ["vsphere/test_vsphere_node_provider.py"],
I think this is causing the build failure
Oh gosh, will fix this right now.
Fixed in the latest commit, thanks for helping us troubleshoot!
Signed-off-by: Chen Jing <jingch@vmware.com>
Yes, we previously had one PR that tried to fix the DCO issue, but it failed, so we did a force push to revert it and avoid messing up the commits.
Signed-off-by: Chen Jing <jingch@vmware.com>
Signed-off-by: Chen Jing <jingch@vmware.com>
Signed-off-by: Chen Jing <jingch@vmware.com>
Signed-off-by: Chen Jing <jingch@vmware.com>
Signed-off-by: Chen Jing <jingch@vmware.com>
Signed-off-by: Chen Jing <jingch@vmware.com>
…y-project#37815) --------- Signed-off-by: Shubham Urkade <surkade@vmware.com> Signed-off-by: Chen Hui <huchen@vmware.com> Signed-off-by: Chen Jing <jingch@vmware.com> Co-authored-by: Chen Hui <huchen@vmware.com> Co-authored-by: Chen Jing <jingch@vmware.com> Signed-off-by: e428265 <arvind.chandramouli@lmco.com>
Similar to other providers, we add example-minimal.yaml and example-full.yaml to vSphere autoscaler. And we add and refine vSphere related references in the Getting Started guide as well as the cluster configuration reference page, based on the newly added examples. Why are these changes needed? In PR #37815 we've added vSphere platform support to Ray Autoscaler. However, the related documents are not sufficient. This follow-up change adds related examples similar to other platforms. The related documents including the getting-started guide as well as the cluster configuration reference also need to be updated to include descriptions specific for vSphere. We will do another follow-up PR to add a "Launching Ray Clusters on vSphere" user guide at https://docs.ray.io/en/latest/cluster/vms/user-guides/launching-clusters/index.html Signed-off-by: Fangchi Wang <wfangchi@vmware.com>
…project#39379) Similar to other providers, we add example-minimal.yaml and example-full.yaml to vSphere autoscaler. And we add and refine vSphere related references in the Getting Started guide as well as the cluster configuration reference page, based on the newly added examples. Why are these changes needed? In PR ray-project#37815 we've added vSphere platform support to Ray Autoscaler. However, the related documents are not sufficient. This follow-up change adds related examples similar to other platforms. The related documents including the getting-started guide as well as the cluster configuration reference also need to be updated to include descriptions specific for vSphere. We will do another follow-up PR to add a "Launching Ray Clusters on vSphere" user guide at https://docs.ray.io/en/latest/cluster/vms/user-guides/launching-clusters/index.html Signed-off-by: Fangchi Wang <wfangchi@vmware.com>
…) (#39399) Similar to other providers, we add example-minimal.yaml and example-full.yaml to vSphere autoscaler. And we add and refine vSphere related references in the Getting Started guide as well as the cluster configuration reference page, based on the newly added examples. Why are these changes needed? In PR #37815 we've added vSphere platform support to Ray Autoscaler. However, the related documents are not sufficient. This follow-up change adds related examples similar to other platforms. The related documents including the getting-started guide as well as the cluster configuration reference also need to be updated to include descriptions specific for vSphere. We will do another follow-up PR to add a "Launching Ray Clusters on vSphere" user guide at https://docs.ray.io/en/latest/cluster/vms/user-guides/launching-clusters/index.html Signed-off-by: Fangchi Wang <wfangchi@vmware.com> Co-authored-by: Fangchi Wang <wfangchi@vmware.com>
Why are these changes needed?
Add support for Ray to run on vSphere.
Related issue number
Checks
git commit -s) in this PR.
scripts/format.sh to lint the changes in this PR.
method in Tune, I've added it in doc/source/tune/api/ under the corresponding .rst file.