Releases: project-codeflare/codeflare-sdk
Releases · project-codeflare/codeflare-sdk
v0.6.1 K8S Support and Local Interaction
What's Changed
- add .venv and build directory to git ignore by @KPostOffice in #179
- pre-commit eof fix by @KPostOffice in #199
- update sdk and demos to ray 2.5 by @MichaelClifford in #163
- Downgrade pydantic to prevent breaking introduced by pydantic 2 by @tedhtchang in #215
- Add test dependencies to pyproject.toml by @jiripetrlik in #225
- change additionall to additional by @vaibssingh in #198
- Add pytest-mock to test dependencies by @jiripetrlik in #234
- K8s Support for SDK by @Maxusmusti in #146
- [205] Fix codeflare-sdk release action by @jiripetrlik in #220
- [Automatic] Coverage Badge Update by @github-actions in #176
- add sdk design doc by @MichaelClifford in #235
- add target users doc by @MichaelClifford in #209
- Adjust main repo language to python by @Maxusmusti in #256
- add: cleanup to unit tests by @carsonmh in #247
- Fixed ingress domain methods by @Bobbins228 in #263
- Force install pydantic module below v2.0.0 by @tedhtchang in #264
- Added conditions before deleting template elements by @Bobbins228 in #244
- Changed min/max workers to single field by @Maxusmusti in #266
- cluster status safeguards by @Maxusmusti in #254
New Contributors
- @vaibssingh made their first contribution in #198
Full Changelog: v0.5.0...v0.6.1
v0.5.0 Further Ray Access and Customization
What's Changed
- Allow interaction with ray client service via Route from outside of OCP cluster by @tedhtchang in #100
- Add support for image pull secrets for Ray Cluster images by @VanillaSpoon in #162
New Contributors
- @Bobbins228 made their first contribution in #167
- @VanillaSpoon made their first contribution in #162
Full Changelog: v0.4.5...v0.5.0
v0.4.5 Bug Fixes and Increased Kubernetes Version Support
What's Changed
- Update testing instruction by @tedhtchang in #140
- Fixed head node configuration to match template by @Maxusmusti in #169
- Now supports Kubernetes python v25.3.0
New Contributors
- @tedhtchang made their first contribution in #140
Full Changelog: v0.4.4...v0.4.5
v0.4.4 Minor Updates
- Loading/submitting and deleting AWs from filesystem
- Notebook update to make additional packages pip installable
v0.4.3 Head Node Usability
Fixes issue where small jobs would fail on OpenShift when Ray scheduled them on head node.
v0.4.2 TorchX Scheduler Updates and Fixes
What's Changed
- Change default namespace logic by @anishasthana in #85
- add rdzv_backend parameter to DDPJobDefinition by @MichaelClifford in #87
- make workspace user accessible by @MichaelClifford in #89
New Contributors
- @anishasthana made their first contribution in #85
Full Changelog: v0.4.1...v0.4.2
Caching Bug Fix
Fixes an issue where old TorchX code was being cached and breaking Job submission until refresh.
v0.4.0 Job Management Update
The SDK now has the ability to:
- Submit jobs to a Ray cluster (via TorchX)
- Submit jobs directly to MCAD (via TorchX)
- Check submitted job status
- Check submitted job logs
- Cancel a submitted job
Also included are other minor bug fixes and improvements (mainly clarity on starting vs. running Ray clusters).
v0.3.1 Allow for user defined ports in Auth
What's Changed
- Remove hard-coded port 6443 in auth by @MichaelClifford in #74
Full Changelog: v0.3.0...v0.3.1
v0.3.0 Info and Auth Updates
Updates:
- New all-encompassing cluster.status() method
- New cluster.details() method
- New cluster.wait_ready() method
- New CodeFlareStatus.STARTING status
- Functions get_current_namespace(), list_all_clusters(), and list_all_queued() now all function as intended
- A ton of bug fixes and error-handling for both regular use and edge-cases (not logged in, login expired, no resources created yet,
- resource failure, cluster setup in-progress, etc.)
- A LOT more testing (up to 95%)
- Improved messaging on both expected and unexpected results
Authentication: abstracted from cluster creation/deletion, now can independently authenticate and logout