Skip to content

Conversation

@elliot-barn
Copy link
Contributor

@elliot-barn elliot-barn commented Oct 7, 2025

upgrading batch inference tests to py3.10

Successful release test run: https://buildkite.com/ray-project/release/builds/65258

all except for image_embedding_from_jsonl are running on python 3.10

Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request upgrades several batch inference and Dask-on-Ray tests to Python 3.10. The changes are straightforward and correct. To improve the maintainability of these YAML configuration files, I've suggested using YAML anchors and aliases. This approach centralizes the Python version, making future updates across these related tests simpler and less prone to error. It's a small refactoring that will enhance the configuration's clarity and ease of maintenance.

# 300 GB image classification parquet data up to 10 GPUs
# 10 g4dn.12xlarge.
- name: "image_classification_{{scaling}}"
python: "3.10"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

To improve maintainability and reduce repetition, you can define a YAML anchor for the Python version here. This allows you to reuse the value in other test configurations within this file by using an alias, making future updates to the Python version much easier and less error-prone.

  python: &py310 "3.10"

--data-directory 300G-image-data-synthetic-raw-parquet --data-format parquet
- name: image_classification_chaos
python: "3.10"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

Please use the YAML alias *py310 defined earlier. This will make the configuration more maintainable.

  python: *py310

--data-directory 300G-image-data-synthetic-raw-parquet --data-format parquet --chaos-test
- name: image_classification_chaos_no_scale_back
python: "3.10"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

Let's use the *py310 alias here for consistency.

  python: *py310

- name: image_embedding_from_uris_{{case}}
python: "3.10"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

Using the *py310 alias here will help with future maintenance.

  python: *py310



- name: image_embedding_from_jsonl_{{case}}
python: "3.10"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

Same as above, please use the *py310 alias.

  python: *py310

script: python image_embedding_from_jsonl/main.py {{args}}

- name: text_embedding_{{case}}
python: "3.10"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

And here as well, please use the *py310 alias.

  python: *py310

- __suffix__: aws

- name: chaos_dask_on_ray_large_scale_test_no_spilling
python: "3.10"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

To improve maintainability and reduce repetition, you can define a YAML anchor for the Python version here. This allows you to reuse the value in the subsequent test configuration within this file by using an alias, making future updates to the Python version easier.

  python: &py310 "3.10"

cluster_compute: dask_on_ray/dask_on_ray_stress_compute_gce.yaml

- name: chaos_dask_on_ray_large_scale_test_spilling
python: "3.10"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

To maintain consistency and improve maintainability, please use the YAML alias *py310 here, as defined in the chaos_dask_on_ray_large_scale_test_no_spilling test.

  python: *py310

@ray-gardener ray-gardener bot added data Ray Data-related issues release-test release test labels Oct 8, 2025
Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
@elliot-barn elliot-barn requested a review from aslonnie October 15, 2025 18:22
Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
@elliot-barn elliot-barn requested a review from aslonnie October 24, 2025 19:23
@aslonnie aslonnie added the go add ONLY when ready to merge, run all tests label Oct 25, 2025
@aslonnie aslonnie self-requested a review October 25, 2025 04:04
@aslonnie aslonnie merged commit 507cbb2 into master Oct 25, 2025
6 of 7 checks passed
@aslonnie aslonnie deleted the elliot-barn/release-data-batch-inference-tests branch October 25, 2025 04:24
xinyuangui2 pushed a commit to xinyuangui2/ray that referenced this pull request Oct 27, 2025
upgrading batch inference tests to py3.10

Successful release test run:
https://buildkite.com/ray-project/release/builds/65258

all except for image_embedding_from_jsonl are running on python 3.10

---------

Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
Signed-off-by: xgui <xgui@anyscale.com>
landscapepainter pushed a commit to landscapepainter/ray that referenced this pull request Nov 17, 2025
upgrading batch inference tests to py3.10

Successful release test run:
https://buildkite.com/ray-project/release/builds/65258

all except for image_embedding_from_jsonl are running on python 3.10

---------

Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
Aydin-ab pushed a commit to Aydin-ab/ray-aydin that referenced this pull request Nov 19, 2025
upgrading batch inference tests to py3.10

Successful release test run:
https://buildkite.com/ray-project/release/builds/65258

all except for image_embedding_from_jsonl are running on python 3.10

---------

Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
Signed-off-by: Aydin Abiar <aydin@anyscale.com>
Future-Outlier pushed a commit to Future-Outlier/ray that referenced this pull request Dec 7, 2025
upgrading batch inference tests to py3.10

Successful release test run:
https://buildkite.com/ray-project/release/builds/65258

all except for image_embedding_from_jsonl are running on python 3.10

---------

Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
Signed-off-by: Future-Outlier <eric901201@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

data Ray Data-related issues go add ONLY when ready to merge, run all tests release-test release test

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants