[air/benchmarks] Fix typo in tensorflow_benchmark.py script preventing proper error surfacing #32269

krfricke · 2023-02-07T17:57:18Z

Signed-off-by: Kai Fricke <kai@anyscale.com>

Why are these changes needed?

A small typo in the tensorflow_benchmark.py script prevents it from detecting when a vanilla TF run has failed three times. As a result, we previously recorded a training time of 0.0 for vanilla TF, which skews the calculated average and makes vanilla TF appear to outperform Ray Train. The script should instead raise an error to surface the problem.
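
For illustration, the intended behavior can be sketched roughly as follows. This is a minimal, hypothetical sketch and not the code in tensorflow_benchmark.py: the function name `timed_run_with_retries`, the `train_fn` callable, and the `max_attempts` parameter are assumptions made for this example. It only shows the pattern of raising once all attempts have failed instead of falling through with a default time of 0.0.

```python
import time
from typing import Callable, Optional


def timed_run_with_retries(train_fn: Callable[[], None], max_attempts: int = 3) -> float:
    """Return the wall-clock time of the first successful call to train_fn.

    Hypothetical helper: raises instead of returning 0.0 when every attempt fails.
    """
    last_error: Optional[Exception] = None
    for _ in range(max_attempts):
        start = time.monotonic()
        try:
            train_fn()
        except Exception as exc:
            # Remember the failure and try again.
            last_error = exc
            continue
        # Success: report how long this attempt took.
        return time.monotonic() - start
    # All attempts failed: surface the problem rather than recording a bogus time.
    raise RuntimeError(f"Training failed {max_attempts} times") from last_error
```

With a pattern like this, a run that never succeeds stops the benchmark with an error, so no 0.0 entry can enter the average.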

Related issue number

Closes #31882

Checks

I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

[air/benchmarks] Fix typo in tensorflow_benchmark.py script preventin…

33d6311

…g proper error surfacing Signed-off-by: Kai Fricke <kai@anyscale.com>

krfricke requested a review from amogkam February 7, 2023 17:57

krfricke assigned amogkam Feb 7, 2023

matthewdeng approved these changes Feb 7, 2023

krfricke merged commit c83111a into ray-project:master Feb 7, 2023

krfricke deleted the air/benchmark-tf-typo branch February 7, 2023 18:31
