-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[tests] Ray nightly image tests with pandas+numpy fails with TensorDType error #2452
Comments
Seems to be okay on the CI now - will investigate this if it starts showing up again |
Seems like the issue is back, will investigate: https://github.com/ludwig-ai/ludwig/runs/8273624112?check_suite_focus=true |
@arnavgarg1 the CI was okay because the test is being skipped on nightly. We still need to fix it. It's curious that now it's showing up for ray 2.0 tests as well all of a sudden. |
@arnavgarg1 re-opening this issue to track. |
@tgaddair Thanks for re-opening the issue. It seems like this happens because of the non-determinism of "nan_percentage", particularly in cases where the last row in a partition is NaN since the missing value strategy is I'll create a fix that involves ensuring that the last row of our random sampling isn't a NaN so that this situation is avoided. Might be worth calling out in our documentation somewhere as well since this can cause other errors downstream beyond our tests. |
Actually, the better way might be to do a bfill followed by ffill, or ffill followed by bfill to ensure there's never any NaNs |
https://github.com/ludwig-ai/ludwig/runs/8177672264?check_suite_focus=true#step:10:7153
For some reason this does not repro locally, so could be an issue with different versions of pyarrow or another dependency.
The text was updated successfully, but these errors were encountered: