Removed limit from the test #5033

razajafri · 2022-03-23T23:22:52Z

The order isn't guaranteed when using limit which causes the test to fail intermittently

Removed limit
added ignore_order

fixes #5021

Signed-off-by: Raza Jafri rjafri@nvidia.com

Signed-off-by: Raza Jafri <rjafri@nvidia.com>

razajafri · 2022-03-24T00:19:24Z

build

gerashegalov · 2022-03-24T00:40:00Z

integration_tests/src/main/python/cache_test.py

@@ -122,7 +122,7 @@ def test_cache_reverse_order(enable_vectorized_conf):
    col1 = StructGen([['child0', byte_gen]])
    def partial_return():
        def partial_return_cache(spark):
-            return two_col_df(spark, col0, col1).select(f.col("a"), f.col("b")).cache().limit(50).select(f.col("b"), f.col("a"))
+            return two_col_df(spark, col0, col1).select(f.col("a"), f.col("b")).cache().select(f.col("b"), f.col("a"))


Is the order really guaranteed without limit? Adding @ignore_order test marker looks like a more intuitive solution to me

That will only sort the collected list AFAIK which by the time it comes back to the driver is too late, no?

Or are you asking to add ignore_order in addition to the change? I can do that

You are right about the local sort, but you can also say @ignore_order(local=False)

and you are right about removing limit, since this ends up just sorting a non-deterministic subset instead of taking the head of a sorted list.

So, I would expect that to work but setting local=False still applies sort after the limit resulting in the test to fail.

As discussed offline, I will remove the limit and add ignore_order

gerashegalov · 2022-03-24T00:42:13Z

integration_tests/run_pyspark_from_build.sh

@@ -178,7 +178,7 @@ else
    export PYSP_TEST_spark_driver_extraClassPath="${ALL_JARS// /:}"
    export PYSP_TEST_spark_executor_extraClassPath="${ALL_JARS// /:}"
    export PYSP_TEST_spark_driver_extraJavaOptions="-ea -Duser.timezone=UTC $COVERAGE_SUBMIT_FLAGS"
-    export PYSP_TEST_spark_executor_extraJavaOptions='-ea -Duser.timezone=UTC'
+    export PYSP_TEST_spark_executor_extraJavaOptions="-ea -Duser.timezone=UTC $COVERAGE_SUBMIT_FLAGS_EXEC"


We have an issue for this: #4948 . It should be a separate PR.

Signed-off-by: Raza Jafri <rjafri@nvidia.com>

razajafri · 2022-03-24T18:40:53Z

build

removed limit

618af14

Signed-off-by: Raza Jafri <rjafri@nvidia.com>

razajafri requested a review from gerashegalov March 23, 2022 23:23

gerashegalov reviewed Mar 24, 2022

View reviewed changes

sameerz added the test Only impacts tests label Mar 24, 2022

sameerz added this to the Mar 21 - Apr 1 milestone Mar 24, 2022

razajafri added 2 commits March 24, 2022 11:07

revert change to add executor flags

427b492

Signed-off-by: Raza Jafri <rjafri@nvidia.com>

adding ignore_order to guarantee the order

0b914fc

Signed-off-by: Raza Jafri <rjafri@nvidia.com>

gerashegalov approved these changes Mar 24, 2022

View reviewed changes

razajafri merged commit 2c4c91f into NVIDIA:branch-22.04 Mar 24, 2022

razajafri deleted the SR-5021-cache-reverse branch May 11, 2022 19:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Removed limit from the test #5033

Removed limit from the test #5033

razajafri commented Mar 23, 2022 •

edited

Loading

razajafri commented Mar 24, 2022

gerashegalov Mar 24, 2022 •

edited

Loading

razajafri Mar 24, 2022

razajafri Mar 24, 2022

gerashegalov Mar 24, 2022

gerashegalov Mar 24, 2022

razajafri Mar 24, 2022

gerashegalov Mar 24, 2022

razajafri commented Mar 24, 2022

Removed limit from the test #5033

Removed limit from the test #5033

Conversation

razajafri commented Mar 23, 2022 • edited Loading

razajafri commented Mar 24, 2022

gerashegalov Mar 24, 2022 • edited Loading

Choose a reason for hiding this comment

razajafri Mar 24, 2022

Choose a reason for hiding this comment

razajafri Mar 24, 2022

Choose a reason for hiding this comment

gerashegalov Mar 24, 2022

Choose a reason for hiding this comment

gerashegalov Mar 24, 2022

Choose a reason for hiding this comment

razajafri Mar 24, 2022

Choose a reason for hiding this comment

gerashegalov Mar 24, 2022

Choose a reason for hiding this comment

razajafri commented Mar 24, 2022

razajafri commented Mar 23, 2022 •

edited

Loading

gerashegalov Mar 24, 2022 •

edited

Loading