
Make Worker.delete_data sync #3922

Merged · 6 commits merged into dask:master on Jun 26, 2020
Conversation

pentschev
Member

This should fix issues where Worker.delete_data isn't awaited.

Fixes #3920
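To illustrate the bug class this PR addresses, here is a minimal pure-Python sketch (the `AsyncWorker`/`SyncWorker` classes are hypothetical stand-ins, not the real dask `Worker`): calling an `async def` method without `await` only builds a coroutine object, so the body never runs and CPython emits a "never awaited" `RuntimeWarning` when the coroutine is dropped. Making the method a plain function, as this PR does for `Worker.delete_data`, removes that failure mode.

```python
import gc
import warnings

# Hypothetical stand-ins showing why an async delete_data that is
# called without `await` silently does nothing.

class AsyncWorker:
    def __init__(self):
        self.data = {"x": 1}

    async def delete_data(self, keys):
        for k in keys:
            del self.data[k]

class SyncWorker:
    def __init__(self):
        self.data = {"x": 1}

    def delete_data(self, keys):
        for k in keys:
            del self.data[k]

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    w = AsyncWorker()
    w.delete_data(["x"])  # builds a coroutine object that is never awaited
    gc.collect()          # dropping the coroutine emits a RuntimeWarning

never_awaited = any("never awaited" in str(c.message) for c in caught)
print(never_awaited)  # True  -> Python warned us
print(w.data)         # {'x': 1} -> the delete never actually ran

ws = SyncWorker()
ws.delete_data(["x"])  # a plain method runs immediately
print(ws.data)         # {}
```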

Comment on lines 1358 to 1360
self.scheduler.remove_keys(
address=self.contact_address, keys=list(keys)
)
Member

If we don't await a coroutine, it never actually runs, so this code probably has no effect.

However, the TODO note just above it is interesting. Maybe this code isn't necessary? It might be worth looking at git blame here to see why it was added.

Member Author

It's even harder to await a function that doesn't exist, which is the case here. 😄

With git log -c -S remove_keys distributed/scheduler.py, I managed to find that this function was removed somewhere between dc54748 and 2c175e8. Unfortunately that dates back to 2016; I believe this line was a leftover from some commit around that time. I can remove it if that makes sense, but unfortunately I can't dig any deeper.

Member

Yeah I'd be +1 to just dropping this and the TODO.

Member Author

I tentatively dropped it in the newest commit; if there are any concerns with that, I'll revert it.

Comment on lines 1641 to 1645
df = dask.datasets.timeseries(dtypes={"x": int, "y": float}, freq="1s").reset_index(
drop=True
)

await client.compute(df.map_partitions(lambda df: df.__sizeof__()))
Member

Is everything here necessary?

Member Author

It seems that removing await client.compute(df.map_partitions(lambda df: df.__sizeof__())) works just the same, so the dataframe creation alone suffices.

Member Author

Removed in latest commits.

Comment on lines 1656 to 1660
del df
await client.run(_check_data, False)

del df2
await client.run(_check_data, True)
Member

To simplify this logic I recommend that you just check w.data

Member Author

I assume you're suggesting assert len(w.data) > 0 / assert len(w.data) == 0, is that right? If so, that unfortunately fails on the second assertion. Could this be some async black magic? E.g., does client.run force some syncing while w.data is still outdated?

Member

client.run doesn't force any synchronization, but maybe the GC hasn't run yet at this point. Regardless, we should probably make this robust to GC taking a while (GC delays are a major source of intermittent test failures for us). Most tests of this form look like the following:

start = time()
while w.data:
    await asyncio.sleep(0.01)
    assert time() < start + 2
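The poll-with-timeout idiom above can be demonstrated as a self-contained asyncio sketch; `wait_until_empty` and the simulated delayed cleanup are hypothetical illustrations, not dask code:

```python
import asyncio
import time

async def wait_until_empty(container, timeout=2.0, interval=0.01):
    # Poll until the container drains, failing if the deadline passes.
    start = time.time()
    while container:
        await asyncio.sleep(interval)
        assert time.time() < start + timeout, "container did not empty in time"

async def demo():
    data = {"x": 1}
    # Simulate cleanup that happens a little later (like a delayed GC pass).
    asyncio.get_running_loop().call_later(0.05, data.clear)
    await wait_until_empty(data)
    return data

result = asyncio.run(demo())
print(result)  # {}
```

The assertion inside the loop bounds the wait, so a test using this pattern tolerates slow cleanup without hanging forever.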

Member Author

Thanks for the explanation @mrocklin , your suggestion works and that's fixed in latest commits.

Comment on lines 1641 to 1643
df = dask.datasets.timeseries(dtypes={"x": int, "y": float}, freq="1s").reset_index(
drop=True
)
Member

Do we need a dataset this complex? Do we need these dtypes or the reset_index call? If not, let's remove them to keep things as simple as possible for future reviewers.

Almost all tests in this file intentionally use very simple computations, like future = client.submit(inc, 1). I recommend that we do the same here unless having a dataframe with a reset index is critical to test the delete_data operation.

Member Author

Actually, no. I can simplify that a bit.

@pentschev
Member Author

@mrocklin I dropped the test and left only the actual code changes. From my side it's good to go, let me know if there are other things to be addressed.

@mrocklin mrocklin merged commit 93701f8 into dask:master Jun 26, 2020
@mrocklin
Member

This is in. Thanks @pentschev

@pentschev
Member Author

Thanks @mrocklin for reviewing and merging!

@pentschev pentschev deleted the make-delete_data-sync branch November 11, 2020 17:32
Successfully merging this pull request may close these issues.

Await on handle_stream raises missing delete_data await warning
3 participants