Try to fix dropped data in P2P shuffle #6278

Closed
wants to merge 1 commit into from

gjoseph92
Collaborator

Attempt with @graingert at fixing #6277; doesn't work.

Also fixes a couple of other small design issues (ShuffleExtension peeking into implementation details of Shuffle during shuffle_receive/shuffle_inputs_done).

  • Tests added / passed
  • Passes pre-commit run --all-files

@@ -324,7 +350,10 @@ def get_output_partition(
    shuffle = self.shuffles.pop(shuffle_id, None)
    # key missing if another thread got to it first
    if shuffle:
        shuffle.close()
        sync(
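
The `pop(shuffle_id, None)` guard in this hunk can be illustrated in isolation. The following is a minimal sketch, not the code from this PR: the `Shuffle` class, the `finish` helper, and the `"abc"` id are hypothetical stand-ins. It assumes that a single `dict.pop` call is thread-safe (as it is in CPython), so only one thread ever receives the object and closes it.

```python
import threading


class Shuffle:
    """Hypothetical stand-in that only counts how often it is closed."""

    def __init__(self):
        self.close_calls = 0

    def close(self):
        self.close_calls += 1


shuffles = {"abc": Shuffle()}
tracked = shuffles["abc"]  # keep a reference so the counter can be inspected


def finish(shuffle_id):
    # pop() with a default never raises: the first caller gets the object,
    # every later caller gets None and skips the cleanup.
    shuffle = shuffles.pop(shuffle_id, None)
    if shuffle:
        shuffle.close()


threads = [threading.Thread(target=finish, args=("abc",)) for _ in range(8)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(tracked.close_calls)  # 1
```

Using `pop` with a default avoids a separate "check, then delete" step, which is where two threads could otherwise both see the key and both run the cleanup.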
Member

I think it's nicer to combine consecutive sync calls into one async def:

async def close():
    await shuffle.close()
    await self.worker.scheduler.shuffle_register_complete(id=shuffle_id, worker=self.worker.address)

sync(self.worker.loop, close)
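
To make the suggestion concrete without depending on a running worker, here is a self-contained sketch using only `asyncio` and `threading`. The `sync` function below is a simplified stand-in that loosely mirrors the `sync(loop, func, *args)` calling convention used above; it is not the implementation from `distributed`. `close_shuffle`, `register_complete`, the shuffle id, and the worker address are likewise hypothetical placeholders.

```python
import asyncio
import threading


def start_background_loop():
    """Run an event loop in a daemon thread, roughly like a worker's IO loop."""
    loop = asyncio.new_event_loop()
    thread = threading.Thread(target=loop.run_forever, daemon=True)
    thread.start()
    return loop, thread


def sync(loop, func, *args, **kwargs):
    """Simplified stand-in: run a coroutine function on `loop` from another
    thread and block until its result is available."""
    future = asyncio.run_coroutine_threadsafe(func(*args, **kwargs), loop)
    return future.result()


events = []


async def close_shuffle():
    events.append("shuffle closed")


async def register_complete(shuffle_id, worker):
    events.append(f"registered {shuffle_id} on {worker}")


async def close():
    # Both steps run back to back on the event loop, in order,
    # with a single hop from the calling thread.
    await close_shuffle()
    await register_complete("abc", "tcp://127.0.0.1:1234")


loop, thread = start_background_loop()
sync(loop, close)  # one blocking call instead of two consecutive ones

# Shut the background loop down cleanly.
loop.call_soon_threadsafe(loop.stop)
thread.join()
loop.close()

print(events)
# ['shuffle closed', 'registered abc on tcp://127.0.0.1:1234']
```

Wrapping the two awaits in one coroutine means the calling thread blocks once, and nothing else initiated from that thread can be scheduled between the close and the registration.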

Collaborator Author

Agreed, I was just lazy here

@github-actions
Contributor

github-actions bot commented May 5, 2022

Unit Test Results

    16 files ±0       16 suites ±0    7h 56m 45s ⏱️ +35m 11s
 2 756 tests ±0    2 673 ✔️ -1       78 💤 ±0     4 ❌ ±0    1 🔥 +1
22 010 runs  ±0   20 975 ✔️ -13   1 022 💤 +4    12 ❌ +8    1 🔥 +1

For more details on these failures and errors, see this check.

Results for commit cdbddc4. ± Comparison against base commit 7bd6442.

@graingert
Member

see #6279

@fjetter
Member

fjetter commented Oct 19, 2022

Next time something like this happens, please provide a bit of information about why it doesn't work. A reproducer of the issue would also be welcome.

@hendrikmakait
Member

This should be superseded by #7486 and previous changes.

4 participants