Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clean up of unpack_remotedata() #7322

Merged
merged 3 commits into from
Nov 24, 2022
Merged

Conversation

madsbk
Copy link
Contributor

@madsbk madsbk commented Nov 17, 2022

Separate the handling of futures with the argument of a SubgraphCallable and futures within the SubgraphCallable itself.

Closes #7289
Closes #7303

  • Tests added / passed
  • Passes pre-commit run --all-files

@github-actions
Copy link
Contributor

github-actions bot commented Nov 17, 2022

Unit Test Results

See test report for an extended history of previous test failures. This is useful for diagnosing flaky tests.

       15 files  ±  0         15 suites  ±0   6h 44m 21s ⏱️ - 15m 51s
  3 226 tests +  5    3 141 ✔️ +  4    84 💤 ±0  1 +1 
23 852 runs  +35  22 942 ✔️ +34  909 💤 ±0  1 +1 

For more details on these failures, see this check.

Results for commit dd7479a. ± Comparison against base commit 69b74f9.

♻️ This comment has been updated with latest results.

@mrocklin
Copy link
Member

This does appear to fix the issue that I was seeing. I'm curious, what is your understanding of the problem and what did you do here to fix it?

@madsbk
Copy link
Contributor Author

madsbk commented Nov 17, 2022

This does appear to fix the issue that I was seeing. I'm curious, what is your understanding of the problem and what did you do here to fix it?

We need to do two things when unpacking futures surrounding SubgraphCallable:

  • We have to unpack all futures in the graph of the SubgraphCallable and add them to the input keys of the SubgraphCallable.
  • Also, we have to unpack futures in the arguments given to the SubgraphCallable. Since they are not used within the SubgraphCallable itself, they do not need to be added to the input keys of the SubgraphCallable, which is what we use to do.

@madsbk madsbk marked this pull request as ready for review November 17, 2022 15:11
@mrocklin
Copy link
Member

@fjetter is this something that you and your team can help to review?

@mrocklin
Copy link
Member

Also I hope that eventually something like #6028 makes things like this irrelevant

@fjetter
Copy link
Member

fjetter commented Nov 21, 2022

Changes to the code appear to be straightforward. Can we add a test for #7289 please? It sounds like we should be able to test this by asserting that the object is not in the output multiple times, shouldn't we?

@madsbk
Copy link
Contributor Author

madsbk commented Nov 21, 2022

Changes to the code appear to be straightforward. Can we add a test for #7289 please? It sounds like we should be able to test this by asserting that the object is not in the output multiple times, shouldn't we?

@mrocklin do you have a small reproducer? Otherwise, I can try to make a synthetic one.

@mrocklin
Copy link
Member

mrocklin commented Nov 21, 2022 via email

@madsbk
Copy link
Contributor Author

madsbk commented Nov 22, 2022

@fjetter added a test: dd7479a

Copy link
Member

@fjetter fjetter left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @madsbk !

@fjetter fjetter merged commit cff33d5 into dask:main Nov 24, 2022
@madsbk madsbk deleted the unpack_remotedata_cleanup branch December 7, 2022 13:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Numba serialization is slow sometimes
3 participants