-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Streamlines task in parallel using ray #1136
Conversation
…tiple trx files in parallel and concats upon finish
Hello @asagilmore! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:
Comment last updated at 2024-05-24 20:00:56 UTC |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we incorporate this into the example?
@36000 : any objections? We'd need to install ray on the CI.
To handle non-random seeding, where there may be less that 1 seed per chunk after splitting up the seeds for ray workers
And then using that function here:
This works well, but this bypasses the default arguments from track(), which I have copied to the helper function. If those are ever changed the default arugments here would not be changed and it would exhibit unexpected behavior. Is there any way to make this always match the track() default arugments? |
From my perspective this looks good now. You can add Line 97 in 2889d7a
Then you can also try making it so the |
+1 Have you had a chance to run some more experiments with this? Are you still consistently seeing pretty substantial speedup with this? |
Nice! ⚡ I think that we can mark this ready to review and after you add it to the dependencies and documentation, we can go ahead and merge it. For now, we can keep this as an optional feature, but we might consider setting this as default on some future release (probably after we let users kick the tires on this some more). |
I think I have correctly added it to the documentation and dependencies, but please let me know if I have done it incorrectly I am not sure exactly where everything is supposed to be. |
86b9240
to
908fe7d
Compare
908fe7d
to
87e5ebe
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just some small comments. Your last commit is now running on the CI, so we'll see how that works on the documentation build action.
I've set this to "Ready for review" and removed the "WIP" from the title of the PR, because I think we're close. |
73f11f7
to
7a6d2a1
Compare
Co-authored-by: Ariel Rokem <arokem@gmail.com>
this looks good, if the docs pass, we can merge! |
This adds a num_chunks argument to tracking_params which causes the streamlines task to generate multiple trx files in parallel using ray, and then concatenate them at the end.