-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weβll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cloud versioning: fix push/fetch performance #8766
Conversation
@dberenbaum this PR also adds the change where we will not check whether existing It is straightforward to remove that optimization in the future though if we get user feedback, so we can probably merge this for now. |
Codecov ReportBase: 93.54% // Head: 93.55% // Increases project coverage by
Additional details and impacted files@@ Coverage Diff @@
## main #8766 +/- ##
=======================================
Coverage 93.54% 93.55%
=======================================
Files 457 457
Lines 36255 36253 -2
Branches 5262 5261 -1
=======================================
Hits 33915 33915
+ Misses 1835 1833 -2
Partials 505 505
Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here. β View full report at Codecov. |
@pmrowla I agree that assuming the version exists might not be a safe assumption. Do you have other ideas for how to speed up checking status for Edit: maybe related to #7268. If we had some option to force verification, it might be fine to default to assuming the version exists. |
The main problem here is that for regular DVC we are able to speed up the status check with:
But we can't use either of these optimizations with cloud versioning. Assuming that if we have a version ID we don't need to push is similar to the .dir optimization, but the difference there would be that we at least check one file in the remote (the .dir file), and for regular DVC remotes the expectation is that DVC is the only thing modifying the entire bucket (which is not necessarily the case with cloud versioning). Since we can't actually do any size estimation for versioned remotes, I suppose we could try running both the sequential |
fb54f52
to
1ad5055
Compare
Can we leave out the status optimizations to not hold this up? Let's discuss status optimization separately. |
0a83c61
to
c1ac943
Compare
@dberenbaum I removed the status optimization in this PR, it should be ready to merge now |
Thanks @pmrowla! Great to see this resolved so fast. |
c1ac943
to
3232756
Compare
β I have followed the Contributing to DVC checklist.
π If this PR requires documentation updates, I have created a separate PR (or issue, at least) in dvc.org and linked it here.
Thank you for the contribution - we'll try to review it as soon as possible. π
Will close #8359