Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

task: optionally skip downloading artefacts on destroy #303

Closed
Tracked by #362
DavidGOrtega opened this issue Nov 25, 2021 · 6 comments · Fixed by #395
Closed
Tracked by #362

task: optionally skip downloading artefacts on destroy #303

DavidGOrtega opened this issue Nov 25, 2021 · 6 comments · Fixed by #395
Assignees
Labels
enhancement New feature or request p1-important High priority resource-task iterative_task TF resource ui/ux User interface/experience

Comments

@DavidGOrtega
Copy link
Contributor

DavidGOrtega commented Nov 25, 2021

Right now destroy syncs data having unnecessary cloud traffic (to be paid). If i do not care about the data back we can have a flag pullOnDestroy to be able to set this like #301

@DavidGOrtega DavidGOrtega added enhancement New feature or request resource-task iterative_task TF resource labels Nov 25, 2021
@0x2b3bfa0
Copy link
Member

having unnecessary cloud traffic (to be paid)

Cents, if you use directory for code synchronization. If you use it as an intermediate cache for data, it might cost a lot more.

We can either allow this choice or create a separate cache directory that never gets synchronized back; it would be used to store the intermediate data (not artifacts) using during the training process.

@DavidGOrtega
Copy link
Contributor Author

DavidGOrtega commented Nov 25, 2021

We can either allow this choice

I prefer also this

Cents, if you use directory for code synchronization.

also the time to sync, can be hours were your machine is asking to die. For some reason it takes some CPU in my local machine

@0x2b3bfa0
Copy link
Member

#307 could reduce considerably the pull times, especially when there is nothing to pull. 😜 Still having an option to disable pulling the data doesn't seem like a bad idea either.

@DavidGOrtega
Copy link
Contributor Author

#307 could reduce considerably the pull times,

Why #307 would reduce the time?

@0x2b3bfa0
Copy link
Member

Why #307 would reduce the time?

Because users are the ones who choose what gets downloaded and what doesn't. Still, if artifacts are huge it will be the same.

@casperdcl casperdcl changed the title task destroy, I do not care of the artefacts. Can we avoid the sync? skip downloading artefacts on task destroy Jan 11, 2022
@casperdcl casperdcl changed the title skip downloading artefacts on task destroy task: optionally skip downloading artefacts on destroy Jan 12, 2022
@MaxHuerlimann
Copy link

Would also be interested in this feature. In our use-case with DVC we can handle updating remote cache and creating pull request on the remote cloud machine after model training, so the data syncing back to the CI/CD runner machine is not needed and only increases pipeline run times unnecessarily.

@0x2b3bfa0 0x2b3bfa0 self-assigned this Feb 15, 2022
@0x2b3bfa0 0x2b3bfa0 added p1-important High priority ui/ux User interface/experience labels Feb 15, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request p1-important High priority resource-task iterative_task TF resource ui/ux User interface/experience
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants