add: from external FS or local external outputs #1545
Comments
This is just a usual "external output" case, where you want to extend your project's workspace with the workspace of your entire machine. I don't think it is a bug, but rather a miscommunication. I guess people expect
The guy saw I expect this
@dmpetrov I see, this is the guy from ODS you are talking about. Agreed. I think we could simply add a confirmation prompt for external local files along with something like
Yeah, that's what I was thinking about. But let's think a bit more about this experience.
Or, as discussed with @shcheklein, we could require a special external cache setup for external outputs, same as we do with s3/gs/etc. That way it would error out automatically, without us having to introduce an --external flag, which would break the symmetry with s3/gs/etc external outputs.
More context on this: https://discuss.dvc.org/t/share-nas-data-in-server/180 . It looks like it's a really common thing, and the way our "external outputs" behave in this case is very confusing.
The scenario with external outputs makes sense but it is too advanced for Let's prevent using external outputs in
It is also related to #2955. DVC needs to resolve paths and exit with an error if the path is outside the current repo (no external outputs). The paths inside the repo need to be converted to relative paths.
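To make the check above concrete, here is a minimal sketch in plain Python. It is not DVC's actual code: `to_repo_relative` and `OutsideRepoError` are hypothetical names made up for illustration, and the only assumption is that "inside the repo" means "under the repo root after resolving `..` and symlinks".

```python
from pathlib import Path


class OutsideRepoError(ValueError):
    """Hypothetical error type for this sketch; not a DVC exception."""


def to_repo_relative(path, repo_root):
    """Resolve `path` and return it relative to `repo_root`.

    Raises OutsideRepoError when the resolved path is not under the repo root,
    i.e. when it would be an external output.
    """
    root = Path(repo_root).resolve()
    target = Path(path).resolve()  # collapses ".." and follows symlinks
    try:
        return target.relative_to(root)
    except ValueError:
        # relative_to() raises ValueError when target is not under root
        raise OutsideRepoError(f"{target} is outside the repo at {root}") from None
```

Resolving before comparing is the important part: a path like `data/../../elsewhere/file` looks local but ends up outside the repo once `..` is collapsed.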
Had a great discussion with @dmpetrov and @shcheklein that it might make sense to introduce a config option instead of But a proper solution here, is probably to introduce a notion of workspaces, that will enforce proper use. E.g. maybe something like
but it should be carefully evaluated. We will do that later. For now we need to prevent new users from shooting themselves in the foot.
I have some questions about the new
@jorgeorpinel Yep, https://github.com/iterative/dvc/pull/3929/files#diff-43298ed5b9e6a1345c43c8aedf6971c3R92
Because using them is safe and we haven't seen people misusing them.
OK. But
@jorgeorpinel Agreed, the name could be made more explicit, but --external was added simply to prevent misuse; it is only meant for advanced users. The external outs scenario is going to be revisited in the future, so I wouldn't bother renaming it again until then (at which point we will likely get rid of it).
Sounds good.
It copies the file into the cache, removes the original file on the external drive and creates a symlink (to the DVC cache) in its place.
Based on some offline discussions, this confuses people. One of the biggest concerns: after removing the DVC project (the entire dir with the code as well as the DVC cache), the user sees a symlink to a file which does not exist.
reddit-May2015_z.tsv.zip.dvc file but not the actual file in the workspace (like reddit-May2015_z.tsv.zip). It is not clear how to instantiate the file in the workspace.
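The copy / remove / symlink behaviour described above, and the dangling symlink it leaves behind, can be reproduced with a small standalone sketch. This only simulates the steps with the standard library; it does not call DVC. `add_external` and `demo` are hypothetical helpers, the directory names are made up, and the cache entry is named after the file for simplicity rather than by checksum.

```python
import shutil
import tempfile
from pathlib import Path


def add_external(src: Path, cache_dir: Path) -> Path:
    """Simulate the steps: copy into the cache, remove the original, symlink back."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    cached = cache_dir / src.name  # simplification: a real cache would key by checksum
    shutil.copy2(src, cached)      # 1. copy the file into the cache
    src.unlink()                   # 2. remove the original on the external drive
    src.symlink_to(cached)         # 3. leave a symlink pointing into the cache
    return cached


def demo():
    with tempfile.TemporaryDirectory() as tmp:
        tmp = Path(tmp)
        external = tmp / "external_drive"  # stands in for the external location
        project = tmp / "project"          # stands in for the DVC project dir
        external.mkdir()
        data = external / "data.tsv"
        data.write_text("a\tb\n")

        add_external(data, project / ".dvc" / "cache")
        readable_before = data.exists()    # exists() follows the symlink

        shutil.rmtree(project)             # user deletes the project and its cache
        return readable_before, data.is_symlink(), data.exists()


if __name__ == "__main__":
    print(demo())
```

After the project directory is removed, the path on the external drive is still a symlink but no longer resolves to a file, which is exactly the confusing state reported here. Creating symlinks may require extra privileges on Windows.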