import/get directory with git files and dvc outputs #3087
Comments
Hi @JPFrancoia! Your workflow is correct, it looks like there is a bug on our side. Could you show the log for |
Hi @efiop , Here is the log for the pull:
And for the dvc version:
But I also realized just now that my |
Reproduction script:
Expected behavior: |
@JPFrancoia that definitely looks like a bug on our side, I would recommend importing particular |
yes it is. I can also create a .dvc file for the directory in the original repo, it doesn't matter too much. I was just wondering why the feature wouldn't work.
My preference would be this solution. |
Got it, I can totally understand that it can be the desired behavior. I don't see any obvious reason why we shouldn't allow that, @efiop, what do you think? |
Side note: if we are to support this |
@pared Ok, so from formal standpoint that Ok, now back to the general issue of dvc allowing you to import a git directory that contains some cached dvc outputs, but not bringing the outputs along with it. I would say that intuition tells me that it should download git files as well as cached dvc outputs, because that is the natural state of a dvc repo. But it is also understandable that users might sometimes want to bring only git files or only dvcfiles by that filter, for which we could introduce special flags later, but I wouldn't introduce those until requested. Also, in retrospect, we might've been better off only allowing get/import for git files when a special flag |
@pared Not sure I understand this one, could you elaborate, please? |
@efiop That was wrong on my side, described behavior would be similar to [EDIT] |
@pared Not sure I follow, why it shouldn't take place? In my opinion |
@efiop The problem is that the target directory contains |
For the record, the p0 part is:
|
Submitted #3151 to unblock another ticket, but it does not solve the pull issue. In |
URL should be required and rev or rev_lock should be optional. Related to iterative#3087
Note: |
recursively added DVC outs * Partial fix for iterative#3087
After the changes for #3811 we have proper support for directories with mixed git files and DVC outputs (and dvcfiles are filtered out on import). So with a slightly modified reproduction script:
with the source repo
(where 3 is a git versioned file) running
so both DVC outs and the git-only file are imported, and we only generate adding an |
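For anyone reading along, here is a rough sketch of the selection rule described in this comment: git-tracked files and DVC outs are both brought along, while dvcfiles are filtered out. This is not DVC's actual code. The function name `select_for_import`, the example paths, and the rule for recognising dvcfiles (a `.dvc` suffix or the name `Dvcfile`) are all assumptions made for illustration.

```python
from pathlib import PurePosixPath


def is_dvcfile(path):
    # Assumption: a dvcfile is recognised by its ".dvc" suffix
    # or by being named "Dvcfile". Real DVC may use other rules.
    p = PurePosixPath(path)
    return p.suffix == ".dvc" or p.name == "Dvcfile"


def select_for_import(git_files, dvc_outs):
    """Return the paths a mixed-directory import would bring along.

    git_files: paths tracked by git inside the source directory
    dvc_outs:  paths of cached DVC outputs inside the same directory
    """
    # Keep git-tracked files, but drop the dvcfiles that describe the outs.
    kept_git = [p for p in git_files if not is_dvcfile(p)]
    # Bring the DVC outs themselves along with the remaining git files.
    return sorted(set(kept_git) | set(dvc_outs))


if __name__ == "__main__":
    # Hypothetical layout: "dir/3" is a git-versioned file,
    # "dir/1" is a DVC out described by "dir/1.dvc".
    print(select_for_import(["dir/3", "dir/1.dvc"], ["dir/1"]))
```

With that hypothetical layout, the result contains `dir/1` and `dir/3` and leaves out `dir/1.dvc`, which matches the behavior described above.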
DVC version: 0.80.0
DVC installed with Brew, on Mac OS
I'm trying to import/fetch external data.
I just did a:
which gives me a data folder and a data.dvc file in my current folder. Now, I'd like to pull the files contained in the data folder. The files are stored in an S3 bucket.
But I immediately get this error:
Am I using the right workflow? How do I solve this issue?
Cheers