dvc pull --glob tries to pull dvcignored files and glob pattern is only applied to files that already exist #5864
Labels
A: data-sync
Related to dvc get/fetch/import/pull/push
bug
Did we break something?
p2-medium
Medium priority, should be done, but less important
As mentioned in topic I have found two issues
Explaination 1:
Lets assume I have a folder with two json files: file_1.json and file_2.json
file_1.json is in dvcignore
I run the command "dvc pull --glob "*.json"
The return is:
"ERROR: failed to pull data from the cloud - 'file_1.json' does not exist as an output or a stage name in 'dvc.yaml': 'dvc.yaml' does not exist"
Explaination 2:
Lets assume I have a folder with one json file and one png file but this files but the files are NOT in the current directory yet but only the .dvc files
I run the command "dvc pull --glob "*.json"
I would expect hat only the .json file is pulled but instead the .png file is pulled as well.
I got following explaination:
"The problem is that the glob pattern is only applied to files that are already in your local workspace (so it works for pulling updated versions but not for pulling new files). if you haven't pulled anything yet, it will return an empty list of pattern matches, and then DVC falls back to the default "pull everything" behavior"
The text was updated successfully, but these errors were encountered: