-
Notifications
You must be signed in to change notification settings - Fork 893
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[KED-1796] Problems with relative paths under Windows #412
Labels
Issue: Bug Report 🐞
Bug that needs to be fixed
Comments
@fdroessler Thank you for reporting this! We've been working through a backlog of some Windows specific issues and we'll add this one to the list. Once we have a solution for this, we'll share what that is and release a bugfix. |
idanov
changed the title
Problems with relative paths under Windows
[KED-1796] Problems with relative paths under Windows
Jun 18, 2020
Linking in issue #390 since it seems that it's reporting the same problem. |
@fdroessler Thank you for reporting this issue. It was fixed in 390c02f commit. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hopefully this error is not due to a change in kedro that I have missed but I had a couple of people reporting this to me.
Description
Switching from absolute paths to relative paths results in some (maybe cached) state where kedro tries to load a non-existing version of a dataset.
Context
We have developed a ML pipeline for which the data was stored in a separate partition (
D:/data
) and the kedro projekt repo was on something likeC:/code/project
. Subsequently on a different machine, a colleague cloned the repo and copied the data over into the data folder within the project repo. Now the code was inC:/code/
and the data inC:/code/data/
and the paths incatalog.yml
were adjusted to reflect the new location using relative paths. When starting IPython via the kedro command (kedro ipython
) everything seemed ok but trying to load a versioned dataset resulted in an error. The error said that a specific version of the dataset could not be loaded from disk. Error:VersionNotFoundError:
Trying to explicitly load a version that exists on the machine resulted in an error because for some reason a
C:/
was appended to the file URI so that it looked like this:<path to versioned file>C:
.`01_raw/preprocessed_shuttles.csv/2020-06-15T07.44.54.647Z/C:'
If instead of relative paths
data/dataset
absolute paths are usedC:/..../kedro_project/data/dataset
everything works as expected.Steps to Reproduce using space flight tutorial
data/...
kedro ipython
and try loadingcatalog.load("preprocessed_shuttles")
orcatalog.load("preprocessed_shuttles", version='2020-06-15T07.44.54.647Z')
to see both errors.Expected Result
Kedro should load the latest dataset on disk independent of absolute and relative paths.
Actual Result
Copying the data and changing the yaml to relative paths results in error messages trying to load none existing versions.
Your Environment
Include as many relevant details about the environment in which you experienced the bug:
pip show kedro
orkedro -V
): 0.16.1python -V
): 3.7.7The text was updated successfully, but these errors were encountered: