Testing Path Fix #1853
Signed-off-by: ravi-kumar-pilla <ravi_kumar_pilla@mckinsey.com>
Also looping in @merelcht: does it make sense to fix this issue on the kedro-datasets side? Are we strict about return type validation for file_path? Please let us know.
I had a discussion on Slack, and Nok mentioned that filepath is not a well-defined API, meaning some datasets may not have a filepath at all. I still think that if a filepath exists, its type should be consistent. From the Viz perspective, we can make the conversion foolproof. Thank you.
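The "foolproof" conversion mentioned above could look roughly like the sketch below. It is only an illustration, not the code in this PR: `normalise_filepath` is a hypothetical helper name, and it assumes the dataset exposes the private `_filepath` attribute when it has a path at all.

```python
from pathlib import PurePosixPath
from typing import Any, Optional


def normalise_filepath(dataset: Any) -> Optional[PurePosixPath]:
    """Return dataset._filepath as a PurePosixPath, or None if there is none.

    Hypothetical helper: the name and the None-handling are assumptions.
    """
    raw = getattr(dataset, "_filepath", None)
    if not raw:
        # Some datasets have no filepath at all (or an empty one).
        return None
    # PurePosixPath accepts both str and path-like objects, so a single
    # call covers datasets that store either type.
    return PurePosixPath(raw)
```

With a helper like this, the caller no longer needs to know whether a given dataset stores a `str` or a `PurePosixPath`.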
@@ -135,7 +135,9 @@ def get_file_size(self, dataset: Any) -> Union[int, None]:
             return None

         try:
-            file_path = get_filepath_str(Path(dataset._filepath), dataset._protocol)
+            file_path = get_filepath_str(
+                PurePosixPath(dataset._filepath), dataset._protocol
I am confused: Path is usually the platform-agnostic way and works on both Windows and Linux, but PurePosixPath wouldn't work on Windows, so why are we changing this? Happy to get on a call to understand better.
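For reference, the standard-library "pure" path classes do no filesystem access and can be instantiated on any operating system; only the concrete `PosixPath` and `WindowsPath` classes are tied to the host platform. The sketch below shows how the same string is rendered by each pure flavour. The sample path is made up for illustration, and this does not claim to reproduce the CircleCI failure.

```python
from pathlib import PurePosixPath, PureWindowsPath

# Hypothetical filepath, chosen only to illustrate the two flavours.
raw = "C:/data/01_raw/cars.csv"

posix_view = PurePosixPath(raw)
windows_view = PureWindowsPath(raw)

# The POSIX flavour keeps forward slashes and treats "C:" as a plain segment.
print(str(posix_view))       # C:/data/01_raw/cars.csv
print(posix_view.parts[0])   # C:

# The Windows flavour recognises the drive and renders backslashes.
print(str(windows_view))     # C:\data\01_raw\cars.csv
print(windows_view.drive)    # C:
```

On a Windows host, `Path(raw)` behaves like the Windows flavour shown here, so the string handed to `get_filepath_str` differs depending on which class wraps the filepath.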
…x/win_build Signed-off-by: ravi-kumar-pilla <ravi_kumar_pilla@mckinsey.com>
LGTM :) thanks Ravi -- great debugging
Description
Testing the Windows build failure caused by StatsHook on the main branch.
NOTE: When I tested locally on a Windows machine, it worked fine. However, on CircleCI the filepath contains C:/ but the system tries to find C: (missing the forward slash). After looking further into the kedro-datasets plugin, I found that dataset._filepath returns a PurePosixPath for some datasets and a string for others. The fix for #1797 should be made on the datasets side so that the type is consistent.
Development notes
The pandas CSV, Excel, Feather, Generic, HDF, JSON, Parquet and XML datasets have a code block that stores the filepath as a PurePosixPath, while others like deltatable, gbq and sql keep filepath as str. I am not completely aware of the reason for this difference. @SajidAlamQB please add if you have any information on this.
If this is not intentional, we should fix the issue on kedro-datasets.
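The inconsistency described above can be illustrated with two stand-in classes. These are hypothetical and are not the real kedro-datasets implementations: `PathStyleDataset` and `StringStyleDataset` only mimic the two ways `_filepath` is reported to be stored, and `filepath_type_name` is an assumed helper for auditing which type a dataset uses.

```python
from pathlib import PurePosixPath
from typing import Any


class PathStyleDataset:
    """Hypothetical stand-in for datasets that wrap the filepath."""

    def __init__(self, filepath: str) -> None:
        self._filepath = PurePosixPath(filepath)


class StringStyleDataset:
    """Hypothetical stand-in for datasets that keep the filepath as str."""

    def __init__(self, filepath: str) -> None:
        self._filepath = filepath


def filepath_type_name(dataset: Any) -> str:
    """Report the runtime type of dataset._filepath (NoneType if absent)."""
    return type(getattr(dataset, "_filepath", None)).__name__


datasets = [PathStyleDataset("data/cars.csv"), StringStyleDataset("data/cars.csv")]
report = {type(ds).__name__: filepath_type_name(ds) for ds in datasets}
print(report)
```

A report like this makes it easy to see which datasets would need changing if the type were to be made consistent on the kedro-datasets side.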
NOTE: The continue_config.yml changes are for testing the Windows build and will be discarded before merging to main.
QA notes
Checklist
RELEASE.md file