-
Notifications
You must be signed in to change notification settings - Fork 69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Problem with the cache in GDriveFileSystem.find
#229
Comments
@simone-viozzi yes, thanks for creating the ticket. |
Any idea of a roadmap to fix it? With this broken #222 is pretty much useless, it will work unexpectedly 90% of the time. I could try, but I still don't understand how the cache works right now, and how it should work. |
Hmm, I think |
|
okay, problems that I see:
|
Hey @shcheklein, did you know that And also |
@simone-viozzi yes, I know. The default implementation was not good enough for the DVC that is the major driver / consumer of this fsspec for now. |
While working on #222, I discovered that
find
has a bug with the cache.Let assume
self.path=root/tmp/
and a folder structure like:now let's do some tests:
and that is correct,
but if we do only
find('root/tmp/fo1/fo2')
:This happens because find relay on the cache, and at the start the cache is only populated with ids from one level down
self.path
so in the last example, the content of the cache is:
I think, because there is no
tmp/fo1/fo2
(the starting path of find),query_ids
stays empty and the method return an empty list.The lines of code involved are:
PyDrive2/pydrive2/fs/spec.py
Lines 469 to 483 in 27bbf4c
The text was updated successfully, but these errors were encountered: