-
Notifications
You must be signed in to change notification settings - Fork 15.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GoogleDriveLoader seems to be pulling trashed documents from the folder #5104
Comments
Do you have more details on reproducing? Are you using Looking through the code, if On the other hand, if |
It's happening the same to me, so I've recorded a video showing the issue. This is the script I've used for the test:
|
It's working well if I go to the trash in Google Drive, and click "Empty Trash". |
+1, I just tested this and saw the same behavior. This seems like a Google Drive pattern. |
I found a way to check if a file is trashed, fill send out a PR tonight. |
…issue #5104) (#5220) # Change Default GoogleDriveLoader Behavior to not Load Trashed Files (issue #5104) Fixes #5104 If the previous behavior of loading files that used to live in the folder, but are now trashed, you can use the `load_trashed_files` parameter: ``` loader = GoogleDriveLoader( folder_id="1yucgL9WGgWZdM1TOuKkeghlPizuzMYb5", recursive=False, load_trashed_files=True ) ``` As not loading trashed files should be expected behavior, should we 1. even provide the `load_trashed_files` parameter? 2. add documentation? Feels most users will stick with default behavior ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: DataLoaders - @eyurtsev Twitter: [@nicholasliu77](https://twitter.com/nicholasliu77)
…issue langchain-ai#5104) (langchain-ai#5220) # Change Default GoogleDriveLoader Behavior to not Load Trashed Files (issue langchain-ai#5104) Fixes langchain-ai#5104 If the previous behavior of loading files that used to live in the folder, but are now trashed, you can use the `load_trashed_files` parameter: ``` loader = GoogleDriveLoader( folder_id="1yucgL9WGgWZdM1TOuKkeghlPizuzMYb5", recursive=False, load_trashed_files=True ) ``` As not loading trashed files should be expected behavior, should we 1. even provide the `load_trashed_files` parameter? 2. add documentation? Feels most users will stick with default behavior ## Who can review? Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested: DataLoaders - @eyurtsev Twitter: [@nicholasliu77](https://twitter.com/nicholasliu77)
System Info
Hi
testing this loader, it looks as tho this is pulling trashed files from folders. I think this should be default to false if anything and be an opt in.
Who can help?
No response
Information
Related Components
Reproduction
use GoogleDriveLoader
Reindex
File still can be searched in vector store.
Expected behavior
Should not be searchable
The text was updated successfully, but these errors were encountered: