You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Overview of the Feature Request
The reindexFilesInBatches method actually indexes in one batch and, when called from indexPermissionsOnSelfAndChildren for a permission change on a collection, results in a list of all datafiles in datasets in that collection and all permission docs for them being loaded into memory. Adding some batching would reduce the memory requirements significantly.
What inspired the request?
Out-of-memory errors at QDR
PR to follow
The text was updated successfully, but these errors were encountered:
* reindex batches of 20 files instead of all at once
* Also only keep 100 files in list at a time
* release note
* Just do collections/datasets as you go
Avoids keeping everything in memory, also helps in tracking progress as
you can see the permissionindextime getting updated per dataset.
* fix merge issues, add logging
* put comments back to how they were #10697
* reduce logging #10697
* rename release note and add PR number #10697
* fix logging - finest for per file, space in message
* adding a space in log message - per review
---------
Co-authored-by: Philip Durbin <philip_durbin@harvard.edu>
Overview of the Feature Request
The reindexFilesInBatches method actually indexes in one batch and, when called from indexPermissionsOnSelfAndChildren for a permission change on a collection, results in a list of all datafiles in datasets in that collection and all permission docs for them being loaded into memory. Adding some batching would reduce the memory requirements significantly.
What inspired the request?
Out-of-memory errors at QDR
PR to follow
The text was updated successfully, but these errors were encountered: