-
Notifications
You must be signed in to change notification settings - Fork 60
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add possibility to retrospectively promote subset collections to full collections #655
Comments
One general approach to solving this that piques my interest is a "tree-shaking" algorithm. The user would specify which collections must be output. Podio then starts by marking these collections as "live". It follows all associations backwards, across all collections, marking everything it encounters as live. Anything not live at the end does not get written. This would strike a good balance between saving space and preserving associations and hence data integrity. If the runtime cost is too high, we could reduce it by having the user specify exactly which collections need to be pruned. |
That sounds like an interesting approach. I think it could work, there might be some edge cases to be considered. One potential issue is the following: All objects are identified by their |
As discussed during the EDM4hep meeting on Sep 10 we don't think a truly generic solution is possible or this. Hence, we decided that at least for the foreseeable future the developments in this direction will (and should) focus on implementing the necessary functionality to make things work for specific use cases where the expected outcome is well defined, e.g. skimming MCParticles |
See also discussion in: key4hep/k4FWCore#226
The text was updated successfully, but these errors were encountered: