-
-
Notifications
You must be signed in to change notification settings - Fork 743
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
borg2: there is no csize #6763
borg2: there is no csize #6763
Conversation
Pika Backup currently repots an 'additional backup space used' during backup. It reports the deduplicated size, I'm not even sure how the compression is counted here. I don't care that much about compression here but the deduplicated size is was always a good indicator for me during a backup: If it's suddenly uploading a lot of data but you would have expected everything to be covered by deduplication, you might have missed excluding something. However, this all does not matter during scheduled backups. |
f745242
to
4ac880e
Compare
so we do not need csize for it.
also: remove pre12_meta cache
4ac880e
to
822aefc
Compare
the size is already remove via Item._update_internal method.
@sophie-h I re-added the "deduplicated size", just changed its meaning: It is now based on the uncompressed size (this is because we do not have the compressed size any more). So this can still be used as an indicator for sudden unexpected backup growth (because some exclude is missing or some crap unintentionally got backed up). |
What's currently missing is an indication of how good the compression worked (info, stats, placeholders). But guess we could re-add this later in a different way. After #6689 is merged, we have the "csize" (or something closely related) in the repo index, so after adding some api method, we could compute some statistics based on that. Or, maybe even more efficient: we could just log compressor stats at the end of a backup. |
"what would we lose/win if we would just not track csize (the compressed size of a chunk)", see #2357.
What we lose (some stuff could be just done differently):
What we win:
fetch_missing_csize
and nozero_csize_ids
any more for the (still experimental)AdHocCache
Also: