Skip to content

Best options for efficient deduplication? #63

Answered by mhx
M-Gonzalo asked this question in Q&A
Discussion options

You must be logged in to vote

Interesting use case, thanks for the feedback!

My gut feeling when reading this was that it's simply because the lookback buffer is too small that mkdwarfs isn't able to pick up the redundancies. The rationale behind keeping the lookback buffer size limited is for a more typical use case: say you have a file that's relatively small compared to the configured filesystem block size. Even if you were able to assemble that file mostly from chunks split across 20 different filesystem blocks, you'd rarely want to, because you'd have to decompress 20 filesystem blocks in order to re-assemble the file when mounting the filesystem image.

However, your use case is just begging for a) large filesyst…

Replies: 6 comments 40 replies

Comment options

You must be logged in to vote
3 replies
@M-Gonzalo
Comment options

@Phantop
Comment options

@M-Gonzalo
Comment options

Comment options

You must be logged in to vote
1 reply
@mhx
Comment options

mhx Oct 24, 2021
Maintainer

Answer selected by M-Gonzalo
Comment options

You must be logged in to vote
3 replies
@mhx
Comment options

mhx Oct 24, 2021
Maintainer

@Phantop
Comment options

@mhx
Comment options

mhx Oct 25, 2021
Maintainer

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
31 replies
@Phantop
Comment options

@mhx
Comment options

mhx Oct 27, 2021
Maintainer

@mhx
Comment options

mhx Oct 27, 2021
Maintainer

@Phantop
Comment options

@mhx
Comment options

mhx Oct 29, 2021
Maintainer

Comment options

You must be logged in to vote
2 replies
@mhx
Comment options

mhx Oct 26, 2021
Maintainer

@M-Gonzalo
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants