feat: add zstd compression #451

Rdataflow · 2025-01-08T16:59:53Z

@constantinius this PR brings ZSTD support 😄
thanks for your review ...

Closes: #372

constantinius · 2025-01-09T10:35:20Z

src/compression/zstd.js

+
+export default class ZstdDecoder extends BaseDecoder {
+  decodeBlock(buffer) {
+    return zstd.decode(new Uint8Array(buffer), 1000_000_000).buffer;


The second parameter is uncompressedSize, which is fixed here to 1 million. Is this significant? can we actually know it beforehand? What if the size is actually smaller/larger?

Reading the source: if uninitialized (i.e: 0) the size should automatically be found out. Wouldn't that be a better solution?

I'm worried, that if the actual size is larger than the 1 million bytes buffer (which can easily be reached by 1024x1024 tile sizes) we might run into troubles.

@constantinius this value of 1 billion has been approximated using trial and error.
AFAICS 0 or omitting the param fails in the CI tests for some unknown reason.
however huge numbers make it pass w/o issues.
I couldn't observe any negative impact neither with 100 billion. (i.e. no mem limits hit)
for the full journey see this branch https://github.com/Rdataflow/geotiff.js/commits/ci-test/ and it's CI tests failing or passing
that's what drove us here... but maybe there exist even better ideas?

Hey @Rdataflow

I investigated this. There is a function in zstd to determine the uncompressed size of a chunk. This is a property set in the chunk itself which is optional. When it is optional, zstd says to read the chunk in streaming mode. Otherwise proper decompression is not guaranteed. And this is something I'd like to avoid, since it may work on our test files, but not on random files used by people. So I'm against abusing the zstd library that way. Unfortunately, the zstd library we are currently using (and no other one I investigated) supports streaming decompression as it does not wrap the necessary functions of the wasm library.

I'm now trying to implement this streaming decoding into the zstd library. I'm thinking of incorporating that into the source code of geotiff.js directly, as it may be easier to handle.

Hey @constantinius

Great you'll an even better way to implement zstd decompression 😃

just in case that becomes too heavy, I'll share my thoughts on alternative ways (up to you to decide and implement properly):

precalculate the uncompressed buffer size using the geotiff properties (tile size, bit depth, etc.)

Rdataflow force-pushed the feat--add_zstd branch 2 times, most recently from 516c9b9 to f80f727 Compare January 8, 2025 18:18

Rdataflow added 2 commits January 8, 2025 22:04

feat: add zstd compression

6e65842

Closes: geotiffjs#372

ci: add tests for zstd

d497708

Rdataflow force-pushed the feat--add_zstd branch from f80f727 to d497708 Compare January 8, 2025 21:04

Rdataflow mentioned this pull request Jan 9, 2025

ZSTD compressed COGs don't display geoadmin/web-mapviewer#1190

Open

constantinius reviewed Jan 9, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add zstd compression #451

feat: add zstd compression #451

Rdataflow commented Jan 8, 2025

constantinius Jan 9, 2025

constantinius Jan 9, 2025

Rdataflow Jan 9, 2025 •

edited

Loading

constantinius Jan 10, 2025

Rdataflow Jan 10, 2025

feat: add zstd compression #451

Are you sure you want to change the base?

feat: add zstd compression #451

Conversation

Rdataflow commented Jan 8, 2025

constantinius Jan 9, 2025

Choose a reason for hiding this comment

constantinius Jan 9, 2025

Choose a reason for hiding this comment

Rdataflow Jan 9, 2025 • edited Loading

Choose a reason for hiding this comment

constantinius Jan 10, 2025

Choose a reason for hiding this comment

Rdataflow Jan 10, 2025

Choose a reason for hiding this comment

Rdataflow Jan 9, 2025 •

edited

Loading