Reduce RowHash's tag space size by x2 #3543
Conversation
Generally looks good!
I did a quick benchmark and see just about neutral performance, ± a few percent depending on the compiler. Also a small ratio loss, but it seems reasonable. I'll let you do the full benchmarks; I just wanted to verify that the speed looked good on my machine too.
lib/compress/zstd_lazy.c
Outdated
@@ -804,9 +803,10 @@ U16 ZSTD_rotateRight_U16(U16 const value, U32 count) {
 * value to reflect the update. Essentially cycles backwards from [0, {entries per row})
You should update this comment
I thought I did, thank you for noticing!
lib/compress/zstd_lazy.c
Outdated
@@ -888,7 +888,7 @@ FORCE_INLINE_TEMPLATE void ZSTD_row_update_internalImpl(ZSTD_matchState_t* ms,
                                                         U32 const rowMask, U32 const useCache)
 {
     U32* const hashTable = ms->hashTable;
-    U16* const tagTable = ms->tagTable;
+    U8* const tagTable = ms->tagTable;
You should use BYTE here and elsewhere.
lib/compress/zstd_lazy.c
Outdated
@@ -903,7 +903,7 @@ FORCE_INLINE_TEMPLATE void ZSTD_row_update_internalImpl(ZSTD_matchState_t* ms,
             U32 const pos = ZSTD_row_nextIndex(tagRow, rowMask);
             assert(hash == ZSTD_hashPtr(base + updateStartIdx, hashLog + ZSTD_ROW_HASH_TAG_BITS, mls));
-            ((BYTE*)tagRow)[pos + ZSTD_ROW_HASH_TAG_OFFSET] = hash & ZSTD_ROW_HASH_TAG_MASK;
+            ((BYTE*)tagRow)[pos] = hash & ZSTD_ROW_HASH_TAG_MASK;
Can get rid of the BYTE* cast now.
@terrelln - did you benchmark on a mac or on an x86 server?
Force-pushed from cc96945 to faf5f15
Fixed CR comments and added a spreadsheet for benchmarks. I've tried reducing this regression, but I don't have a good solution at the moment, and unless it's critical I won't continue.
Allocate half the memory for tag space, which means that we get one less slot for an actual tag (needs to be used for next position index). In turn, we slash the memory usage for slightly worse compression ratio or better ratio if we use the same memory size with a higher hashLog.
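The row layout this commit describes can be sketched roughly as follows. This is a simplified illustration, not the actual zstd code: the names TagRow and ROW_ENTRIES are assumptions, and in the real implementation the head index simply occupies byte 0 of each row in the flat tag table.

```c
#include <stdint.h>

#define ROW_ENTRIES 16  /* entries per row; illustrative value */

/* Before this change, each slot was a U16 and all slots held tags.
 * After it, each slot is one byte, and the first byte of each row
 * stores the head (next insertion position) instead of a tag, so
 * one tag slot is lost per row. */
typedef struct {
    uint8_t head;                   /* next position to overwrite; cycles within the row */
    uint8_t tags[ROW_ENTRIES - 1];  /* 8-bit tags for the remaining candidate positions */
} TagRow;
```

This halves the tag space (1 byte per entry instead of 2) at the cost of one candidate per row, which is the ratio/memory trade-off discussed in this PR.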
Force-pushed from faf5f15 to 69be424
x86
Awesome!
…ys invalid facebook#3543 decreases the size of the tagTable by a factor of 2, which requires using the first tag position in each row for head position instead of a tag. Although position 0 stopped being a valid match, it still persisted in mask calculation resulting in the matches loops possibly terminating before it should have. The fix skips position 0 to solve this problem.
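The fix described above can be sketched as a one-line mask filter. This is a simplified illustration under assumed names (filter_match_mask is not a zstd identifier): since byte 0 of each row now holds the head index rather than a tag, a row-wide tag comparison can spuriously report a match at position 0, so bit 0 of the match mask must be cleared before the match loop runs.

```c
#include <stdint.h>

/* Clear bit 0 of a row's match bitmask: position 0 stores the head
 * index, not a tag, so it can never be a valid match. */
static uint32_t filter_match_mask(uint32_t matches)
{
    return matches & ~1u;
}
```

Without this filter, a false match at position 0 could make the loop over candidates terminate earlier than it should, which is the bug the commit fixes.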
Hi, can this be reflected in the comment at https://github.com/facebook/zstd/blame/25822342be59d831bad65426ae51f5cc22157b09/lib/compress/zstd_lazy.c#L1116-L1118? Thanks!
Allocate half the memory for tag space, which means that we get one less slot for an actual tag (it needs to be used for the next position index).
The result is a slight loss in compression ratio (up to 0.2%) and some regressions/improvements to speed depending on level and sample. In turn, we get to save 16% of the hash table's space (5 bytes per entry instead of 6 bytes per entry).
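The ~16% figure follows directly from the per-entry sizes quoted above. A minimal sketch of the arithmetic, assuming no padding and using a hypothetical helper name (tag_shrink_saving_pct is not zstd code):

```c
#include <stdint.h>

/* Each hash-table entry pairs a 4-byte U32 index with a tag.
 * Shrinking the tag from U16 to U8 drops the entry from 6 to
 * 5 bytes, a saving of 1/6 ~= 16.7%. */
static double tag_shrink_saving_pct(void)
{
    const double before = sizeof(uint32_t) + sizeof(uint16_t); /* 4 + 2 = 6 */
    const double after  = sizeof(uint32_t) + sizeof(uint8_t);  /* 4 + 1 = 5 */
    return 100.0 * (before - after) / before;
}
```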
Update:
CR comments fixed.
Benchmark in spreadsheet.
Update 2:
Another interesting benchmark is fullbench -b41 -l9 enwik8.1k, which repeatedly uses ZSTD_compressStream to recompress 1024 bytes out of enwik8. Since #3528 hasn't been deployed yet, we can observe that this PR makes the benchmark twice as fast for small data sizes, as the majority of time is spent in the tag space initialization.