
Update release/v3.2011 branch #1634

Merged 36 commits into release/v3.2011 on Jan 15, 2021
Conversation

@jarifibrahim (Contributor) commented Jan 5, 2021


ajeetdsouza and others added 30 commits November 10, 2020 14:06
We removed the global z.Allocator pool from the z package. Instead, we now use a new z.AllocatorPool type in the places that need a pool; here, we brought it to TableBuilder and Stream.

Fix up a memory leak in Stream.

Co-authored-by: Ibrahim Jarif <ibrahim@dgraph.io>
Decrease the size of DISCARD file and WAL for Memtables.
Remove z.Buffer from skiplist because we are using static sized buffer.
Do not use AllocatorPool, because that has shown weird crashes due to
Go's interpretation of slices. The new system uses Go memory for
z.Allocator and avoids reusing it.
The orchestrate function would block forever if the send function returned an error.
The produceKv goroutines would also block, since the error channel had a capacity of 1.
This fixes two issues:
- The atomic variable was not being accessed correctly.
- The atomic variable must be the first member of the struct to ensure 8-byte alignment; failing to do so causes a segmentation fault, because on 32-bit platforms Go only guarantees 64-bit alignment for the first word in an allocated struct.

Fixes DGRAPH-2773
Add debugging information in the yieldItemValue function to find the root cause of the missing vlog files error.

Note: This commit should be reverted once the issue has been resolved.
Stream.Send now sends out a z.Buffer instead of a pb.KVList. z.Buffer marshals each KV as a separate slice, which significantly reduces the memory required by the Stream framework. For memory-safety reasons, Stream no longer uses z.Allocator or tries to put pb.KV structs on the Allocator.

Bring back the z.AllocatorPool for table.Builder.

Changes:
* Use z.Buffer for stream.Send
* Only use 8 streams in write bench
* Revert "Bug Fix: Fix up how we use z.Allocator"
This reverts commit 5ff9e1d.
* Bring allocator back. Use z.Buffer for send
* Add BufferToKVList function
* Print jemalloc while stream
* Bring in latest Ristretto
* Fix memory leak and benchmark read test

Co-authored-by: Ibrahim Jarif <ibrahim@dgraph.io>
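The "each KV as a separate slice" idea above can be sketched as a length-prefixed byte buffer. This is a simplified, illustrative stand-in for ristretto's z.Buffer, not its actual implementation; the function names here are invented:

```go
package main

import (
	"encoding/binary"
	"fmt"
)

// appendSlice writes one marshaled KV into the buffer as a
// length-prefixed slice. Every KV lives inside one large buffer, so no
// per-KV Go allocation is needed while streaming.
func appendSlice(buf []byte, kv []byte) []byte {
	var hdr [4]byte
	binary.BigEndian.PutUint32(hdr[:], uint32(len(kv)))
	return append(append(buf, hdr[:]...), kv...)
}

// iterate replays every slice in order, conceptually what a
// BufferToKVList-style conversion would walk over.
func iterate(buf []byte, fn func(kv []byte)) {
	for len(buf) > 0 {
		n := binary.BigEndian.Uint32(buf[:4])
		fn(buf[4 : 4+n])
		buf = buf[4+n:]
	}
}

func main() {
	var buf []byte
	buf = appendSlice(buf, []byte("kv1"))
	buf = appendSlice(buf, []byte("kv2"))
	iterate(buf, func(kv []byte) { fmt.Println(string(kv)) })
}
```

The win over pb.KVList is that a single flat buffer replaces a slice of heap-allocated pb.KV structs, so memory use stays proportional to the raw data, not the number of entries.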
Remove `Github issues` links
Even for cheap integer fields like MaxVersion, KeyCount, and OffsetsLength, we end up calling `fetchIndex`, which hits the Ristretto cache. Instead, we can keep these fields in memory always, because they cost so little. This PR does that.
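The idea above is to copy the cheap scalars out of the index once, at open time, so later reads never touch the cache. A minimal sketch with illustrative types (not Badger's actual structs):

```go
package main

import "fmt"

// tableIndex stands in for the on-disk index; fetching it would
// normally go through the Ristretto cache.
type tableIndex struct {
	maxVersion uint64
	keyCount   uint32
}

// table keeps the cheap integer fields resident, so accessors never
// need to call a fetchIndex-style helper.
type table struct {
	maxVersion uint64 // copied out of the index when the table is opened
	keyCount   uint32
}

func openTable(idx tableIndex) *table {
	return &table{maxVersion: idx.maxVersion, keyCount: idx.keyCount}
}

// MaxVersion is now a plain field read: no cache lookup, no unmarshal.
func (t *table) MaxVersion() uint64 { return t.maxVersion }

func main() {
	tbl := openTable(tableIndex{maxVersion: 7, keyCount: 100})
	fmt.Println(tbl.MaxVersion()) // 7
}
```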
We store a slice of pb.KVs in Iterator, so it can be used by Stream users.
NewKeyIterator uses pickTables, which was optimized in the past. However, a recent PR (#1546) removed this optimization, which made NewKeyIterator quite expensive.

This PR brings that optimization back.
The DefaultOptions already has snappy set to the default, so the stream
CLI tool aligns with that now.
* chore(cmd/info): Fix printed spacing of summary.

Column-aligns the output for the Summary section:

Before:

    [Summary]
    Level 0 size:          0 B
    Level 1 size:       2.3 kB
    Total SST size:   2.3 kB
    Value log size:       20 B

After:

    [Summary]
    Level 0 size:          0 B
    Level 1 size:       2.3 kB
    Total SST size:     2.3 kB
    Value log size:       20 B

* fix: Set block cache and index cache sizes.

This fixes a panic when running badger info

    panic: BlockCacheSize should be set since compression/encryption are enabled
* Let's use allocator again
* Switch to z.NumAllocBytes
* Make stream work with both another Badger DB or with a file to backup.
* Add a test for Allocator
* Use allocator for Backup
* Bring in latest Ristretto

Co-authored-by: Daniel Mai <daniel@dgraph.io>
In edge cases, we end up with too many splits during compactions, which makes compactions take up too much RAM. Avoid that by limiting splits to a maximum of 5. Also, avoid running more compactions when memory usage goes above 16 GB.
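Both guards described above are simple policies and can be sketched as follows (constant names and signatures are illustrative, not Badger's actual code):

```go
package main

import "fmt"

const (
	maxSplits   = 5       // cap on sub-ranges per compaction
	memoryLimit = 16 << 30 // 16 GB back-pressure threshold
)

// capSplits bounds how many sub-ranges a single compaction is split
// into, so an edge case with many overlapping tables cannot multiply
// the compaction's RAM footprint.
func capSplits(want int) int {
	if want > maxSplits {
		return maxSplits
	}
	return want
}

// shouldCompact skips starting new compactions while the process is
// already using too much memory.
func shouldCompact(memInUse int64) bool {
	return memInUse < memoryLimit
}

func main() {
	fmt.Println(capSplits(12))           // 5
	fmt.Println(shouldCompact(20 << 30)) // false
}
```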
If a table has a mixture of value log pointers and embedded values, Badger will carry over the last length from a value log entry into the subsequent embedded entries.

Co-authored-by: Raúl Kripalani <raul@protocol.ai>
Since we are in the process of moving to Netlify, we need this change for the docs to work.

This change has no effect on the current Badger docs.
This sets relativeURLs to false (the default value). If it's set to true,
then the URLs generated by Hugo are incorrect.

e.g., in the HTML the incorrect URL is created starting with "./".

    <link
      href='./docs/badger/css/theme.css?ed88a5fdbf06b9737b9afdf41f9e2902'
      rel="stylesheet"
      />

With this change, the correct URL is created starting with "/".

    <link
      href='/docs/badger/css/theme.css?ed88a5fdbf06b9737b9afdf41f9e2902'
      rel="stylesheet"
      />
NamanJain8 and others added 4 commits December 8, 2020 22:50
When keys are moved because of the GC, we were removing the
bitDiscardEarlierVersions and other bits. Only the transaction markers
should be removed and all the other bits should be kept.
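The fix above is a bit-masking change: instead of zeroing the whole meta byte when the GC rewrites a key, clear only the transaction-marker bits. A sketch with flag values modeled on Badger's entry meta byte (treat the exact values as illustrative):

```go
package main

import "fmt"

// Meta bit flags for an entry's meta byte (values illustrative).
const (
	bitDelete                 byte = 1 << 0
	bitDiscardEarlierVersions byte = 1 << 2
	bitTxn                    byte = 1 << 6
	bitFinTxn                 byte = 1 << 7
)

// stripTxnMarkers clears ONLY the transaction markers when the GC moves
// a key, preserving bitDiscardEarlierVersions and every other bit. The
// old buggy behavior was equivalent to dropping all bits.
func stripTxnMarkers(meta byte) byte {
	return meta &^ (bitTxn | bitFinTxn) // AND NOT: clear just these two
}

func main() {
	meta := bitTxn | bitDiscardEarlierVersions
	fmt.Printf("%08b\n", stripTxnMarkers(meta)) // 00000100
}
```

Go's `&^` (AND NOT) operator makes the intent explicit: the mask names exactly which bits are removed, and everything else survives the rewrite.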
…v3.2011

 Conflicts:
	go.mod
	go.sum
	stream.go
	stream_test.go
	test.sh
Review thread on pb/badgerpb2.proto (outdated, resolved)
@jarifibrahim (Contributor, Author) left a comment:

LGTM

@aman-bansal merged commit 6266a4e into release/v3.2011 on Jan 15, 2021
@joshua-goldstein deleted the ibrahim/r3.2011/update branch on October 14, 2022
Labels: none yet
9 participants