Optimize XidtoUID map used by live and bulk loader #2998
Conversation
Reviewed 8 of 8 files at r1.
Reviewable status: all files reviewed, 2 unresolved discussions (waiting on @manishrjain)
xidmap/xidmap.go, line 60 at r1 (raw file):
} // This must already have a write lock.
Change comment format to match what the linter expects. Something like:
assign assumes the lock is already acquired at call time.
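For illustration, a minimal, hedged sketch of that linter-friendly comment placement (the doc comment starts with the identifier it documents). The `shard` struct and `assign` signature here are made up; only the comment style is the point:

```go
package xidmap

import "sync"

// shard is a hypothetical stand-in for the real struct; only the comment
// placement below matters.
type shard struct {
	sync.Mutex
	nextUid uint64
}

// assign assumes the lock is already acquired at call time.
func (sh *shard) assign() uint64 {
	uid := sh.nextUid
	sh.nextUid++
	return uid
}
```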
xidmap/xidmap.go, line 198 at r1 (raw file):
// BumpTo can be used to make Zero allocate UIDs up to this given number. Attempts are made to // ensure all future allocations of UIDs be higher than this one, but result is not guaranteed.
nit: results are not guaranteed
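With the nit applied, the doc comment would read as below. The receiver, signature, and `maxSeenUid` field are assumptions sketched from the change list, not the actual implementation:

```go
package xidmap

// XidMap here is a hypothetical stub carrying only what the comment needs.
type XidMap struct {
	maxSeenUid uint64
}

// BumpTo can be used to make Zero allocate UIDs up to this given number. Attempts are made to
// ensure all future allocations of UIDs be higher than this one, but results are not guaranteed.
func (m *XidMap) BumpTo(uid uint64) {
	if uid <= m.maxSeenUid {
		return
	}
	// The real code would ask Zero directly for a range covering uid,
	// instead of looping through the newRanges channel.
	m.maxSeenUid = uid
}
```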
Reviewable status: 7 of 8 files reviewed, 1 unresolved discussion (waiting on @martinmr)
xidmap/xidmap.go, line 60 at r1 (raw file):
Previously, martinmr (Martin Martinez Rivera) wrote…
Change comment format to match what the linter expects. Something like:
assign assumes the lock is already acquired at call time.
Done.
I've spent the last few days looking at how to optimize the live mutation path in Dgraph server. While trying many things in the server (past commits included), I realized my optimizations in the server were not improving things much, with throughput saturating at 20-30K NQuads/sec. Turns out, it was the live loader that was causing the saturation. The XID to UID assigner was the bottleneck causing the throughput to stagnate, despite the server being underutilized.

This PR fixes that by optimizing the assigner. In particular, I've removed the slow LRU cache, added a buffer to the `newRanges` channel to ensure we always have a range handy when we run out, made passing a Badger DB instance optional so we can avoid doing disk writes if not required, and made other optimizations around how we lock, etc. (a rough sketch of the sharded map and buffered range channel follows the change list below). I also added benchmarks for the assigner, which show each allocation (tested via a parallel benchmark) takes 350 ns/op on my desktop. With these changes, the live loader throughput jumps to 100K-120K NQuads/sec on my desktop. In particular, pre-assigning UIDs to the RDF/JSON file yields maximum throughput. I can load 140M friend-graph RDFs in 25 mins.

Helps with dgraph-io#2975.

Changes:
* Work on optimizing XidToUid map.
* Add a test and benchmark for the xid to uid map.
* Working code with decreased memory usage. Includes a new BumpUp API.
* Working live loader, which can optionally just keep all the mappings in memory.
* Adding shards back to XidMap speeds up operations by a huge factor. Benchmark shows each allocation is 300ns.
* Make BumpTo much faster by calling Zero directly, instead of looping through the newRanges channel.
* Improve how BumpTo() happens by using a maxSeenUid variable.
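As a rough illustration of the approach described above (not the PR's actual code), here is a minimal sketch of a sharded xid→uid map backed by a buffered range channel. Names like `uidRange`, `rangeCh`, and `AssignUid` are assumptions:

```go
package xidmap

import (
	"hash/fnv"
	"sync"
)

// uidRange is an illustrative stand-in for a block of UIDs leased from Zero.
type uidRange struct{ start, end uint64 }

// XidMap assigns UIDs to external ids, sharded to reduce lock contention.
type XidMap struct {
	shards  []*shard
	rangeCh chan uidRange // Buffered, so a fresh range is usually ready when a shard runs out.
}

type shard struct {
	sync.Mutex
	uids map[string]uint64
	cur  uidRange
}

// New creates the map. A real implementation would also start a goroutine
// here that keeps rangeCh topped up by leasing ranges from Zero, and would
// optionally persist assignments to Badger.
func New(numShards, bufferedRanges int) *XidMap {
	m := &XidMap{rangeCh: make(chan uidRange, bufferedRanges)}
	for i := 0; i < numShards; i++ {
		m.shards = append(m.shards, &shard{uids: make(map[string]uint64)})
	}
	return m
}

// AssignUid returns the UID for xid, allocating a new one if needed.
func (m *XidMap) AssignUid(xid string) uint64 {
	h := fnv.New64a()
	h.Write([]byte(xid))
	sh := m.shards[h.Sum64()%uint64(len(m.shards))]

	sh.Lock()
	defer sh.Unlock()

	if uid, ok := sh.uids[xid]; ok {
		return uid
	}
	if sh.cur.start == 0 || sh.cur.start > sh.cur.end {
		sh.cur = <-m.rangeCh // Block only when the shard's current range is exhausted.
	}
	uid := sh.cur.start
	sh.cur.start++
	sh.uids[xid] = uid
	return uid
}
```

The idea this sketch tries to capture: per-shard locks keep concurrent callers from contending on one global mutex, and the buffered channel hides the latency of leasing a new UID range, so a shard only blocks when its current range is exhausted and no pre-fetched range is ready.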