Log structured merge trees #2

at15 · 2017-01-20T23:03:37Z

at15 · 2017-01-20T23:25:29Z

Blog

memtable + ss index + sstable
- are keys sorted in memtable
- the memtable is a hashtable or other? like I can use two array, one for key, one for value, it's just need to loop to find the array index of a given key
- does sstable store key when using ss index
- how is collapse achieved

leveldb code

https://github.com/google/leveldb/blob/master/db/memtable.cc

Take away

sstable indexes (key + offset) are loaded into memory
write goes to memtable
memtable get flushed to disk
sstable are merged (collapsed together?)

at15 · 2017-01-21T06:16:57Z

Quora

Take away

how to manage efficiently merging only sub-portions of the key-space
- LevelDB/RocksDB tackles it by liberally relying on a b-tree based intermediate layer, the filesystem.
- The LSM in Cassandra, HBase, and Hypertable are very close to the LevelDB "filesystem layered" approach

C_0, C_1 (this is not the case for sstable guys I guess)

C_0 is AVL tree
C_1 is B tree

About merging process

Once written, older generations are never modified. You do row modifications by rewriting the rows/records into a newer generation where it is found first.

The nice thing is that the old data is still being used by the system while this mergy happens "in the background". During the merge process, rows/records modified by newer generations or marked deleted are removed by simply not writing them. Once the merge is complete, this combined generation replaces the two it merged.

at15 · 2017-01-22T00:27:49Z

Original paper

C_1 (on disk) level is SB-Tree

The most important part in the original paper should be about rolling merge, and especially how it handles recovery, but as I assume, its main focus is on index

TODO: is the original paper focus on index but later application use this method for storing actual data

at15 · 2017-01-28T02:01:20Z

Some questions asked by other students

what happens when I delete something in the lower level, if you delete something that is in memory, you can add a tombstone for it, but if it is in disk, do you load it to buffer and add a tombstone for it, or you read from multi levels and apply the filter when merge

at15 self-assigned this Jan 20, 2017

at15 mentioned this issue Jan 21, 2017

log structured merge tree at15/mini-impl#5

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Log structured merge trees #2

Log structured merge trees #2

at15 commented Jan 20, 2017 •

edited

Loading

at15 commented Jan 20, 2017 •

edited

Loading

at15 commented Jan 21, 2017 •

edited

Loading

at15 commented Jan 22, 2017 •

edited

Loading

at15 commented Jan 28, 2017 •

edited

Loading

Log structured merge trees #2

Log structured merge trees #2

Comments

at15 commented Jan 20, 2017 • edited Loading

at15 commented Jan 20, 2017 • edited Loading

at15 commented Jan 21, 2017 • edited Loading

at15 commented Jan 22, 2017 • edited Loading

at15 commented Jan 28, 2017 • edited Loading

at15 commented Jan 20, 2017 •

edited

Loading

at15 commented Jan 20, 2017 •

edited

Loading

at15 commented Jan 21, 2017 •

edited

Loading

at15 commented Jan 22, 2017 •

edited

Loading

at15 commented Jan 28, 2017 •

edited

Loading