[occ] Iterateset tracking and validation implementation #337

udpatil · 2023-10-19T04:01:40Z

Describe your changes and provide context

This implements a tracked iterator that is used to keep track of keys that have been iterated, and to also save metadata about the iteration for LATER validation. The iterator will be replayed and if there are any new keys / any keys missing within the iteration range, it will fail validation. the actual values served by the iterator are covered by readset validation.

Additionally, the early stop behavior allows the iterateset to ONLY be sensitive to changes to the keys available WITHIN the iteration range. In the event that we perform iteration, and THEN write a key within the range of iteration, this will not fail iteration because we take a snapshot of the mvkv writeset at the moment of iteration, so when we replay the iterator, we populate that iterator with the writeset at that time, so we appropriately replicate the iterator behavior.

In the case that we encounter an ESTIMATE, we have to terminate the iterator validation and mark it as failed because it is impossible to know whether that ESTIMATE represents a value change or a delete, since the latter, will affect the keys available for iteration.

This change also implements handlers that iterators receive for updating readset and iterateset in the mvkv

Testing performed to validate your change

Unit tests for various iteration scenarios

codchen

overall looks good. Left some nits

codchen · 2023-10-19T04:10:15Z

store/multiversion/memiterator.go

 	// if we have a deleted value, return nil
 	if val.IsDeleted() {
+		mi.ReadsetHandler.UpdateReadSet(key, nil)


nit can this be a defer?

yup, changed, although i'm not certain that it changes the outcome that much?

codchen · 2023-10-19T04:17:47Z

store/multiversion/store.go

+
+	// get all writeset keys prior to index
+	keys := s.GetAllWritesetKeys()
+	for i := 0; i < index; i++ {


should we also check i < len(keys)?

why is that? i is used to access the writeset corresponding to a specific transaction index in the multiversion store, theres no guarantee that all indices i < len(keys) would be present this moment, in which case we should skip to the next index. I'll add in an ok check for the map value before iterating over the indexedWriteset for explicit checking of presence, but it should still no-op given that a range over a nil slice would no-op, right?

…, since we can validate again

## Describe your changes and provide context This implements a tracked iterator that is used to keep track of keys that have been iterated, and to also save metadata about the iteration for LATER validation. The iterator will be replayed and if there are any new keys / any keys missing within the iteration range, it will fail validation. the actual values served by the iterator are covered by readset validation. Additionally, the early stop behavior allows the iterateset to ONLY be sensitive to changes to the keys available WITHIN the iteration range. In the event that we perform iteration, and THEN write a key within the range of iteration, this will not fail iteration because we take a snapshot of the mvkv writeset at the moment of iteration, so when we replay the iterator, we populate that iterator with the writeset at that time, so we appropriately replicate the iterator behavior. In the case that we encounter an ESTIMATE, we have to terminate the iterator validation and mark it as failed because it is impossible to know whether that ESTIMATE represents a value change or a delete, since the latter, will affect the keys available for iteration. This change also implements handlers that iterators receive for updating readset and iterateset in the `mvkv` ## Testing performed to validate your change Unit tests for various iteration scenarios

udpatil added 2 commits October 18, 2023 20:14

implement iterateset validation

5b93778

fix early stop detection in tracked iterator and update tests

ddb92a8

udpatil requested review from stevenlanders and codchen October 19, 2023 04:01

codchen approved these changes Oct 19, 2023

View reviewed changes

udpatil added 4 commits October 18, 2023 23:36

resolved race with iterator close and removed some TODOs

65369e3

Address comments

ebcc614

Address comments

7cfcbe7

update test

8d72fc4

stevenlanders approved these changes Oct 19, 2023

View reviewed changes

udpatil added 3 commits October 19, 2023 08:40

add todo

828ad57

Add validation condition where estimates don't invalidate the readset…

34ee51f

…, since we can validate again

remove todo

690034d

udpatil merged commit b34d61c into occ-main Oct 19, 2023
14 checks passed

udpatil deleted the iterateset-validation branch October 19, 2023 14:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[occ] Iterateset tracking and validation implementation #337

[occ] Iterateset tracking and validation implementation #337

udpatil commented Oct 19, 2023 •

edited

Loading

codchen left a comment

codchen Oct 19, 2023

udpatil Oct 19, 2023

codchen Oct 19, 2023

udpatil Oct 19, 2023

[occ] Iterateset tracking and validation implementation #337

[occ] Iterateset tracking and validation implementation #337

Conversation

udpatil commented Oct 19, 2023 • edited Loading

Describe your changes and provide context

Testing performed to validate your change

codchen left a comment

Choose a reason for hiding this comment

codchen Oct 19, 2023

Choose a reason for hiding this comment

udpatil Oct 19, 2023

Choose a reason for hiding this comment

codchen Oct 19, 2023

Choose a reason for hiding this comment

udpatil Oct 19, 2023

Choose a reason for hiding this comment

udpatil commented Oct 19, 2023 •

edited

Loading