Initialize sequence numbers on a shrunken index #25321
Conversation
Bringing together shards in a shrunken index means that we need to address the start of history for the shrunken index. The problem here is that sequence numbers before the maximum of the maximum sequence numbers on the source shards can collide in the target shards in the shrunken index. To address this, we set the maximum sequence number and the local checkpoint on the target shards to this maximum of the maximum sequence numbers. This enables correct document-level semantics for documents indexed before the shrink, and history on the shrunken index will effectively start from here.
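A minimal sketch of the idea, using plain Lucene `IndexWriter` APIs. The class, the helper method, and the key string values are assumptions for illustration; they mirror, but are not, the actual Elasticsearch change shown in the diffs below.

```java
import java.io.IOException;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

import org.apache.lucene.index.IndexWriter;

final class ShrinkSeqNoInit {
    // Commit-data keys assumed to mirror SequenceNumbers.MAX_SEQ_NO and
    // SequenceNumbers.LOCAL_CHECKPOINT_KEY in Elasticsearch.
    static final String MAX_SEQ_NO = "max_seq_no";
    static final String LOCAL_CHECKPOINT_KEY = "local_checkpoint";

    static void stampStartOfHistory(final IndexWriter writer, final List<Long> sourceMaxSeqNos) throws IOException {
        // The new start of history: the maximum of the per-source-shard maximum
        // sequence numbers; -1 if no operations were ever performed.
        final long maxSeqNo = sourceMaxSeqNos.stream().mapToLong(Long::longValue).max().orElse(-1L);
        final Map<String, String> commitData = new HashMap<>(2);
        commitData.put(MAX_SEQ_NO, Long.toString(maxSeqNo));
        commitData.put(LOCAL_CHECKPOINT_KEY, Long.toString(maxSeqNo));
        // Both markers get the same value, so the local checkpoint on the
        // shrunken shard is already caught up to the max seq no.
        writer.setLiveCommitData(commitData.entrySet());
        writer.commit();
    }
}
```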
Thx @jasontedor
```diff
-                .collect(Collectors.toList()).toArray(new Directory[shards.size()]));
+        final Directory[] sources =
+            shards.stream().map(LocalShardSnapshot::getSnapshotDirectory).collect(Collectors.toList()).toArray(new Directory[0]);
```
Nit: there is `Stream#toArray`, i.e. `stream.toArray(Directory[]::new)`.
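For reference, the suggested form collapses the collect-then-convert chain into a single call (a sketch of the nit, not the committed code):

```java
// Stream#toArray with an array generator skips the intermediate List.
final Directory[] sources =
        shards.stream().map(LocalShardSnapshot::getSnapshotDirectory).toArray(Directory[]::new);
```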
```java
writer.setLiveCommitData(() -> {
    final HashMap<String, String> liveCommitData = new HashMap<>(2);
    liveCommitData.put(SequenceNumbers.MAX_SEQ_NO, Long.toString(maxSeqNo));
    liveCommitData.put(SequenceNumbers.LOCAL_CHECKPOINT_KEY, Long.toString(maxSeqNo));
    // The lambda implements Iterable<Map.Entry<String, String>>, so it must
    // return an iterator over the commit data.
    return liveCommitData.entrySet().iterator();
});
```
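For context, these values land in the commit's user data, so they can be read back when the shard is later opened. A minimal sketch using plain Lucene APIs (`DirectoryReader.listCommits` and `IndexCommit#getUserData`); the `directory` variable is an assumption standing in for the target shard's `Directory`:

```java
// Read the stamped seq-no markers back from the latest Lucene commit;
// listCommits returns commits ordered oldest to newest.
final List<IndexCommit> commits = DirectoryReader.listCommits(directory);
final Map<String, String> userData = commits.get(commits.size() - 1).getUserData();
final long maxSeqNo = Long.parseLong(userData.get(SequenceNumbers.MAX_SEQ_NO));
final long localCheckpoint = Long.parseLong(userData.get(SequenceNumbers.LOCAL_CHECKPOINT_KEY));
```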
Is the plan to do MAX_UNSAFE_AUTO_ID_TIMESTAMP_COMMIT_ID as a follow-up?
Yes, I edited the plan on #10708 to separate these out into separate line items; I do not like mixing things. I do forgive you for not seeing this. 😛
```diff
@@ -233,7 +228,8 @@ public void testCreateShrinkIndex() {
             .put("number_of_shards", randomIntBetween(2, 7))
             .put("index.version.created", version)
         ).get();
-        for (int i = 0; i < 20; i++) {
+        final int docs = randomIntBetween(1, 128);
```
do we want to test with 0 docs too?
Sure!
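Presumably the follow-up is just widening the lower bound of the random range (a sketch; the actual commit is not shown here):

```java
// Include the empty-index case: zero docs exercises a shrink where no
// operations were performed on the source shards.
final int docs = randomIntBetween(0, 128);
```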
Relates #10708