Pull index routing into strategy object #77211

nik9000 · 2021-09-02T18:35:52Z

This pulls the calculation of the shard id for an (id, routing) pair
into a little strategy class, IndexRouting. This is easier to test and
should be easier to extend.

My hope is that this is an incremental readability improvement. My
ulterior motive is that this is where I want to land our new
routing-by-dimensions work for tsdb.
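For readers who have not opened the diff, the shape of such a strategy object can be sketched roughly as below. This is a simplified illustration, not the class from this PR: the names `IndexRoutingSketch`, `of`, `Unpartitioned` and `Partitioned` are hypothetical, `String.hashCode` stands in for whatever hash the real implementation uses, and the shard count handling is reduced to a plain modulo.

```java
import java.util.Objects;

/** Hypothetical stand-in for a routing strategy; not the class added by this PR. */
abstract class IndexRoutingSketch {
    protected final int numShards;

    protected IndexRoutingSketch(int numShards) {
        if (numShards < 1) {
            throw new IllegalArgumentException("numShards must be positive");
        }
        this.numShards = numShards;
    }

    /** Pick a strategy once per index. A partition size of 1 means "not partitioned". */
    static IndexRoutingSketch of(int numShards, int partitionSize) {
        return partitionSize == 1 ? new Unpartitioned(numShards) : new Partitioned(numShards, partitionSize);
    }

    /** Shard number in [0, numShards) for an (id, routing) pair. */
    abstract int shardId(String id, String routing);

    /** Placeholder hash, assumed only for this sketch. */
    protected static int hash(String value) {
        return value.hashCode();
    }

    protected int hashToShard(int hash) {
        return Math.floorMod(hash, numShards);
    }

    /** Routes on the routing value when present, otherwise on the id. */
    static final class Unpartitioned extends IndexRoutingSketch {
        Unpartitioned(int numShards) {
            super(numShards);
        }

        @Override
        int shardId(String id, String routing) {
            return hashToShard(hash(routing == null ? id : routing));
        }
    }

    /** Spreads documents that share a routing value over partitionSize adjacent shards. */
    static final class Partitioned extends IndexRoutingSketch {
        private final int partitionSize;

        Partitioned(int numShards, int partitionSize) {
            super(numShards);
            this.partitionSize = partitionSize;
        }

        @Override
        int shardId(String id, String routing) {
            // Requiring a routing value here is an assumption of this sketch.
            Objects.requireNonNull(routing, "a partitioned index requires a routing value");
            int offset = Math.floorMod(hash(id), partitionSize);
            return hashToShard(hash(routing) + offset);
        }
    }
}
```

The point of the split is that each variant can be unit tested with nothing more than a shard count and a few strings, and a further variant becomes one more subclass.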

elasticmachine · 2021-09-02T18:35:55Z

Pinging @elastic/es-distributed (Team:Distributed)

nik9000

I feel bad adding another per-index object, but ultimately tsdb is going to need a larger object in the same place. In my mind it's just a different subclass of IndexRouting. And it'll contain a compiled matcher-like thing. So I think I want to build some way to deduplicate these things sooner rather than later.

nik9000 · 2021-09-02T18:37:47Z

server/src/test/java/org/elasticsearch/cluster/routing/IndexRoutingTests.java

+        }
+    }
+
+    public void testPartitionedIndex() {


This and all tests below it duplicate OperationRoutingTests. They are slightly lower level in that they don't test the creation of the IndexRouting from information in the IndexMetadata. But they are close. Paranoia drove me to duplicate them rather than move them.

Now that I've reworked how this is integrated I've moved these tests from OperationRoutingTests.

henningandersen · 2021-09-09T13:33:24Z

We discussed this on another channel and Nik will work on a different solution, trying to just share the IndexRouting per bulk request at least in the initial iteration.
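As an illustration of what sharing the routing object per bulk request could look like, here is a minimal sketch. It assumes a map that lives only for the duration of one request and is filled lazily with `Map.computeIfAbsent`. The names `PerRequestRoutingCache`, `Routing`, `Item` and `route` are hypothetical and are not taken from the PR.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

/** Hypothetical sketch of a routing cache scoped to a single bulk request. */
final class PerRequestRoutingCache {

    /** Stand-in for a per-index routing strategy. */
    interface Routing {
        int shardId(String id, String routing);
    }

    /** Stand-in for one item of a bulk request. */
    static final class Item {
        final String index;
        final String id;
        final String routing;

        Item(String index, String id, String routing) {
            this.index = index;
            this.id = id;
            this.routing = routing;
        }
    }

    /**
     * Resolve a shard for every item, building at most one Routing per index.
     * The cache is a local variable, so nothing outlives the request.
     */
    static int[] route(List<Item> items, Function<String, Routing> routingFactory) {
        Map<String, Routing> cache = new HashMap<>();
        int[] shards = new int[items.size()];
        for (int i = 0; i < items.size(); i++) {
            Item item = items.get(i);
            Routing routing = cache.computeIfAbsent(item.index, routingFactory);
            shards[i] = routing.shardId(item.id, item.routing);
        }
        return shards;
    }
}
```

Keeping the map local to the request avoids adding a long-lived per-index object, at the cost of rebuilding the strategy once per index for every bulk request.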

nik9000

@henningandersen I've updated this based on our conversation this morning.

nik9000 · 2021-09-09T18:59:11Z

server/src/test/java/org/elasticsearch/cluster/routing/IndexRoutingTests.java

+        }
+    }
+
+    public void testPartitionedIndex() {


Now that I've reworked how this is integrated I've moved these tests from OperationRoutingTests.

nik9000 · 2021-09-09T20:39:42Z

run elasticsearch-ci/part-2

henningandersen

LGTM, thanks Nik.

henningandersen · 2021-09-10T06:42:40Z

server/src/main/java/org/elasticsearch/action/bulk/TransportBulkAction.java

+                    IndexRouting indexRouting = indexRoutings.computeIfAbsent(
+                        concreteIndex,
+                        idx -> IndexRouting.fromIndexMetadata(clusterState.metadata().getIndexSafe(idx))
+                    );
+                    ShardId shardId = clusterService.operationRouting()
+                        .indexShards(clusterState, concreteIndex.getName(), indexRouting, docWriteRequest.id(), docWriteRequest.routing())
+                        .shardId();


As a future refinement (not necessarily in this PR), I wonder if OperationRouting should create the IndexRouting and the IndexRouting object then should have the indexShards (and more) method(s)? So the interaction here would be something like:

Suggested change

-                    IndexRouting indexRouting = indexRoutings.computeIfAbsent(
-                        concreteIndex,
-                        idx -> IndexRouting.fromIndexMetadata(clusterState.metadata().getIndexSafe(idx))
-                    );
-                    ShardId shardId = clusterService.operationRouting()
-                        .indexShards(clusterState, concreteIndex.getName(), indexRouting, docWriteRequest.id(), docWriteRequest.routing())
-                        .shardId();
+                    ShardId shardId = indexRoutings.computeIfAbsent(
+                        concreteIndex,
+                        idx -> clusterService.operationRouting().indexRouting(clusterState, idx)
+                    ).indexShards(docWriteRequest.id(), docWriteRequest.routing())
+                        .shardId();

I do see the problem in doing so for ShardSplittingQuery though. And it may be affected by your future work too, where you may want bulk to only create a new IndexRouting instance when the base parameters are different. So I am OK leaving this as is for now if you prefer to not tackle that now.

Moving methods like indexShards into IndexRouting makes sense. I liked that the class as it stands now doesn't need to know about stuff like the routing table. I think which way is right will show up more clearly after I add the source based routing stuff, so I'll keep it as it is for now and revisit once I have a proposal for source based routing.

henningandersen · 2021-09-10T06:45:22Z

server/src/main/java/org/elasticsearch/cluster/routing/IndexRouting.java

+     * Build the routing from {@link IndexMetadata}.
+     */
+    public static IndexRouting fromIndexMetadata(IndexMetadata indexMetadata) {
+        if (indexMetadata.getRoutingPartitionSize() == 1) {


nit, why not use the isRoutingPartitionedIndex:

Suggested change

-        if (indexMetadata.getRoutingPartitionSize() == 1) {
+        if (indexMetadata.isRoutingPartitionedIndex() == false) {

I went back and forth. I suppose I had the effects of the partition size in my head and liked the way I had it better. But I think you are right, it's clearer that way. I'll push the change.
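A small sketch of why the two spellings choose the same branch, assuming the predicate is defined as "partition size is not 1". That definition is an assumption of this example, and `PartitionCheckSketch` is a hypothetical stand-in rather than the real metadata class.

```java
/** Hypothetical stand-in that exposes both ways of asking the same question. */
final class PartitionCheckSketch {
    private final int routingPartitionSize;

    PartitionCheckSketch(int routingPartitionSize) {
        this.routingPartitionSize = routingPartitionSize;
    }

    int getRoutingPartitionSize() {
        return routingPartitionSize;
    }

    /** Assumed definition: an index is partitioned when the partition size is not 1. */
    boolean isRoutingPartitionedIndex() {
        return routingPartitionSize != 1;
    }

    /** The check as originally written: compare the raw setting. */
    String strategyBySize() {
        return getRoutingPartitionSize() == 1 ? "unpartitioned" : "partitioned";
    }

    /** The check as suggested: ask the named predicate. */
    String strategyByPredicate() {
        return isRoutingPartitionedIndex() == false ? "unpartitioned" : "partitioned";
    }
}
```

Under that assumption the two methods agree for every partition size, so the change only affects how the condition reads.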

elasticsearchmachine · 2021-09-10T13:52:02Z

💔 Backport failed

Status	Branch	Result
❌	7.x	Commit could not be cherrypicked due to conflicts

To backport manually, run `backport --upstream elastic/elasticsearch --pr 77211`

nik9000 added the >non-issue, :Distributed Indexing/Distributed, v8.0.0 and v7.16.0 labels Sep 2, 2021

elasticmachine added the Team:Distributed (Obsolete) label Sep 2, 2021

imotov requested a review from henningandersen September 8, 2021 20:35

nik9000 added 2 commits September 9, 2021 09:43

Merge branch 'master' into index_routing

2645ede

WIP

8f059d1

Merge branch 'master' into index_routing

5ba5ad2

henningandersen approved these changes Sep 10, 2021

nik9000 added 2 commits September 10, 2021 08:51

Merge branch 'master' into index_routing

c893bb0

Switch build check

d3d12f6

nik9000 added the auto-merge-without-approval and auto-backport-and-merge labels Sep 10, 2021

elasticsearchmachine merged commit b0b5cbd into elastic:master Sep 10, 2021

nik9000 added the backport pending label Sep 10, 2021

nik9000 removed the backport pending label Sep 10, 2021

nik9000 mentioned this pull request Sep 13, 2021

Add better support for metric data types (TSDB) #74660

Closed

jakelandis added v8.0.0-alpha2 and removed v8.0.0 labels Sep 15, 2021

wchaparro assigned nik9000 Dec 16, 2021

wchaparro unassigned nik9000 Dec 16, 2021
