Force Merge: clarify execution and storage requirements #33882

frederikbosch · 2018-09-20T06:20:30Z

Because of the additional space required you I was wondering how operations are executed: sync or async. Docs did not give me an answer. Using trial and error I found out it is executed synchronously. I thought the information to be useful for others too.

Because of the additional space required you I was wondering how the operations are executed: sync or async. Docs did not give me an answer. Using trial and error I found out it is executed synchronously. I thought the information to be useful for others too.

elasticmachine · 2018-09-21T04:38:08Z

Pinging @elastic/es-core-infra

javanna · 2018-10-02T14:23:48Z

hi @frederikbosch , thanks for your contribution!

How did you come to the conclusion that we execute the operation sequentially, one index at a time?

I had a look at how the force merge API works: the coordinating node resolves the indices included in the request, finds out which shards the operation has to be executed on, then looks at where such shards are allocated and sends one request per node to execute the shard level force-merge operation. Such requests are sent in an async fashion. This is not "sync one index at a time". It does look like on each node, the shard-level force-merge is executed synchronously, one shard at a time.

frederikbosch · 2018-10-05T15:21:24Z

@javanna Then my observation was not correct indeed. The indices I executed the operation on were all having one shard. Therefore I assumed that the operation was synchronous per index, while actually it is synchronous per shard.

frederikbosch · 2018-10-05T15:43:16Z

@javanna I updated the PR. My concern is the storage increase people have to take into account when running this operation. Since the operation is sync per shard, it occurs to me that it is an increase of approx 100% of the largest shard is what could be expected (assuming read-only indices).

javanna

I left a comment, thanks again @frederikbosch

javanna · 2018-10-11T13:25:06Z

docs/reference/indices/forcemerge.asciidoc

@@ -55,7 +55,8 @@ POST /kimchy/_forcemerge?only_expunge_deletes=false&max_num_segments=100&flush=t
 === Multi Index

 The force merge API can be applied to more than one index with a single call, or
-even on `_all` the indices.
+even on `_all` the indices. Multi index operations are executed async for all nodes,
+but one shard at a time per node. This will cause storage to increase per node.


Can we remove the async part please, which is generally how Elasticsearch works? Also, I think it is valuable to add info about storage requirements, but if we do so we should be more specific. The worst case is when force_merge is executed with max_num_segments set to 1, in which case we should clarify that the storage temporarily goes up to potentially 100% of the size of the shard being merged. That means that running each shard per node sequentially is good when looking at storage requirements ;)

javanna · 2018-10-23T07:56:04Z

hi @frederikbosch let me know if you need help with this, if you don't have time to make the requested changes I can also make them so I can merge your PR. Let me know what you prefer ;)

frederikbosch · 2018-10-23T07:59:19Z

@javanna Yes sorry. At this moment I think I cannot give follow-up to the PR. If you could finish it, please go ahead. I think you are also better in formulating the exact behaviour.

javanna · 2018-10-23T10:25:15Z

thanks @frederikbosch !

frederikbosch · 2018-10-23T10:30:16Z

Thank you!

* master: (24 commits) ingest: better support for conditionals with simulate?verbose (elastic#34155) [Rollup] Job deletion should be invoked on the allocated task (elastic#34574) [DOCS] .Security index is never auto created (elastic#34589) CCR: Requires soft-deletes on the follower (elastic#34725) re-enable bwc tests (elastic#34743) Empty GetAliases authorization fix (elastic#34444) INGEST: Document Processor Conditional (elastic#33388) [CCR] Add total fetch time leader stat (elastic#34577) SQL: Support pattern against compatible indices (elastic#34718) [CCR] Auto follow pattern APIs adjustments (elastic#34518) [Test] Remove dead code from ExceptionSerializationTests (elastic#34713) A small typo in migration-assistance doc (elastic#34704) ingest: processor stats (elastic#34724) SQL: Implement IN(value1, value2, ...) expression. (elastic#34581) Tests: Add checks to GeoDistanceQueryBuilderTests (elastic#34273) INGEST: Rename Pipeline Processor Param. (elastic#34733) Core: Move IndexNameExpressionResolver to java time (elastic#34507) [DOCS] Force Merge: clarify execution and storage requirements (elastic#33882) TESTING.asciidoc fix examples using forbidden annotation (elastic#34515) SQL: Implement `CONVERT`, an alternative to `CAST` (elastic#34660) ...

frederikbosch changed the title ~~Multi index operations are executed synchronously.~~ Force Merge: multi index operations are executed synchronously. Sep 20, 2018

astefan added the :Data Management/Indices APIs APIs to create and manage indices and templates label Sep 21, 2018

astefan added :Data Management/Indices APIs APIs to create and manage indices and templates and removed :Data Management/Indices APIs APIs to create and manage indices and templates labels Sep 21, 2018

javanna added the feedback_needed label Oct 2, 2018

Update remark on sync/async merging multiple indices

026365d

javanna requested changes Oct 11, 2018

View reviewed changes

javanna removed the feedback_needed label Oct 22, 2018

javanna added 2 commits October 23, 2018 12:21

clarified sentence

aaec237

Merge branch 'master' into patch-1

78d8332

javanna added >docs General docs changes v7.0.0 v6.5.0 labels Oct 23, 2018

javanna changed the title ~~Force Merge: multi index operations are executed synchronously.~~ Force Merge: clarify execution and storage requirements Oct 23, 2018

javanna merged commit 183c32d into elastic:master Oct 23, 2018

javanna pushed a commit that referenced this pull request Oct 23, 2018

[DOCS] Force Merge: clarify execution and storage requirements (#33882)

c4a687a

kcm pushed a commit that referenced this pull request Oct 30, 2018

[DOCS] Force Merge: clarify execution and storage requirements (#33882)

4564085

colings86 added v7.0.0-beta1 and removed v7.0.0 labels Feb 7, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Force Merge: clarify execution and storage requirements #33882

Force Merge: clarify execution and storage requirements #33882

frederikbosch commented Sep 20, 2018 •

edited

Loading

elasticmachine commented Sep 21, 2018

javanna commented Oct 2, 2018

frederikbosch commented Oct 5, 2018

frederikbosch commented Oct 5, 2018

javanna left a comment

javanna Oct 11, 2018

javanna commented Oct 23, 2018

frederikbosch commented Oct 23, 2018

javanna commented Oct 23, 2018

frederikbosch commented Oct 23, 2018

Force Merge: clarify execution and storage requirements #33882

Force Merge: clarify execution and storage requirements #33882

Conversation

frederikbosch commented Sep 20, 2018 • edited Loading

elasticmachine commented Sep 21, 2018

javanna commented Oct 2, 2018

frederikbosch commented Oct 5, 2018

frederikbosch commented Oct 5, 2018

javanna left a comment

Choose a reason for hiding this comment

javanna Oct 11, 2018

Choose a reason for hiding this comment

javanna commented Oct 23, 2018

frederikbosch commented Oct 23, 2018

javanna commented Oct 23, 2018

frederikbosch commented Oct 23, 2018

frederikbosch commented Sep 20, 2018 •

edited

Loading