
Conversation

@droberts195

This change introduces a new setting,
xpack.ml.process_connect_timeout, which allows the
timeout for an external ML process to connect to
the ES JVM to be increased.

The default timeout of 10 seconds is the same
as the hardcoded timeout in previous versions.

The timeout may need to be increased if many
processes are being started simultaneously on
the same machine. This is unlikely in clusters
with many ML nodes, as we balance the processes
across the ML nodes, but can happen in clusters
with a single ML node and a high value for
xpack.ml.node_concurrent_job_allocations.
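For context, here is a minimal sketch of how a dynamic time setting like this is typically declared in the Elasticsearch codebase. The holder class and field name below are illustrative assumptions rather than the PR's actual code; only the setting key and the 10s default come from the description above.

```java
import org.elasticsearch.common.settings.Setting;
import org.elasticsearch.common.settings.Setting.Property;
import org.elasticsearch.common.unit.TimeValue;

public final class MlProcessSettings {

    // Hypothetical holder class and field name (not the PR's actual code).
    // Dynamic   => the value can be updated at runtime via the cluster settings API.
    // NodeScope => the value is read from node/cluster settings, not index settings.
    public static final Setting<TimeValue> PROCESS_CONNECT_TIMEOUT =
        Setting.timeSetting(
            "xpack.ml.process_connect_timeout",
            TimeValue.timeValueSeconds(10),   // same default as the previously hardcoded timeout
            Property.Dynamic,
            Property.NodeScope);

    private MlProcessSettings() {}
}
```

Because the setting is dynamic, it could also be raised on a running cluster, for example with `PUT _cluster/settings` and a persistent value such as `"xpack.ml.process_connect_timeout": "20s"`, rather than editing elasticsearch.yml and restarting.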

@elasticmachine
Collaborator

Pinging @elastic/ml-core

@droberts195
Author

droberts195 commented Jun 14, 2019

I deliberately didn't use the configurable timeout for the controller process: at the point that's started we cannot possibly be starting lots of other processes simultaneously, so contention should not be a problem, and it's best that we don't wait a long time before failing the startup of Elasticsearch.
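To make that distinction concrete, a rough sketch using the illustrative names from the earlier snippet (again, not the PR's actual classes or methods):

```java
import org.elasticsearch.common.settings.Settings;
import org.elasticsearch.common.unit.TimeValue;

final class ProcessConnectTimeouts {

    // The controller is started once, during node startup, when no other ML
    // processes can be starting at the same time, so a short fixed timeout
    // keeps a failed Elasticsearch startup from hanging for long.
    static final TimeValue CONTROLLER_CONNECT_TIMEOUT = TimeValue.timeValueSeconds(10);

    // Job processes may be started many at a time on one machine, so their
    // connect timeout comes from the configurable setting instead.
    static TimeValue jobProcessConnectTimeout(Settings settings) {
        return MlProcessSettings.PROCESS_CONNECT_TIMEOUT.get(settings);
    }
}
```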

such an external process.

`xpack.ml.process_connect_timeout` (<<cluster-update-settings,Dynamic>>)::
Some {ml} processing is done by processes that run separately to the {es} JVM.
Contributor


It would be nice to have a brief introductory explanation here, akin to what we have in other timeout definitions.
e.g.

The connect(ion?) timeout for {ml} processes that run separately from the {es} JVM. Defaults to 10s. When such processes...

Contributor

@dimitris-athanasiou left a comment


LGTM

@droberts195
Author

Jenkins run elasticsearch-ci/1

droberts195 merged commit 76ad7d8 into elastic:master Jun 25, 2019
droberts195 deleted the make_connect_timeout_a_setting branch June 25, 2019 15:36
droberts195 pushed a commit that referenced this pull request Jun 26, 2019
droberts195 pushed a commit that referenced this pull request Jun 26, 2019
droberts195 pushed a commit that referenced this pull request Jun 26, 2019
