Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ML] Parent datafeed actions to the datafeed's persistent task #81143

Conversation

dimitris-athanasiou
Copy link
Contributor

@dimitris-athanasiou dimitris-athanasiou commented Nov 30, 2021

The vast majority of a datafeed's actions are executed from the
data extractor. This includes the heaviest actions which are the
searches. These are currently not parented correctly to the datafeed's
task. This makes it hard to trace search tasks back to their datafeeds.

This commit passes a ParentTaskAssigningClient to
DataExtractorFactory.create which ensures the client used by
any extractor will be setting the corresponding task id: the action
task id for preview datafeed and the master operation stage of the
start datafeed action, and the persistent task id for the datafeed
operations after it has started.

The vast majority of a datafeed's actions are executed from the
data extractor. This includes the heaviest actions which are the
searches. This commit passes a `ParentTaskAssigningClient` to
`DataExtractorFactory.create` which ensures the client used by
any extractor will be setting the corresponding task id: the action
task id for preview datafeed and the master operation stage of the
start datafeed action, and the persistent task id for the datafeed
operations after it has started.
@elasticmachine elasticmachine added the Team:ML Meta label for the ML team label Nov 30, 2021
@elasticmachine
Copy link
Collaborator

Pinging @elastic/ml-core (Team:ML)

Copy link
Contributor

@przemekwitek przemekwitek left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@dimitris-athanasiou dimitris-athanasiou merged commit fe7c2d5 into elastic:master Nov 30, 2021
dimitris-athanasiou added a commit to dimitris-athanasiou/elasticsearch that referenced this pull request Nov 30, 2021
…ic#81143)

The vast majority of a datafeed's actions are executed from the
data extractor. This includes the heaviest actions which are the
searches. This commit passes a `ParentTaskAssigningClient` to
`DataExtractorFactory.create` which ensures the client used by
any extractor will be setting the corresponding task id: the action
task id for preview datafeed and the master operation stage of the
start datafeed action, and the persistent task id for the datafeed
operations after it has started.
@elasticsearchmachine
Copy link
Collaborator

💚 Backport successful

Status Branch Result
8.0
7.16

dimitris-athanasiou added a commit to dimitris-athanasiou/elasticsearch that referenced this pull request Nov 30, 2021
…ic#81143)

The vast majority of a datafeed's actions are executed from the
data extractor. This includes the heaviest actions which are the
searches. This commit passes a `ParentTaskAssigningClient` to
`DataExtractorFactory.create` which ensures the client used by
any extractor will be setting the corresponding task id: the action
task id for preview datafeed and the master operation stage of the
start datafeed action, and the persistent task id for the datafeed
operations after it has started.
elasticsearchmachine pushed a commit that referenced this pull request Nov 30, 2021
… (#81152)

The vast majority of a datafeed's actions are executed from the
data extractor. This includes the heaviest actions which are the
searches. This commit passes a `ParentTaskAssigningClient` to
`DataExtractorFactory.create` which ensures the client used by
any extractor will be setting the corresponding task id: the action
task id for preview datafeed and the master operation stage of the
start datafeed action, and the persistent task id for the datafeed
operations after it has started.
weizijun added a commit to weizijun/elasticsearch that referenced this pull request Nov 30, 2021
* upstream/master:
  [ML] Parent datafeed actions to the datafeed's persistent task (elastic#81143)
  Simplify ConcreteIndices and its usage in TransportBulkAction (elastic#81098)
  Unmute DataStreamsSnapshotsIT#testRestoreDataStreamAliasWithConflictingIndicesAlias() test (elastic#81142)
  TSDB: Do not allow index splits for time series indices (elastic#81125)
  Reduce verbosity-increase timeout to 3m (elastic#81118)
  Mute DataStreamsSnapshotsIT#testRestoreDataStreamAliasWithConflictingIndicesAlias() test
  Fix stopping of old elasticsearch cluster (elastic#81059)
  Fix data stream alias validation. (elastic#81040)
  Add replicated field to get data stream api response. (elastic#80988)
elasticsearchmachine pushed a commit that referenced this pull request Nov 30, 2021
…#81143) (#81153)

* [ML] Parent datafeed actions to the datafeed's persistent task (#81143)

The vast majority of a datafeed's actions are executed from the
data extractor. This includes the heaviest actions which are the
searches. This commit passes a `ParentTaskAssigningClient` to
`DataExtractorFactory.create` which ensures the client used by
any extractor will be setting the corresponding task id: the action
task id for preview datafeed and the master operation stage of the
start datafeed action, and the persistent task id for the datafeed
operations after it has started.

* Can't parent on the start datafeed master op as we don't have task
@dimitris-athanasiou dimitris-athanasiou deleted the parent-datafeed-searches branch November 30, 2021 15:51
@danhermann danhermann added v7.16.0 and removed v7.16.1 labels Dec 3, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
>bug :ml Machine learning Team:ML Meta label for the ML team v7.16.0 v8.0.0-rc1 v8.1.0
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants