[Search Pipelines] Add stats for search pipelines #8053

msfroh · 2023-06-14T00:18:44Z

Description

This adds statistics on executions and time spent on search pipeline operations, similar to the stats that are available for ingest pipelines.

Related Issues

Resolves #6723

Check List

New functionality includes testing.
- All tests pass
New functionality has been documented.
- New functionality has javadoc added
Commits are signed per the DCO using --signoff
Commit changes are listed out in CHANGELOG.md file (See: Changelog)

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

github-actions · 2023-06-14T00:24:26Z

Gradle Check (Jenkins) Run Completed with:

RESULT: FAILURE ❌
URL: https://build.ci.opensearch.org/job/gradle-check/17519/
CommitID: a94b09a
Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green.
Is the failure a flaky test unrelated to your change?

github-actions · 2023-06-14T00:53:33Z

Gradle Check (Jenkins) Run Completed with:

RESULT: FAILURE ❌
URL: https://build.ci.opensearch.org/job/gradle-check/17520/
CommitID: 97f64b3
Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green.
Is the failure a flaky test unrelated to your change?

github-actions · 2023-06-14T01:08:18Z

Gradle Check (Jenkins) Run Completed with:

RESULT: FAILURE ❌
URL: https://build.ci.opensearch.org/job/gradle-check/17522/
CommitID: 3f3a7bb
Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green.
Is the failure a flaky test unrelated to your change?

github-actions · 2023-06-14T19:40:20Z

Gradle Check (Jenkins) Run Completed with:

RESULT: FAILURE ❌
URL: https://build.ci.opensearch.org/job/gradle-check/17599/
CommitID: 0d3c6db
Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green.
Is the failure a flaky test unrelated to your change?

github-actions · 2023-06-15T01:06:07Z

Gradle Check (Jenkins) Run Completed with:

RESULT: UNSTABLE ❕
TEST FAILURES:

      1 org.opensearch.snapshots.DedicatedClusterSnapshotRestoreIT.testIndexDeletionDuringSnapshotCreationInQueue
      1 org.opensearch.search.backpressure.SearchBackpressureIT.testSearchShardTaskCancellationWithHighCpu

URL: https://build.ci.opensearch.org/job/gradle-check/17628/
CommitID: a6a5784
Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure.

codecov · 2023-06-15T01:08:42Z

Codecov Report

Merging #8053 (159e568) into main (1cf9c5c) will increase coverage by 0.03%.
The diff coverage is 87.93%.

@@             Coverage Diff              @@
##               main    #8053      +/-   ##
============================================
+ Coverage     70.89%   70.92%   +0.03%     
- Complexity    56639    56656      +17     
============================================
  Files          4722     4725       +3     
  Lines        267674   267987     +313     
  Branches      39232    39267      +35     
============================================
+ Hits         189757   190063     +306     
+ Misses        61937    61864      -73     
- Partials      15980    16060      +80

Impacted Files	Coverage Δ
...min/cluster/stats/TransportClusterStatsAction.java	`69.56% <ø> (ø)`
...src/main/java/org/opensearch/node/NodeService.java	`73.49% <0.00%> (-0.90%)`	⬇️
...rch/action/admin/cluster/node/stats/NodeStats.java	`50.54% <33.33%> (-0.90%)`	⬇️
.../action/admin/cluster/stats/ClusterStatsNodes.java	`55.00% <55.55%> (+5.83%)`	⬆️
.../org/opensearch/common/metrics/OperationStats.java	`78.78% <78.78%> (ø)`
.../java/org/opensearch/search/pipeline/Pipeline.java	`87.83% <85.96%> (-1.82%)`	⬇️
...pensearch/search/pipeline/SearchPipelineStats.java	`89.44% <89.44%> (ø)`
...pensearch/search/pipeline/PipelineWithMetrics.java	`90.90% <90.90%> (ø)`
...main/java/org/opensearch/ingest/IngestService.java	`83.37% <93.75%> (-1.16%)`	⬇️
...on/admin/cluster/node/stats/NodesStatsRequest.java	`93.33% <100.00%> (+0.09%)`	⬆️
... and 8 more

... and 481 files with indirect coverage changes

github-actions · 2023-06-22T21:15:48Z

Gradle Check (Jenkins) Run Completed with:

RESULT: UNSTABLE ❕
TEST FAILURES:

      2 org.opensearch.remotestore.RemoteStoreRefreshListenerIT.testRemoteRefreshRetryOnFailure
      1 org.opensearch.cluster.routing.allocation.decider.DiskThresholdDeciderIT.testIndexCreateBlockIsRemovedWhenAnyNodesNotExceedHighWatermarkWithAutoReleaseEnabled

URL: https://build.ci.opensearch.org/job/gradle-check/18225/
CommitID: 159e568
Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure.

msfroh · 2023-06-23T19:57:04Z

@saratvemulapalli -- Addressed your suggestions.

Windows precommit failed, but that seems to be a "Windows build server" thing and not a "this PR" thing.

saratvemulapalli · 2023-06-26T17:18:37Z

@saratvemulapalli -- Addressed your suggestions.

Windows precommit failed, but that seems to be a "Windows build server" thing and not a "this PR" thing.

Thanks @msfroh. I've triggered a re-run of window precommit. I haven't seen failures just in Windows platform.
If the failure persists we can dig in.

saratvemulapalli

Thanks @msfroh for taking care of the feedback.
LGTM!

msfroh · 2023-06-26T20:13:59Z

Thanks, @saratvemulapalli!

I resolved the open conversations, so it looks like there are no immediate merge blockers.

msfroh · 2023-06-26T20:55:29Z

Oh, the "Some checks were not successful" note seems to be there because the Mac OS precommit was canceled when the Windows precommit failed.

If someone can retry the Mac OS precommit, I think that should turn green.

opensearch-trigger-bot · 2023-06-28T21:27:50Z

The backport to 2.x failed:

The process '/usr/bin/git' failed with exit code 128

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add ../.worktrees/backport-2.x 2.x
# Navigate to the new working tree
pushd ../.worktrees/backport-2.x
# Create a new branch
git switch --create backport/backport-8053-to-2.x
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 46c9a211b6b9490f6a7ac9425e946986cd51bed2
# Push it to GitHub
git push --set-upstream origin backport/backport-8053-to-2.x
# Go back to the original working tree
popd
# Delete the working tree
git worktree remove ../.worktrees/backport-2.x

Then, create a pull request where the base branch is 2.x and the compare/head branch is backport/backport-8053-to-2.x.

dblock · 2023-06-28T23:06:39Z

@msfroh needs manual backport if you want this in 2.x

mingshl · 2023-06-29T17:26:13Z

I will help with the backport to 2.x

@lukas-vlcek

…#8053) * [Search Pipelines] Add stats for search pipelines This adds statistics on executions and time spent on search pipeline operations, similar to the stats that are available for ingest pipelines. Signed-off-by: Michael Froh <froh@amazon.com> * Compare parsed JSON structure, not exact JSON string As @lukas-vlcek pointed out, asserting equality with an exact JSON string is sensitive to formatting, which makes the test brittle. Instead, we can parse the expected JSON and compare as Maps. Signed-off-by: Michael Froh <froh@amazon.com> * Refactor to common stats/metrics classes Search pipelines and ingest pipelines had identical functionality for tracking metrics around operations and converting those to immutable "stats" objects. That approach isn't even really specific to pipelines, but can be used to track metrics on any repeated operation, so I moved that common logic to the common.metrics package. Signed-off-by: Michael Froh <froh@amazon.com> * Split pipeline metrics tracking into its own class Thanks @saratvemulapalli for the suggestion! This lets the Pipeline class focus on transforming requests / responses, while the subclass focuses on tracking and managing metrics. Signed-off-by: Michael Froh <froh@amazon.com> --------- Signed-off-by: Michael Froh <froh@amazon.com> (cherry picked from commit 46c9a21)

@lukas-vlcek

…#8053) * [Search Pipelines] Add stats for search pipelines This adds statistics on executions and time spent on search pipeline operations, similar to the stats that are available for ingest pipelines. Signed-off-by: Michael Froh <froh@amazon.com> * Compare parsed JSON structure, not exact JSON string As @lukas-vlcek pointed out, asserting equality with an exact JSON string is sensitive to formatting, which makes the test brittle. Instead, we can parse the expected JSON and compare as Maps. Signed-off-by: Michael Froh <froh@amazon.com> * Refactor to common stats/metrics classes Search pipelines and ingest pipelines had identical functionality for tracking metrics around operations and converting those to immutable "stats" objects. That approach isn't even really specific to pipelines, but can be used to track metrics on any repeated operation, so I moved that common logic to the common.metrics package. Signed-off-by: Michael Froh <froh@amazon.com> * Split pipeline metrics tracking into its own class Thanks @saratvemulapalli for the suggestion! This lets the Pipeline class focus on transforming requests / responses, while the subclass focuses on tracking and managing metrics. Signed-off-by: Michael Froh <froh@amazon.com> --------- Signed-off-by: Michael Froh <froh@amazon.com> (cherry picked from commit 46c9a21)

backport commit 46c9a21 to `2.x` branch. * [Search Pipelines] Add stats for search pipelines (#8053) This adds statistics on executions and time spent on search pipeline operations, similar to the stats that are available for ingest pipelines. Signed-off-by: Mingshi Liu <mingshl@amazon.com> Co-authored-by: Michael Froh <froh@amazon.com>

Update the version of BWC test from 3.0 to 2.9 for search pipeline statistic after the PR #8053 / commit 46c9a21 backported to 2.x branch. Signed-off-by: Mingshi Liu <mingshl@amazon.com>

Update the version of BWC test from 3.0 to 2.9 for search pipeline statistic after the PR opensearch-project#8053 / commit opensearch-project@46c9a21 backported to 2.x branch. Signed-off-by: Mingshi Liu <mingshl@amazon.com>

Update the version of BWC test from 3.0 to 2.9 for search pipeline statistic after the PR opensearch-project#8053 / commit opensearch-project@46c9a21 backported to 2.x branch. Signed-off-by: Mingshi Liu <mingshl@amazon.com> Signed-off-by: sahil buddharaju <sahilbud@amazon.com>

@lukas-vlcek

…#8053) * [Search Pipelines] Add stats for search pipelines This adds statistics on executions and time spent on search pipeline operations, similar to the stats that are available for ingest pipelines. Signed-off-by: Michael Froh <froh@amazon.com> * Compare parsed JSON structure, not exact JSON string As @lukas-vlcek pointed out, asserting equality with an exact JSON string is sensitive to formatting, which makes the test brittle. Instead, we can parse the expected JSON and compare as Maps. Signed-off-by: Michael Froh <froh@amazon.com> * Refactor to common stats/metrics classes Search pipelines and ingest pipelines had identical functionality for tracking metrics around operations and converting those to immutable "stats" objects. That approach isn't even really specific to pipelines, but can be used to track metrics on any repeated operation, so I moved that common logic to the common.metrics package. Signed-off-by: Michael Froh <froh@amazon.com> * Split pipeline metrics tracking into its own class Thanks @saratvemulapalli for the suggestion! This lets the Pipeline class focus on transforming requests / responses, while the subclass focuses on tracking and managing metrics. Signed-off-by: Michael Froh <froh@amazon.com> --------- Signed-off-by: Michael Froh <froh@amazon.com>

Update the version of BWC test from 3.0 to 2.9 for search pipeline statistic after the PR opensearch-project#8053 / commit opensearch-project@46c9a21 backported to 2.x branch. Signed-off-by: Mingshi Liu <mingshl@amazon.com>

lukas-vlcek · 2023-08-01T15:28:31Z

Hi,

I wanted to friendly point out that IMO PRs like this should have had the breaking label.

I am maintaining external plugin (specifically the Prometheus exporter plugin) and as you can imagine renaming some of the methods to get operational stats about Ingest operations had impact (as the plugin needs to call them).

Specifically:

nodeIngestStats.getIngestCount()        ->  getCount()
nodeIngestStats.getIngestFailedCount()  ->  getFailedCount()
nodeIngestStats.getIngestCurrent()      ->  getCurrent()
nodeIngestStats.getIngestTimeInMillis() ->  getTotalTimeInMillis()

I totally get it that given current state of API isolation it is possible that any internal renaming is possibly a "breaking" change but hey ... it is. I do not know if it makes sense to establish a new rule about how the "breaking" label is applied but the fact is that change like this can cause compilation error somewhere else.

Lukáš

msfroh · 2023-08-01T15:37:49Z

I totally get it that given current state of API isolation it is possible that any internal renaming is possibly a "breaking" change but hey ... it is. I do not know if it makes sense to establish a new rule about how the "breaking" label is applied but the fact is that change like this can cause compilation error somewhere else.

Oh, shoot. I didn't expect that.

I'm not sure what the best option is there. I'll aim to add the "breaking" label if I rename an existing method in future, at least.

lukas-vlcek · 2023-08-01T15:41:07Z

@msfroh Do not feel sorry for this.
With 2.9.0 there was another breaking change that impacted me, see #7508 (comment)

I also do not know what is the solution here. We could introduce something like "internal_API_change" label which will not go to the change log but can be used on GitHub to filter tickets but it would have to be applied consistently and there is no way how to achieve this (except by increasing the work load).

The methods to get operation stats for Ingest operations were renamed. See the following ticket for more details: <opensearch-project/OpenSearch#8053> Signed-off-by: Lukáš Vlček <lukas.vlcek@aiven.io>

@lukas-vlcek

…#8053) * [Search Pipelines] Add stats for search pipelines This adds statistics on executions and time spent on search pipeline operations, similar to the stats that are available for ingest pipelines. Signed-off-by: Michael Froh <froh@amazon.com> * Compare parsed JSON structure, not exact JSON string As @lukas-vlcek pointed out, asserting equality with an exact JSON string is sensitive to formatting, which makes the test brittle. Instead, we can parse the expected JSON and compare as Maps. Signed-off-by: Michael Froh <froh@amazon.com> * Refactor to common stats/metrics classes Search pipelines and ingest pipelines had identical functionality for tracking metrics around operations and converting those to immutable "stats" objects. That approach isn't even really specific to pipelines, but can be used to track metrics on any repeated operation, so I moved that common logic to the common.metrics package. Signed-off-by: Michael Froh <froh@amazon.com> * Split pipeline metrics tracking into its own class Thanks @saratvemulapalli for the suggestion! This lets the Pipeline class focus on transforming requests / responses, while the subclass focuses on tracking and managing metrics. Signed-off-by: Michael Froh <froh@amazon.com> --------- Signed-off-by: Michael Froh <froh@amazon.com> Signed-off-by: Shivansh Arora <hishiv@amazon.com>

Update the version of BWC test from 3.0 to 2.9 for search pipeline statistic after the PR opensearch-project#8053 / commit opensearch-project@46c9a21 backported to 2.x branch. Signed-off-by: Mingshi Liu <mingshl@amazon.com> Signed-off-by: Shivansh Arora <hishiv@amazon.com>

msfroh force-pushed the search_pipeline_metrics branch from a94b09a to 97f64b3 Compare June 14, 2023 00:19

msfroh force-pushed the search_pipeline_metrics branch from 97f64b3 to 3f3a7bb Compare June 14, 2023 00:32

msfroh force-pushed the search_pipeline_metrics branch from 3f3a7bb to 0d3c6db Compare June 14, 2023 18:59

msfroh force-pushed the search_pipeline_metrics branch from 0d3c6db to a6a5784 Compare June 15, 2023 00:31

msfroh marked this pull request as ready for review June 15, 2023 01:13

msfroh requested review from reta, anasalkouz, andrross, Bukhtawar, CEHENKLE, dblock, gbbafna, setiah, kartg, kotwanikunal, mch2, nknize, owaiskazi19, Rishikesh1159, ryanbogan, saratvemulapalli, shwetathareja, dreamer-89 and tlfeng as code owners June 15, 2023 01:13

msfroh force-pushed the search_pipeline_metrics branch from 88074fd to 159e568 Compare June 22, 2023 20:44

saratvemulapalli approved these changes Jun 26, 2023

View reviewed changes

dblock merged commit 46c9a21 into opensearch-project:main Jun 28, 2023

msfroh mentioned this pull request Jun 28, 2023

Adding the SearchPhaseResultsProcessor interface in Search Pipeline #7283

Merged

6 tasks

mingshl mentioned this pull request Jun 30, 2023

[Backport 2.x] [Search Pipelines] Add stats for search pipelines #8376

Merged

6 tasks

noCharger mentioned this pull request Jul 6, 2023

[DOC] Search Pipeline Metrics opensearch-project/documentation-website#4511

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Search Pipelines] Add stats for search pipelines #8053

[Search Pipelines] Add stats for search pipelines #8053

msfroh commented Jun 14, 2023

github-actions bot commented Jun 14, 2023

github-actions bot commented Jun 14, 2023

github-actions bot commented Jun 14, 2023

github-actions bot commented Jun 14, 2023

github-actions bot commented Jun 15, 2023

codecov bot commented Jun 15, 2023 •

edited

Loading

github-actions bot commented Jun 22, 2023

msfroh commented Jun 23, 2023

saratvemulapalli commented Jun 26, 2023

saratvemulapalli left a comment

msfroh commented Jun 26, 2023

msfroh commented Jun 26, 2023 •

edited

Loading

opensearch-trigger-bot bot commented Jun 28, 2023

dblock commented Jun 28, 2023

mingshl commented Jun 29, 2023

lukas-vlcek commented Aug 1, 2023

msfroh commented Aug 1, 2023

lukas-vlcek commented Aug 1, 2023

[Search Pipelines] Add stats for search pipelines #8053

[Search Pipelines] Add stats for search pipelines #8053

Conversation

msfroh commented Jun 14, 2023

Description

Related Issues

Check List

github-actions bot commented Jun 14, 2023

Gradle Check (Jenkins) Run Completed with:

github-actions bot commented Jun 14, 2023

Gradle Check (Jenkins) Run Completed with:

github-actions bot commented Jun 14, 2023

Gradle Check (Jenkins) Run Completed with:

github-actions bot commented Jun 14, 2023

Gradle Check (Jenkins) Run Completed with:

github-actions bot commented Jun 15, 2023

Gradle Check (Jenkins) Run Completed with:

codecov bot commented Jun 15, 2023 • edited Loading

Codecov Report

github-actions bot commented Jun 22, 2023

Gradle Check (Jenkins) Run Completed with:

msfroh commented Jun 23, 2023

saratvemulapalli commented Jun 26, 2023

saratvemulapalli left a comment

Choose a reason for hiding this comment

msfroh commented Jun 26, 2023

msfroh commented Jun 26, 2023 • edited Loading

opensearch-trigger-bot bot commented Jun 28, 2023

dblock commented Jun 28, 2023

mingshl commented Jun 29, 2023

lukas-vlcek commented Aug 1, 2023

msfroh commented Aug 1, 2023

lukas-vlcek commented Aug 1, 2023

codecov bot commented Jun 15, 2023 •

edited

Loading

msfroh commented Jun 26, 2023 •

edited

Loading