Add listenable TransportRequestHandler in TransportNodesAction #15166

zane-neo · 2024-08-08T07:34:37Z

Description

Add listenable TransportRequestHandler in TransportNodesAction.

Use case:
In ml-commons, one scenario is deploying a model to the cluster. This action first reads the model metadata from an index, then performs a time-consuming operation for locally running models and a lightweight operation for remotely running models.

The problem is that the current TransportNodesAction API requires developers to return a response directly from nodeOperation, so we have to work around this by forwarding the deploy result to a listener that updates the model status. The workflow looks like this:

This works fine for time-consuming models (run locally) but not for remote models, because deploying a remote model is lightweight: it only needs to create several objects in memory after the metadata is read. Once this API is enhanced to support a listenable TransportRequestHandler, we can pass the listener to nodeOperation and return the actual model status rather than a "deploying" status.

We also recently found a model deployment issue caused by this result-forwarding step. Simply put, if a node crashes during model deployment, the listener cannot receive all responses and the model status is never updated. Please refer to opensearch-project/ml-commons#2970.

Enhancing this class therefore offers a straightforward solution for this scenario, and I believe it can benefit other similar cases that need to accept a listener during nodeOperation.
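
To make the contrast concrete, here is a minimal, self-contained sketch of the two styles. It does not use the real OpenSearch classes: `Listener`, `DeploySketch`, `readModelType`, `deployStatus`, the `remote-` ID prefix, and the `DEPLOYING`/`DEPLOYED` labels are all hypothetical names chosen for illustration, and the metadata read is simulated with a `CompletableFuture`. The actual signature added by this PR may differ.

```java
import java.util.concurrent.CompletableFuture;

// Hypothetical stand-in for OpenSearch's ActionListener, reduced to what this sketch needs.
interface Listener<T> {
    void onResponse(T response);
    void onFailure(Exception e);
}

final class DeploySketch {
    // Simulated asynchronous metadata read; real code would query the model index.
    static CompletableFuture<String> readModelType(String modelId) {
        return CompletableFuture.supplyAsync(() -> modelId.startsWith("remote-") ? "remote" : "local");
    }

    // Return-value style: the method must answer before the read completes,
    // so it can only report an interim status.
    static String nodeOperation(String modelId) {
        readModelType(modelId); // the real outcome has to be forwarded elsewhere later
        return "DEPLOYING";
    }

    // Listener style: respond only once the outcome is actually known.
    static void nodeOperation(String modelId, Listener<String> listener) {
        readModelType(modelId).whenComplete((type, err) -> {
            if (err != null) {
                listener.onFailure(new RuntimeException(err));
            } else {
                // Assumption: a remote model is fully deployed right after the metadata read.
                listener.onResponse("remote".equals(type) ? "DEPLOYED" : "DEPLOYING");
            }
        });
    }

    // Helper that adapts the listener to a future so callers can wait on the result.
    static CompletableFuture<String> deployStatus(String modelId) {
        CompletableFuture<String> status = new CompletableFuture<>();
        nodeOperation(modelId, new Listener<String>() {
            @Override public void onResponse(String response) { status.complete(response); }
            @Override public void onFailure(Exception e) { status.completeExceptionally(e); }
        });
        return status;
    }
}
```

In this sketch, the return-value overload can never report more than the interim status for a remote model, while the listener overload reports the final status in the node response itself, so no separate result-forwarding step is needed.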

Related Issues

#15165

Check List

Functionality includes testing.
API changes companion pull request created, if applicable.
Public documentation issue/PR created, if applicable.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

github-actions · 2024-08-08T08:22:32Z

✅ Gradle check result for 11c214f: SUCCESS

codecov · 2024-08-08T08:23:59Z

Codecov Report

Attention: Patch coverage is 83.33333% with 3 lines in your changes missing coverage. Please review.

Project coverage is 72.13%. Comparing base (b67cdf4) to head (a11a8a1).

Files with missing lines	Patch %	Lines
...rch/action/support/nodes/TransportNodesAction.java	83.33%	3 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff              @@
##               main   #15166      +/-   ##
============================================
+ Coverage     72.03%   72.13%   +0.10%     
- Complexity    65230    65261      +31     
============================================
  Files          5318     5318              
  Lines        304051   304069      +18     
  Branches      43990    43991       +1     
============================================
+ Hits         219021   219355     +334     
+ Misses        67121    66747     -374     
- Partials      17909    17967      +58

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

github-actions · 2024-08-09T06:06:26Z

❌ Gradle check result for f17a5c0: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

github-actions · 2024-08-13T02:17:45Z

❌ Gradle check result for 3c68c3f: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

zane-neo · 2024-08-15T06:35:25Z

@peternied @gaobinlong @dblock Can you help review this PR?

opensearch-trigger-bot · 2024-09-14T15:21:37Z

This PR is stalled because it has been open for 30 days with no activity.

github-actions · 2024-09-29T08:14:51Z

❌ Gradle check result for cdd58e2: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

gaobinlong

@zane-neo Could you give some use cases about this change to help understanding the intention, thanks!

zane-neo · 2024-10-08T01:36:43Z

> @zane-neo Could you give some use cases about this change to help understanding the intention, thanks!

@gaobinlong Thanks, Binlong. I'll add more cases in the description and the issue.

github-actions · 2024-10-08T04:01:49Z

❌ Gradle check result for 2e6023c: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

github-actions · 2024-10-08T09:29:01Z

✅ Gradle check result for 2e6023c: SUCCESS

gaobinlong

LGTM.

gaobinlong · 2024-10-08T09:45:46Z

@rajiv-kv @shwetathareja could you also help to take a look at this PR, thanks!

github-actions · 2024-10-17T06:50:09Z

❕ Gradle check result for 067fc68: UNSTABLE

Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure.

Signed-off-by: zane-neo <zaniu@amazon.com>

github-actions · 2024-12-13T08:16:18Z

✅ Gradle check result for a11a8a1: SUCCESS

zane-neo marked this pull request as ready for review August 9, 2024 05:57

zane-neo requested review from anasalkouz, andrross, ashking94, Bukhtawar, CEHENKLE, dblock, dbwiddis, gbbafna, kotwanikunal, linuxpi, mch2, msfroh, nknize, owaiskazi19, reta, Rishikesh1159, sachinpkale, saratvemulapalli, shwetathareja, sohami and VachaShah as code owners August 9, 2024 05:57

zane-neo changed the title ~~[]Add listenable TransportRequestHandler in TransportNodesAction~~ Add listenable TransportRequestHandler in TransportNodesAction Aug 15, 2024

gaobinlong added the backport 2.x Backport to 2.x branch label Aug 15, 2024

opensearch-trigger-bot bot added the stalled Issues that have stalled label Sep 14, 2024

zane-neo force-pushed the add-listenable-transport-request-handler branch from 3c68c3f to 62c17d5 Compare September 29, 2024 07:03

zane-neo requested a review from jainankitk as a code owner September 29, 2024 07:03

This was referenced Sep 29, 2024

[AUTOCUT] Gradle Check Flaky Test Report for RemoteRestoreSnapshotIT #14324

Open

[AUTOCUT] Gradle Check Flaky Test Report for IndicesRequestCacheIT #14288

Open

opensearch-trigger-bot bot removed the stalled Issues that have stalled label Sep 29, 2024

gaobinlong reviewed Sep 30, 2024

opensearch-ci-bot mentioned this pull request Oct 1, 2024

[AUTOCUT] Gradle Check Flaky Test Report for RemoteSplitIndexIT #14296

Open

zane-neo force-pushed the add-listenable-transport-request-handler branch from cdd58e2 to 2e6023c Compare October 8, 2024 02:52

opensearch-ci-bot mentioned this pull request Oct 8, 2024

[AUTOCUT] Gradle Check Flaky Test Report for SearchOnlyReplicaIT #15812

Closed

gaobinlong approved these changes Oct 8, 2024

zane-neo force-pushed the add-listenable-transport-request-handler branch 3 times, most recently from c9a2500 to 067fc68 Compare October 17, 2024 05:46

opensearch-ci-bot mentioned this pull request Oct 16, 2024

[AUTOCUT] Gradle Check Flaky Test Report for SearchTimeoutIT #16056

Open

zane-neo added 2 commits December 13, 2024 15:06

Add listenable TransportRequestHandler in TransportNodesAction

3f1f5bd

Signed-off-by: zane-neo <zaniu@amazon.com>

Add UT for listenable transport request handler

a11a8a1

Signed-off-by: zane-neo <zaniu@amazon.com>

zane-neo force-pushed the add-listenable-transport-request-handler branch from 067fc68 to a11a8a1 Compare December 13, 2024 07:19
