[RelayMiner]: add `proxy.Ping(...)` capability to test connectivity between relay servers and backend URLs #1037

eddyzags · 2025-01-21T17:12:37Z

Summary

This PR adds the capability to test the connectivity between the Relay Servers and the Backend URLs in two ways.

Safeguard at Startup:
For every suppliers.[].service_config.backend_url referenced as input inside the Relay Miner Configuration file, the Relay Proxy will verify wether the network connection between the targeted backend_url and the relayerminer process is functioning properly. If one or more connections aren't possible, the relay miner won't be able to start.
Configurable Ping HTTP server:
The Relay Miner process will listen for incoming request to synchronously test the connectivity of every referenced suppliers.[].service_config.backend_url. If one or more backend URLs aren't reachable, the incoming request will fail.

Based on the serverConfig.ServerType (Example: HTTP), each Server Type will implement their own logic to implement to test the connectivity.

Issue

[RelayMiner] Add an ability to test RelayMiner configuration without application #447

Type of change

Select one or more:

Testing

Documentation changes (only if making doc changes)

make docusaurus_start; only needed if you make doc changes

Local Testing (only if making code changes)

Unit Tests: make go_develop_and_test
LocalNet E2E Tests: make test_e2e
See quickstart guide for instructions

PR Testing (only if making code changes)

DevNet E2E Tests: Add the devnet-test-e2e label to the PR.
- THIS IS VERY EXPENSIVE, so only do it after all the reviews are complete.
- Optionally run make trigger_ci if you want to re-trigger tests without any code changes
- If tests fail, try re-running failed tests only using the GitHub UI as shown here

Sanity Checklist

I have tested my changes using the available tooling
I have commented my code
I have performed a self-review of my own code; both comments & source code
I create and reference any new tickets, if applicable
I have left TODOs throughout the codebase, if applicable

Summary by CodeRabbit

New Features
- Introduced a new configuration section for the ping functionality, allowing users to test backend connectivity within the relay miner's setup.
- Added methods to handle ping requests, enhancing health check capabilities for relay servers.
Bug Fixes
- Improved error handling during the server startup process if any relay server is unreachable.
Tests
- Added tests for the new ping functionality to ensure operational integrity and reliability of the relay miner.

…ty between relay servers and backend URLs (#1) * relayer: add RelayServers() method to RelayProxy interface; Add Ping(), ServiceIDs(), Forward() method to RelayServer interface; add RelayServers slice with helper method byServiceID * relayer: add forward config entry * relayer: implement ServiceIDs, Forward, and Ping method for synchrounous RPC server * relayer: add RelayServers implementation for RelayProxy * relayer: add Ping and Forward options * relayer: integrate ping option * relayer: add ServePing and ServeForward method to RelayMiner * test proxy.Ping() in test + remove forward feature * add serve ping test * add doc

…s based on localnet config

…tfile

bryanchriswhite

Thanks for picking this back up @eddyzags! 🙌

I have to stop here for today but this is looking great so far! 🚀
The biggest thing I haven't reviewed yet is the test (but I already saw the addition of go-mockdns, and I skimmed the test names 😉) and am looking forward to it.

bryanchriswhite · 2025-01-24T10:45:57Z

Tiltfile

Was this change intentionally persisted, and if so, how is it related to this feature?

I think this change should be reverted. My assumption is that this is the result of an older commit which was never reconciled completely with main:

The yaml files referenced don't exist.

The flags seem to be specifying the same/similar config as what's been removed from the relayminer configs that do exist. 🤔

Sorry, I wasn't clear in my previous comments.

Was this change intentionally persisted, and if so, how is it related to this feature?

Yes, this change was intentionally made to ensure the Ping safeguard at startup succeeds for the Relayminer with the localnet default configuration, and/or any custom localnet configuration in that regard (link to localnet default configuration in the main branch). In the default localnet configuration, the Ollama Kubernetes deployment is not applied (ollama.enabled=false). However, the relayminer configuration still referenced Ollama suppliers in its configuration files, even though the container wasn’t deployed (link to relayminer-1 configuration for localnet). With the newly introduced mechanism of the Ping safeguard at startup, this will cause the relayminer to fail continuously because the Ollama container isn't deployed.

To solve this issue, I found a way to dynamically define the relayminer's configuration based on the localnet configuration by modifying the poktrolld/Tiltfile. Hence, those modifications.

For poktrolld users that are deploying a Relayminer without relying on the localnet, they will have to make sure that their config.suppliers[*].service_config.backend_url are up and running and reachable before deploying a Relayminer.

The yaml files referenced don't exist.

I disagree, they exists:

values-common.yaml is defined here

values-relayminer-common.yaml is defined here

values-relayminer- + str(actor_number) + ".yaml" is defined here, here and here

The flags seem to be specifying the same/similar config as what's been removed from the relayminer configs that do exist.

I cannot find that. Can you link me to the precise line in my fork that makes you think that please? 🙏🏾

Tiltfile

bryanchriswhite · 2025-01-24T10:48:37Z

localnet/kubernetes/values-relayminer-1.yaml

It seems to me that these relayminer configs (1-3) should be reverted.

see comment here #1037

docusaurus/docs/operate/configs/relayminer_config.md

pkg/relayer/proxy/synchronous.go

pkg/relayer/relayminer_test.go

bryanchriswhite · 2025-01-24T13:31:17Z

testutil/testproxy/relayerproxy.go

-				server.Handler = http.HandlerFunc(func(w http.ResponseWriter, _ *http.Request) {
-					sendJSONRPCResponse(test.t, w)
-				})
+				listener, err := net.Listen("tcp", supplierConfig.ServiceConfig.BackendUrl.Host)


Why separate the listener from the server?

By using a custom listener, and thereby decoupling the listener from the serve action, we ensure that the HTTP server is fully prepared to listen on a specific port in the test's main Go routine. This guarantees that the HTTP server(s) is ready before proceeding to the actual test cases.

Previously, listening and serving were handled within the Go routine using http.ListenAndServe function. This approach sometimes led to the HTTP server not being ready when the test cases began execution, resulting in test failures and flaky behavior.

Amazing! 👍 #PUC with that explanation, perhaps condensed, if possible.

eddyzags · 2025-01-27T15:04:36Z

Thanks for reviewing @bryanchriswhite ! Waiting for the rest of the review 🚀

eddyzags added 30 commits January 21, 2025 02:30

use dynamic slice in Ping error handling

95b2c48

relayer: add godoc to configuration yaml

79503c1

proxy: add comment explaining application logic

18be051

relayer: change Ping to PingAll in relayproxy interface

e226da8

proxy: cleanup unused code

49b5a63

relayer: add godoc explaining ping http serve method

e742079

relayer: remove blank line

843822b

localnet: add ping helpers for relayminers + logic to define supplier…

e31078f

…s based on localnet config

use errors.Join instead of appending errors slice

42704eb

change c to httpClient in sychronous in relay server

41550ab

add relayer miner suppplier not reachable error

cee1033

add newpinghandlerfn function

a10ce28

add comment relayminer test

34dff88

simplified newmockonetimerelayerproxywithping function

18e5536

revert Makefile and add local helpers

34a3553

add 204 no content for ping response

6ac6434

add comments to addr for ping config

063f93c

revert Makefile

40cb29e

fix typo

6d6c8cb

add statuscode assertion while testing ping server

255a45b

add localnet helpers to ping relayminer 1 2 3 + port exposition

8bfdab8

add more context to synchrounous rpc ping error

c5c37ad

change c to httpClient in relayminer tests

f68defb

add more context to transport override in relayminer tests

d4b66a5

add transport varialbe in relayminer tests

69c7a29

add 502 bad gateway as http code response for /ping

78f3fe3

add tcp listener to relayerminer pkg

954bb98

fix code registration for ErrRelayerProxySupplierNotReachable

f1f050f

add endpointURL variable

486189a

eddyzags added 7 commits January 21, 2025 02:30

improve godoc comments for pingconfig

d8b919e

fix serveping tests + refactor relayminer.ServePing function signature

af65206

refactor: dynamically set values for relayminer suppliers list in Til…

b99ec26

…tfile

add pingall test suite

d6173a6

add proxy different endpoints ping tests

987b2ec

stabilize flaky helpers in testproxy

ea1fdf6

add comments and refactor variable names

f8eda1f

eddyzags mentioned this pull request Jan 21, 2025

[RelayMiner]: add proxy.Ping(...) capability to test connectivity between relay servers and backend URLs #744

Closed

14 tasks

Olshansk requested review from Olshansk, bryanchriswhite and red-0ne January 21, 2025 23:05

Olshansk assigned eddyzags Jan 21, 2025

Olshansk added tooling Tooling - CLI, scripts, helpers, off-chain, etc... community A ticket intended to potentially be picked up by a community member nice-to-have Not-important and not-urgent labels Jan 22, 2025

Olshansk added this to the Beta TestNet Iteration milestone Jan 22, 2025

Merge branch 'main' into main

6c9f71a

bryanchriswhite requested changes Jan 24, 2025

View reviewed changes

eddyzags added 6 commits January 27, 2025 14:07

fix typo in Tiltfile

5d0f020

fix tyop

09967f0

categorized as errors any HTTP status code higher or equal to 400

1f9325f

improve error handling while serving http request for ping server

446fcd1

distinguish 502 from 503 errors in ping handler for error handling

6cf947b

rely on testing temp dir for testing files

6386078

eddyzags force-pushed the main branch from bedf1a1 to 6386078 Compare January 27, 2025 14:24

improve relayminer configuration documentation

8529cbc

eddyzags requested a review from bryanchriswhite January 27, 2025 15:04

eddyzags added 2 commits January 28, 2025 05:24

improve error message for ping request

10e7f12

minor fix

6827b46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RelayMiner]: add `proxy.Ping(...)` capability to test connectivity between relay servers and backend URLs #1037

[RelayMiner]: add `proxy.Ping(...)` capability to test connectivity between relay servers and backend URLs #1037

eddyzags commented Jan 21, 2025

bryanchriswhite left a comment

bryanchriswhite Jan 24, 2025 •

edited

Loading

eddyzags Jan 27, 2025

bryanchriswhite Jan 24, 2025

eddyzags Jan 27, 2025

bryanchriswhite Jan 24, 2025

eddyzags Jan 27, 2025

bryanchriswhite Jan 28, 2025 •

edited

Loading

eddyzags commented Jan 27, 2025

[RelayMiner]: add proxy.Ping(...) capability to test connectivity between relay servers and backend URLs #1037

Are you sure you want to change the base?

[RelayMiner]: add proxy.Ping(...) capability to test connectivity between relay servers and backend URLs #1037

Conversation

eddyzags commented Jan 21, 2025

Summary

Issue

Type of change

Testing

Sanity Checklist

Summary by CodeRabbit

Summary by CodeRabbit

bryanchriswhite left a comment

Choose a reason for hiding this comment

bryanchriswhite Jan 24, 2025 • edited Loading

Choose a reason for hiding this comment

eddyzags Jan 27, 2025

Choose a reason for hiding this comment

bryanchriswhite Jan 24, 2025

Choose a reason for hiding this comment

eddyzags Jan 27, 2025

Choose a reason for hiding this comment

bryanchriswhite Jan 24, 2025

Choose a reason for hiding this comment

eddyzags Jan 27, 2025

Choose a reason for hiding this comment

bryanchriswhite Jan 28, 2025 • edited Loading

Choose a reason for hiding this comment

eddyzags commented Jan 27, 2025

[RelayMiner]: add `proxy.Ping(...)` capability to test connectivity between relay servers and backend URLs #1037

[RelayMiner]: add `proxy.Ping(...)` capability to test connectivity between relay servers and backend URLs #1037

bryanchriswhite Jan 24, 2025 •

edited

Loading

bryanchriswhite Jan 28, 2025 •

edited

Loading