Project Specific Test Probes #3009

lfield · 2019-02-07T13:19:14Z

What I would like to do be able to do is to write a test probe that can be run to detect the sanity of the host before a task is downloaded. For example, testing that VritualBox is available. If this probe fails, no task will be downloaded. These probes would be project specific. It is to avoid situations where tasks fail due to a known and detectable issues with the host that are application specific . I would be happy to provide a PR is we can agree on the solution.

davidpanderson · 2019-02-07T22:11:27Z

Another approach would be to have the validator check for specific error messages or exit codes, and update the host record (or some new table) in a way that prevents more jobs of that type from being sent to the host.

lfield · 2019-02-08T09:22:15Z

Is the validator run against results that returned an error?

davidpanderson · 2019-02-09T02:37:01Z

No; we'd have to add an option.

The other question is where to store the info.
The natural place is the host_app_version table.
This has a field max_jobs_per_day that could be used;
e.g. -1 means don't use this app version for this host.
I'd have to make some small changes for this.

lfield · 2019-02-14T10:34:51Z

Yes, reducing this to one per day should be fine. Related to this is that I am using the wrapper and my script returns 206 if there is a problem. However, it looks like this is not being used and the wrapper is returning 195 EXIT_CHILD_FAILED. Another feature would be to reset the max_jobs_per_day for a specific host. This means that a miss configured node could be removed for a day but if a person is trying to fix the problem they can still get new tasks when they need to test.

So yes having something where we can parse the stderr for error messages and set the max_jobs_per_day to 1 would be sufficient.

This was referenced Mar 15, 2019

Server: add "punitive validation" mechanism #3024

Merged

Server/scheduler: Current default code doesn't inhibit work supply to faulty hosts #3061

Open

lfield closed this as completed Apr 5, 2019

AenBleidd added this to the Server Release 1.2.0 milestone Aug 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Project Specific Test Probes #3009

Project Specific Test Probes #3009

lfield commented Feb 7, 2019

davidpanderson commented Feb 7, 2019

lfield commented Feb 8, 2019

davidpanderson commented Feb 9, 2019

lfield commented Feb 14, 2019

Project Specific Test Probes #3009

Project Specific Test Probes #3009

Comments

lfield commented Feb 7, 2019

davidpanderson commented Feb 7, 2019

lfield commented Feb 8, 2019

davidpanderson commented Feb 9, 2019

lfield commented Feb 14, 2019