Health endpoint improvement #652

davidgengenbach · 2019-12-05T13:42:09Z

Is your feature request related to a problem? Please describe.
We had a problem with a killed plugin process (due to OOM) which resulted in non-executing jobs.

Describe the solution you'd like
The health endpoint might be used to check whether all plugin processes are up and running.
In general, more health checks would be helpful, e.g. cluster health?

The endpoint could return a non-200 status code when the instance is not healthy!

vcastellm · 2019-12-10T22:39:10Z

Already on the roadmap, will work on this.

vcastellm · 2019-12-10T23:11:23Z

@davidgengenbach not really the improvement you mention but I think it's better to fail fast in case of a missing plugin. In case of using as a service the OS supervisor will take care of restarting. This is the case with processor plugins.

davidgengenbach · 2019-12-10T23:45:30Z

@Victorcoder Yes, I would agree. Having non-functional (= killed) plugins should really result in a killed main-process.

The linked PR will only kill the main process when a job has been executed unsuccessfully (e.g. by a plugin error) which may be far later than the actual plugin process exiting - a periodic health check could circumvent this.

vcastellm added the enhancement label Dec 10, 2019

vcastellm mentioned this issue Dec 10, 2019

refactor: Die on plugin communication error #658

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Health endpoint improvement #652

Health endpoint improvement #652

davidgengenbach commented Dec 5, 2019 •

edited

Loading

vcastellm commented Dec 10, 2019

vcastellm commented Dec 10, 2019

davidgengenbach commented Dec 10, 2019

Health endpoint improvement #652

Health endpoint improvement #652

Comments

davidgengenbach commented Dec 5, 2019 • edited Loading

vcastellm commented Dec 10, 2019

vcastellm commented Dec 10, 2019

davidgengenbach commented Dec 10, 2019

davidgengenbach commented Dec 5, 2019 •

edited

Loading