This repository has been archived by the owner on Nov 8, 2022. It is now read-only.
Currently, if a plugin reports an error to the framework, the tasks using that plugin will eventually be disabled after 10 consecutive plugin errors. This can lead, for example in the case of intermittent network issues, to tasks being disabled permanently. To start the tasks again, the plugin must be unloaded and loaded again, and the tasks restarted. Possible options to consider:
Make the task-failing mechanism configurable:
allow this mechanism to be disabled, so that a failing plugin never disables a task
allow a configurable number of retries
Change the default behavior (counting errors at each interval) to introduce something like an exponential back-off algorithm
Update the documentation to make plugin authors aware of the cases in which a plugin reporting an error to the framework may lead to a task becoming disabled
Distinguish critical plugin errors, which should lead to the task being disabled, from less severe errors, which should start the exponential back-off algorithm
Allow the task flow designer to decide what to do in case of plugin errors
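To make the back-off and error-severity options concrete, here is a minimal sketch in Python. It is not the framework's implementation: `TaskState`, `record_result`, `next_delay`, and the cap `MAX_BACKOFF_FACTOR` are hypothetical names chosen for illustration, and the doubling rule is only one possible back-off policy.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical names throughout; none of these come from the framework.
MAX_BACKOFF_FACTOR = 64  # assumed cap on how far collection is slowed down


@dataclass
class TaskState:
    interval_s: float            # interval from the task definition
    consecutive_errors: int = 0
    disabled: bool = False


def next_delay(state: TaskState) -> float:
    """Delay before the next run: doubles per consecutive error, up to a cap."""
    factor = min(2 ** state.consecutive_errors, MAX_BACKOFF_FACTOR)
    return state.interval_s * factor


def record_result(state: TaskState, error: Optional[str],
                  critical: bool = False) -> Optional[float]:
    """Update the task after one plugin call.

    Returns the delay until the next run, or None if the task was disabled.
    """
    if error is None:
        state.consecutive_errors = 0     # success resets the back-off
        return state.interval_s
    if critical:
        state.disabled = True            # critical errors disable the task at once
        return None
    state.consecutive_errors += 1        # transient error: slow down, stay alive
    return next_delay(state)
```

With a 1-second interval, two transient errors in a row would push the next run out to 2 and then 4 seconds, a success would restore the 1-second interval, and a critical error would disable the task immediately.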
@andrzej-k: In general terms, how do you see something like an exponential back-off algorithm working in this situation? Are you implying that a plugin returning errors could result in adjusting the rate/interval of the task(s)?
@jcooklin: Possibly (to reduce resource utilization and to avoid flooding the user with errors), but at the very least the user should be able to decide whether a failing plugin should result in the task being disabled. This could be configurable in the task manifest (per plugin config), for example "task_failing_strategy":
simple (default) - as currently, stop the task after x number of consecutive errors
back-off - invoke the plugin gradually less often, but keep the task alive
ignore errors - keep the task alive despite errors
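The three proposed strategies could be dispatched roughly as follows. This is only a sketch under stated assumptions: the `FailingStrategy` enum, the `plan_next_run` function, and the `max_errors` and `max_interval_s` parameters are hypothetical, and only the strategy names and the default of 10 consecutive errors come from the discussion above.

```python
from enum import Enum
from typing import Tuple


class FailingStrategy(Enum):
    # Values mirror the proposed "task_failing_strategy" options.
    SIMPLE = "simple"
    BACK_OFF = "back-off"
    IGNORE_ERRORS = "ignore errors"


def plan_next_run(strategy: FailingStrategy, interval_s: float,
                  consecutive_errors: int, max_errors: int = 10,
                  max_interval_s: float = 3600.0) -> Tuple[bool, float]:
    """Return (keep_running, seconds_until_next_run) for a task.

    max_errors and max_interval_s are assumed, illustrative parameters.
    """
    if strategy is FailingStrategy.SIMPLE:
        # Current behavior: stop after max_errors consecutive errors.
        return consecutive_errors < max_errors, interval_s
    if strategy is FailingStrategy.BACK_OFF:
        # Invoke the plugin less and less often, but never stop the task.
        return True, min(interval_s * 2 ** consecutive_errors, max_interval_s)
    # IGNORE_ERRORS: keep the task alive at its configured interval.
    return True, interval_s
```

Note that the back-off branch leaves the configured interval untouched and only computes a temporary delay, so the task definition itself would not need to change.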
Tolerable downtime per plugin as a configurable item could be another option.
I believe the simple case has been covered by #1127. I think adding a way to ignore errors would be nice. I'm not sure how I feel about the back-off since that seems like it would be modifying the interval (which is part of the task definition).