Why is success not a criterion in the reward function? #508

LeroyGitBash · 2024-09-13T13:54:58Z

I'm trying to understand why the authors didn't include the success criterion in the reward function.

Consider the following scenario: I trigger termination as soon as success is reached. In this case, every agent would be incentivised not to reach the success point, since doing so ends the episode and cuts off all further reward, reducing the overall return.

This makes sense: if staying close to the success point without entering it (to avoid termination) yields a higher return, agents will prefer to hover nearby rather than complete the task.
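To make the trade-off concrete, here is a rough bound on how large a terminal success bonus would need to be. It is a sketch under assumptions that are not from the environments themselves: per-step rewards lie in $[0, r_{\max}]$, returns are discounted by $\gamma \in [0, 1)$, and a hypothetical bonus $B$ is paid on the step $t$ where success terminates the episode.

```latex
% Return from hovering near the goal forever (upper bound):
G_{\text{hover}} \le \sum_{k=0}^{\infty} \gamma^{k} r_{\max} = \frac{r_{\max}}{1-\gamma}

% Return from finishing at step t, measured from step t:
G_{\text{finish}} = r_t + B \ge B

% A sufficient condition for finishing to be at least as good as hovering:
B \ge \frac{r_{\max}}{1-\gamma} \;\Longrightarrow\; G_{\text{finish}} \ge G_{\text{hover}}
```

For example, with illustrative values $r_{\max} = 10$ and $\gamma = 0.99$, any $B \ge 1000$ satisfies the condition. The bound is loose for finite horizons, so a smaller bonus may be enough in practice.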

Does anyone have a good suggestion for how I can make reaching the success point more rewarding in all 50 environments without having to change each one manually?

reginald-mclean · 2024-09-13T18:28:20Z

Maintainer

It is included in the reward function for certain environments. One example is the assembly environment.
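For environments where success is not part of the reward, one option that avoids editing each environment is a thin wrapper that adds a bonus when success is reported. The following is a minimal sketch, not the library's own mechanism. It assumes a Gymnasium-style `step` that returns `(obs, reward, terminated, truncated, info)` and an `info["success"]` flag. `SuccessBonusWrapper`, `ToyEnv`, `rollout`, and the `bonus` value are hypothetical names and numbers chosen for illustration.

```python
class SuccessBonusWrapper:
    """Adds a one-time bonus on the first step where info["success"] is truthy.

    Assumes the wrapped env follows the 5-tuple step API and reports
    success in the info dict (an assumption; check your environments).
    """

    def __init__(self, env, bonus=100.0, terminate_on_success=True):
        self.env = env
        self.bonus = bonus  # hypothetical value; tune for your reward scale
        self.terminate_on_success = terminate_on_success
        self._bonus_paid = False

    def reset(self, **kwargs):
        self._bonus_paid = False  # allow the bonus again in the new episode
        return self.env.reset(**kwargs)

    def step(self, action):
        obs, reward, terminated, truncated, info = self.env.step(action)
        success = bool(info.get("success", 0.0))
        if success and not self._bonus_paid:
            reward += self.bonus  # pay the bonus once per episode
            self._bonus_paid = True
        if success and self.terminate_on_success:
            terminated = True  # end the episode as soon as the task is solved
        return obs, reward, terminated, truncated, info


class ToyEnv:
    """Hypothetical stand-in env: reward 1.0 per step, success from step 3 on."""

    def __init__(self):
        self.t = 0

    def reset(self, **kwargs):
        self.t = 0
        return 0.0, {}

    def step(self, action):
        self.t += 1
        info = {"success": float(self.t >= 3)}
        return float(self.t), 1.0, False, self.t >= 10, info


def rollout(env, max_steps=10):
    """Runs one episode with a dummy action and returns the per-step rewards."""
    env.reset()
    rewards = []
    for _ in range(max_steps):
        _, reward, terminated, truncated, _ = env.step(None)
        rewards.append(reward)
        if terminated or truncated:
            break
    return rewards


if __name__ == "__main__":
    print(rollout(SuccessBonusWrapper(ToyEnv(), bonus=100.0)))
```

Because the wrapper only touches the value returned by `step`, the same class can be applied to every environment in a loop. In a Gymnasium code base the same logic could live in a `gymnasium.Wrapper` subclass so that attributes such as the action and observation spaces are forwarded automatically.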
