-
Notifications
You must be signed in to change notification settings - Fork 505
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[update to #992] Fix and refactor MoveObjectReward #1010
[update to #992] Fix and refactor MoveObjectReward #1010
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've tested this and it works as expected for me.
if place_success and not is_holding_obj: | ||
self._metric += self._config.single_rearrange_reward |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@SergioArnaud Is it possible to double count the place reward here if the task does not end after placing? I would add a if self._cur_rearrange_stage < self.num_targets
to make sure of it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@vincentpierre I think that case is considered in this condition. If all the objects are placed in the right place but for some reason we haven't succeded in the task (for example the agent hasn't called terminate) we give zero reward.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh, nice. sorry I missed it.
…acebookresearch#1010) * Fix and refactor MoveObjectReward * Fix and refactor MoveObjectReward - format * Fix and refactor MoveObjectReward - fix typo Co-authored-by: SergioArnaud <sercharnaud@gmail.com> Co-authored-by: vincentpierre <vincentpierre@users.noreply.github.com>
…acebookresearch#1010) * Fix and refactor MoveObjectReward * Fix and refactor MoveObjectReward - format * Fix and refactor MoveObjectReward - fix typo Co-authored-by: SergioArnaud <sercharnaud@gmail.com> Co-authored-by: vincentpierre <vincentpierre@users.noreply.github.com>
Motivation and Context
How Has This Been Tested
Types of changes
Checklist