Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

made coin game compatible with iql_rnn #107

Merged
merged 2 commits into from
Sep 9, 2024

Conversation

Dronie
Copy link
Contributor

@Dronie Dronie commented Jul 25, 2024

Slightly changed the way observations, rewards, dones and infos are processed/returned to make coin game compatible with iql_rnn (and potentially others but this is the only algorithm I tested).
Also added line 352 which allows for shared rewards (each agent's reward becomes the sum of their individual rewards) but not yet integrated in a nice way.

See plots for training results (using same configs as in ql_rnn_mpe):
image

Also see gifs of trained policies in action (10 episodes each):
Default (individual) rewards:
individual_rewards

Shared rewards:
shared_rewards

NB: have not done unit tests as this is only a minor change to an existing environment - let me know if this is an issue!

@amacrutherford
Copy link
Collaborator

hey thanks for opening this :) we're busy with neurips rebuttals atm but will come back to this after!

@amacrutherford
Copy link
Collaborator

amacrutherford commented Sep 4, 2024

Hey! just took a look and it looks good but could you add a flag for shared vs individual rewards? @Dronie

@amacrutherford amacrutherford self-requested a review September 4, 2024 11:17
@Dronie
Copy link
Contributor Author

Dronie commented Sep 4, 2024

Hey! just took a look and it looks good but could you add a flag for shared vs individual rewards? @Dronie

Done :)

@amacrutherford amacrutherford merged commit 699e9a0 into FLAIROx:main Sep 9, 2024
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants