made coin game compatible with iql_rnn #107

Dronie · 2024-07-25T13:52:58Z

Slightly changed the way observations, rewards, dones and infos are processed/returned to make coin game compatible with iql_rnn (and potentially others but this is the only algorithm I tested).
Also added line 352 which allows for shared rewards (each agent's reward becomes the sum of their individual rewards) but not yet integrated in a nice way.

See plots for training results (using same configs as in ql_rnn_mpe):

Also see gifs of trained policies in action (10 episodes each):
Default (individual) rewards:

Shared rewards:

NB: have not done unit tests as this is only a minor change to an existing environment - let me know if this is an issue!

amacrutherford · 2024-08-02T22:49:21Z

hey thanks for opening this :) we're busy with neurips rebuttals atm but will come back to this after!

amacrutherford · 2024-09-04T11:17:08Z

Hey! just took a look and it looks good but could you add a flag for shared vs individual rewards? @Dronie

Dronie · 2024-09-04T14:26:10Z

Hey! just took a look and it looks good but could you add a flag for shared vs individual rewards? @Dronie

Done :)

made coin game compatible with iql_rnn

ce57ce0

amacrutherford self-requested a review September 4, 2024 11:17

added flag for shared reward

8f3f033

amacrutherford approved these changes Sep 9, 2024

View reviewed changes

amacrutherford merged commit 699e9a0 into FLAIROx:main Sep 9, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

made coin game compatible with iql_rnn #107

made coin game compatible with iql_rnn #107

Dronie commented Jul 25, 2024 •

edited

Loading

amacrutherford commented Aug 2, 2024

amacrutherford commented Sep 4, 2024 •

edited

Loading

Dronie commented Sep 4, 2024 •

edited

Loading

made coin game compatible with iql_rnn #107

made coin game compatible with iql_rnn #107

Conversation

Dronie commented Jul 25, 2024 • edited Loading

amacrutherford commented Aug 2, 2024

amacrutherford commented Sep 4, 2024 • edited Loading

Dronie commented Sep 4, 2024 • edited Loading

Dronie commented Jul 25, 2024 •

edited

Loading

amacrutherford commented Sep 4, 2024 •

edited

Loading

Dronie commented Sep 4, 2024 •

edited

Loading