
[Feature Request] ACME Integration #60

Closed
5 tasks
Trinkle23897 opened this issue Feb 17, 2022 · 7 comments · Fixed by #157
Assignees
Labels
enhancement New feature or request

Comments

@Trinkle23897
Collaborator

Trinkle23897 commented Feb 17, 2022

https://github.com/deepmind/acme

Road Map:

@TianyiSun316

  • Go through ACME codebase and integrate vector_env to the available algorithms;
  • Write Atari examples;
  • Check Atari performance: Pong and Breakout;
  • Submit PR;
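The vector_env integration in the first item amounts to making the algorithms consume a batched step interface, where one call advances many environments at once. A minimal sketch of that interface, using a hypothetical MockVectorEnv stand-in rather than EnvPool's real bindings:

```python
# Hypothetical sketch of the batched (vectorized) environment interface
# that EnvPool exposes and the ACME algorithms would need to consume.
# MockVectorEnv is a stand-in for illustration, not part of EnvPool.

class MockVectorEnv:
    """Mimics a vectorized env: every call handles num_envs envs at once."""

    def __init__(self, num_envs):
        self.num_envs = num_envs
        self._steps = [0] * num_envs

    def reset(self):
        # Returns one observation per sub-environment.
        self._steps = [0] * self.num_envs
        return [0.0] * self.num_envs

    def step(self, actions):
        # Takes a batch of actions, returns batched obs/rewards/dones.
        assert len(actions) == self.num_envs
        self._steps = [s + 1 for s in self._steps]
        obs = [float(s) for s in self._steps]
        rewards = [1.0] * self.num_envs
        dones = [s >= 10 for s in self._steps]
        return obs, rewards, dones, {}


env = MockVectorEnv(num_envs=4)
obs = env.reset()                              # batch of 4 observations
obs, rewards, dones, info = env.step([0, 1, 0, 1])
```

The key design point is that the loop never touches a single environment: observations, actions, rewards, and done flags are all length-num_envs batches.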

@LeoGuo98

  • Run some sample-efficiency experiments (you can try different libraries, either ACME, tianshou, or sb3; this doesn't depend on the previous item)

Resources:

tianshou: #51
stable-baselines3: #39
cleanrl: #48 #53

cc @zhongwen

@Trinkle23897 Trinkle23897 added the enhancement New feature or request label Apr 27, 2022
@zhongwen

@TianyiSun316
Collaborator

Use Acme JAX agents instead of the TF agents.

References: https://github.com/deepmind/acme/tree/master/acme/agents/jax

Example scripts; you can just replace the agent in them with a DQN agent.

https://github.com/deepmind/acme/blob/master/examples/atari/run_impala.py

https://github.com/deepmind/acme/blob/master/examples/atari/run_r2d2.py

I tried, but it seems the JAX DQN network is not usable. The current version is based on TensorFlow. Maybe we can discuss this offline?

@zhongwen

zhongwen commented May 9, 2022

Sure!

@zhongwen

Let's use the R2D2 example first, since Tianyi has had significant difficulties and has spent little time understanding the acme codebase.

@zhongwen

Steps:

  • Implement a new EnvironmentLoop to integrate with EnvPool;
  • Implement a new adder to integrate with batched inputs;
  • Integrate with the R2D2 example;
  • Run experiments.
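The first two steps above can be sketched roughly as follows. BatchedEnvironmentLoop, ListAdder, and ConstantEnv are illustrative names for this sketch, not real ACME classes; in ACME the adder would wrap Reverb and the loop would mirror acme's EnvironmentLoop.

```python
# Hypothetical sketch of steps 1-2: an environment loop that drives a
# batched (EnvPool-style) environment and forwards whole batches of
# transitions to an adder.

class ListAdder:
    """Toy adder: buffers batched transitions in memory."""

    def __init__(self):
        self.items = []

    def add(self, obs, actions, rewards, dones):
        self.items.append((obs, actions, rewards, dones))


class BatchedEnvironmentLoop:
    """Toy loop: one step() call advances all sub-environments at once."""

    def __init__(self, env, policy, adder):
        self._env = env        # batched env with reset()/step()
        self._policy = policy  # maps an observation batch to an action batch
        self._adder = adder

    def run(self, num_steps):
        obs = self._env.reset()
        total_reward = 0.0
        for _ in range(num_steps):
            actions = self._policy(obs)
            obs, rewards, dones, _ = self._env.step(actions)
            self._adder.add(obs, actions, rewards, dones)
            total_reward += sum(rewards)
        return total_reward


class ConstantEnv:
    """Stand-in batched env: every step yields reward 1 per sub-env."""

    def __init__(self, num_envs):
        self.num_envs = num_envs

    def reset(self):
        return [0.0] * self.num_envs

    def step(self, actions):
        obs = [0.0] * self.num_envs
        return obs, [1.0] * self.num_envs, [False] * self.num_envs, {}


adder = ListAdder()
loop = BatchedEnvironmentLoop(ConstantEnv(4), lambda obs: [0] * len(obs), adder)
total = loop.run(num_steps=5)  # 4 envs * 5 steps * reward 1.0 = 20.0
```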

@zhongwen

How is the progress going? You have only one day left for the task.

@TianyiSun316
Collaborator

Let's use the R2D2 example first, since Tianyi has had significant difficulties and has spent little time understanding the acme codebase.

I was eager to hear your solution for writing the example with JAX DQN when we discussed offline, and there are several things I forgot to mention during that discussion:

  1. I have already finished step 1 in your "Steps"; you will find it in my PR Add acme JAX R2D2 example #104.
  2. Using JAX DQN is more complex than the example scripts you mentioned above suggest. There is no pre-written DQN network that we can use directly; we may need to write functions like make_atari_networks to build a usable DQN network, while the example scripts simply call an API that has already been implemented.
  3. Implementing R2D2 is also more complex than the example scripts, since the R2D2 agent expects the observation to use the OAR struct. Our EnvWrapper needs to adapt to it accordingly.
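The OAR struct mentioned in point 3 bundles the current observation with the previous action and reward, which R2D2's recurrent network consumes at every step. A minimal local sketch of the packing: the real struct is defined in acme.wrappers.observation_action_reward, while the OARWrapper class and its env interface here are hypothetical.

```python
import collections

# Local stand-in for ACME's OAR struct (the real one lives in
# acme.wrappers.observation_action_reward). R2D2 expects each
# observation bundled with the previous action and reward.
OAR = collections.namedtuple("OAR", ["observation", "action", "reward"])


class OARWrapper:
    """Hypothetical env wrapper that packs observations into OAR structs."""

    def __init__(self, env, initial_action=0, initial_reward=0.0):
        self._env = env
        self._initial_action = initial_action
        self._initial_reward = initial_reward

    def reset(self):
        obs = self._env.reset()
        # No previous action/reward at episode start, so use zeros.
        return OAR(observation=obs, action=self._initial_action,
                   reward=self._initial_reward)

    def step(self, action):
        obs, reward, done, info = self._env.step(action)
        # Pair the *new* observation with the action/reward that produced it.
        return OAR(observation=obs, action=action, reward=reward), done, info


class EchoEnv:
    """Toy base env: observation echoes the last action taken."""

    def reset(self):
        return [0.0]

    def step(self, action):
        return [float(action)], 1.0, False, {}


wrapped = OARWrapper(EchoEnv())
first = wrapped.reset()        # OAR with zero action/reward
oar, done, info = wrapped.step(3)
```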

I have finished the example using JAX R2D2 agents. Please review the code and let me know if there are any modifications needed.

@Trinkle23897 Trinkle23897 linked a pull request Jun 22, 2022 that will close this issue