Skip to content

Commit 00aeff3

Browse files
authoredApr 20, 2018
Update README.md
1 parent 5366f0c commit 00aeff3

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed
 

‎README.md

+2-2
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# scalable_maddpg
2-
scalable multi agent reinforcement learning. Details can be found in the report [Report](scalable-multi-agent.pdf)
2+
scalable multi agent reinforcement learning. Details can be found in the [Report](scalable-multi-agent.pdf)
33

44
## to do list
55
- [x] tune L2, does LSTM parameters need L2 regulizer?
@@ -12,7 +12,7 @@ scalable multi agent reinforcement learning. Details can be found in the report
1212
- [ ] add another network for prey
1313
- [ ] add summary for rewards of each episode
1414
## Alternative to Gym
15-
An alternative of Gym environment is created (env.py). The rendering implementation of the envrionment is matplot. So, it would be much easier to use.
15+
An alternative of Gym environment is created (env.py). The rendering implementation of the envrionment is matplot. So, it would be much easier to use. However, you need to implement the prey policy by yourself.
1616
## Results
1717
Here, we have done two independent runs. In each run, from episode 1 to episodes 3x10^4, three agents were in the game. At episode 3x10^4, we added three more agents into this game. Here we show the mean Q value of all the agents in our experiments.
1818

0 commit comments

Comments
 (0)