Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
mdeib authored Aug 3, 2020
1 parent 5a20c55 commit 47da611
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion hw3/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -101,7 +101,7 @@ Just like in Vanilla Policy Gradient the larger batch size (in dark blue) does a
The actor-critic algorithm is then applied to the half-cheetah environment:

```commandline
$ python cs285/scripts/run_hw3_actor_critic.py --env_name HalfCheetah-v2 --ep_len 150 --discount 0.90 --n 150 -l 2 -s 32 -b 30000 -eb 1500 -lr 0.02 --exp_name 10_10 -ntu 10 -ngsptu 10
$ python cs285/scripts/run_hw3_actor_critic.py --env_name HalfCheetah-v2 --ep_len 150 --discount 0.90 -n 150 -l 2 -s 32 -b 30000 -eb 1500 -lr 0.02 --exp_name 10_10 -ntu 10 -ngsptu 10
```

A run of VPG with the same hyper-parameters (where applicable) from HW2 is included in order to compare the two methods. The chart is below, with VPG in orange and AC in blue:
Expand Down

0 comments on commit 47da611

Please sign in to comment.