-
Notifications
You must be signed in to change notification settings - Fork 357
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Learning from scratch without using pre-trained model #15
Comments
Hey, have you tried training it multiple times? or did you change the hyper-parameters? |
I am running the master branch test.py and PPO.py (I hope all the recent changes are pushed in these), and I ran the code as it is, Just commented on the "load_state_dict" line in the code, with no changes in hyperparameters. These are some of the rewards I am getting. Episode: 14994 Reward: -51 Also, I am using this version of code with a small grid world environment, and it does not seem to learn at all there as well. |
Ahh...I see, The Also, I ran some tests now on the Lunar Lander env and it seems to train just fine. |
Ohh, my bad. |
I tried running test.py (PPO.py) from scratch on LunarLander-v2 Environment, without using the pre-trained model, but it does not seem to learn till 15000episodes. The episodic returns are negative even after 15000 episodes. How many episodes did it take to get the trained model?
The text was updated successfully, but these errors were encountered: