Reinforce Tianshou

My implementation of REINFORCE RL algorithm with the help of tianshou reinforcement learning framework. This builds on top of my implementation using only PyTorch here: https://github.com/drozzy/reinforce

The code trains an agent to solve a CartPole-v0 environment and then renders a few episodes with a trained agent.

Video Explanation

The reinforce_tianshou.py is the complete example.
The intermediate_steps/reinforce_tianshou_no_trainer.py shows how things would look without a trainer.
The intermediate_steps/reinforce_tianshou_no_trainer_no_net.pyshows things without a trainer and a built-in network.
The intermediate_steps/reinforce_tianshou_no_net.py shows how to create a custom network and a custom policy, while using built-in trainer.
The slides_code/policy_component.py - shows an example of calling a built-in policy on an observation from CartPole environment.

conda env create -f environment.yml
conda activate reinforce_tianshou
pip install -r requirements.txt

python reinforce_tianshou.py

--Andriy Drozdyuk

Name	Name	Last commit message	Last commit date
Latest commit drozzy Update README.md Jul 30, 2022 1f7752a · Jul 30, 2022 History 13 Commits
intermediate_steps	intermediate_steps	save fix to optim	Nov 17, 2021
slides_code	slides_code	added policy slides	Nov 17, 2021
.gitignore	.gitignore	Save	Jul 11, 2021
README.md	README.md	Update README.md	Jul 30, 2022
environment.yml	environment.yml	Save	Jul 11, 2021
reinforce_tianshou.py	reinforce_tianshou.py	save fix to optim	Nov 17, 2021
requirements.txt	requirements.txt	Save	Jul 11, 2021