📄 Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation

The official repository of our [Paper] at ICML 2024.

COREP primarily employs a dual-GAT structure with a guided updating mechanism to learn a stable graph representation for states, termed as causal origin representation. By leveraging this representation, the learned policy exhibits resilience to non-stationarity. The overall framework of COREP is illustrated in the above figure.

Install

Clone this repository and navigate to LLaVA folder

git clone https://github.com/PKU-RL/COREP.git
cd COREP

Install Package

conda create -n COREP python=3.10 -y
conda activate COREP
pip install --upgrade pip
pip install -r requirements.txt

Install additional packages. We use the dmc2gym library to convert the deepmind control environment into an openai gym interface.

pip install -e dmc2gym

Running experiments

To evaluate COREP on the deepmind control environment, run

python main.py --env-type <env_name>

which will use hyperparameters from config/args_<env_name>.py.

To reproduce the results in the paper, run the following commands:

python main.py --env-type cartpole_swingup
python main.py --env-type reacher_easy
python main.py --env-type reacher_hard
python main.py --env-type cup_catch
python main.py --env-type cheetah_run
python main.py --env-type hopper_stand
python main.py --env-type swimmer_swimmer6
python main.py --env-type swimmer_swimmer15
python main.py --env-type finger_spin
python main.py --env-type walker_walk
python main.py --env-type fish_upright
python main.py --env-type quadruped_walk

Results

The results will be saved at ./logs, to view the results on tensorboard run

tensorboard --logdir ./logs

Acknowledgements

Parts of the code are based on the VariBAD repository, which we have modified to implement COREP.

Citation

If you find our work useful in your research and would like to cite our project, please use the following citation:

@InProceedings{zhang2024tackling,
  title={Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation},
  author={Zhang, Wanpeng and Li, Yilin and Yang, Boyu and Lu, Zongqing},
  booktitle={Proceedings of the 41st International Conference on Machine Learning},
  pages={59264--59288},
  year={2024},
  volume={235},
  publisher={PMLR}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
algorithms		algorithms
config		config
dmc2gym		dmc2gym
environments		environments
imgs		imgs
models		models
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
graph.py		graph.py
learner.py		learner.py
main.py		main.py
requirements.txt		requirements.txt
vae.py		vae.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📄 Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation

Install

Running experiments

Results

Acknowledgements

Citation

About

Releases

Packages

Languages

License

PKU-RL/COREP

Folders and files

Latest commit

History

Repository files navigation

📄 Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation

Install

Running experiments

Results

Acknowledgements

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages