RL Gear

This project makes setting up new research projects with ray a bit more turn-key.

Installation

rlgear is designed to work with ray version 0.8.7 or later and has been tested with python 3.6.

First install tensorflow and pytorch. cpu-only tensorflow is fine since rlgear doesn't use any neural network operations from tensorflow.

Second, you may need to install opencv since ray/rllib/env/atari_wrappers.py requires the cv2 package but opencv is not installed automatically with ray. It can be installed on Ubuntu with apt install python3-opencv.

Third, install the package.

    pip install .

See the Dockerfile for a minimal example of how to install on Ubuntu 18.04. To build it locally, run

    docker build -t rlgear:latest .

Usage

See tests/test_train_cartpole.py for a minimal working example.

Features

Canonical networks

Common networks such as DQN and IMPALA are implemented in pytorch, along with a fully connected network that uses separate networks for the value and policy. There is also a helper class that reduces boilerplate code for feedforward networks. See torch_models.py.

Setting Up Experiments

Import yaml files from other yaml files to adjust a small portion of a configuration for a new experiment, or save metadata from an experiment (git info, requirements.txt, etc.). See utils.py and rllib_utils.py.
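The idea of adjusting a small portion of a base configuration can be sketched as a recursive dictionary merge. This is only an illustration, not rlgear's implementation: the function name deep_merge and the example keys are hypothetical, and it assumes each yaml file has already been parsed into a plain dict.

```python
import copy
from typing import Any, Dict


def deep_merge(base: Dict[str, Any], override: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of `base` with values from `override` applied recursively."""
    merged = copy.deepcopy(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            # Both sides are mappings: merge them key by key.
            merged[key] = deep_merge(merged[key], value)
        else:
            # Scalars, lists, and new keys simply replace the base value.
            merged[key] = copy.deepcopy(value)
    return merged


# Hypothetical parsed contents of a base file and a small experiment file.
base_cfg = {"env": "CartPole-v0", "config": {"lr": 0.001, "num_workers": 2}}
experiment_cfg = {"config": {"lr": 0.0005}}

new_cfg = deep_merge(base_cfg, experiment_cfg)
```

Only the overridden key changes; everything else is inherited from the base configuration, and the base dict itself is left unmodified.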

Tensorboard Plotting

After running an experiment multiple times, plot the results in matplotlib with the percentiles drawn as transparent bands. See scripts.py and utils.py.
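As a rough sketch of what sits behind such a plot, the percentile bands can be computed per training step across repeated runs. The function names and the 25/75 defaults below are assumptions for illustration and are not taken from scripts.py or utils.py. It also assumes all runs were logged at the same steps.

```python
from typing import List, Sequence, Tuple


def percentile(sorted_vals: Sequence[float], q: float) -> float:
    """Linearly interpolated percentile (q in [0, 100]) of pre-sorted values."""
    if not sorted_vals:
        raise ValueError("need at least one value")
    pos = (len(sorted_vals) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    frac = pos - lo
    return sorted_vals[lo] * (1.0 - frac) + sorted_vals[hi] * frac


def percentile_bands(
    runs: Sequence[Sequence[float]], low: float = 25.0, high: float = 75.0
) -> Tuple[List[float], List[float], List[float]]:
    """Return (low, median, high) curves computed across runs at each step."""
    lows, medians, highs = [], [], []
    for step_vals in zip(*runs):  # one tuple of values per training step
        ordered = sorted(step_vals)
        lows.append(percentile(ordered, low))
        medians.append(percentile(ordered, 50.0))
        highs.append(percentile(ordered, high))
    return lows, medians, highs


# Three hypothetical runs, each with a reward logged at two steps.
bands = percentile_bands([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
```

The median curve can then be drawn with matplotlib's plot and the low and high curves passed to fill_between with an alpha below 1 to get the transparent band.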

Troubleshooting

Restoring PyTorch models

When restoring a pytorch model with ray>=0.8.6, the following error can arise when workers use only a cpu:

  RuntimeError: Attempting to deserialize object on a CUDA device but
  torch.cuda.is_available() is False. If you are running on a CPU-only machine,
  please use torch.load with map_location=torch.device('cpu') to map your
  storages to the CPU.

A short-term fix suggested here is to change the pytorch code in torch/storage.py as follows:

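    def _load_from_bytes(b):
        if torch.cuda.is_available():
            # A GPU is present, so storages can be restored to their saved device.
            return torch.load(io.BytesIO(b))
        else:
            # CPU-only worker: remap any CUDA storages onto the CPU.
            return torch.load(io.BytesIO(b), map_location=torch.device('cpu'))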

DistributionErrors for opencv

You may get the following error:

  pkg_resources.DistributionNotFound: The 'opencv-python-headless<=4.3.0.36;
  extra == "rllib"' distribution was not found and is required by ray

As described here, you need to install opencv-python-headless==4.3.0.36.
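To check whether an already installed version falls within the bound quoted in the error message, a small comparison like the following may help. It is a hedged sketch: the helper names are hypothetical, the version string is assumed to be purely numeric and dot-separated, and the installed version has to be supplied by hand (for example from the output of pip show).

```python
from typing import Tuple

# Upper bound quoted in the error message above.
REQUIRED_MAX = "4.3.0.36"


def parse_version(version: str) -> Tuple[int, ...]:
    """Turn a dotted numeric version such as '4.3.0.36' into a tuple of ints."""
    return tuple(int(part) for part in version.split("."))


def satisfies_upper_bound(installed: str, upper: str = REQUIRED_MAX) -> bool:
    """True if `installed` is less than or equal to `upper`."""
    return parse_version(installed) <= parse_version(upper)
```

For example, satisfies_upper_bound("4.4.0.46") is False, which would explain why a newer opencv-python-headless does not satisfy the requirement.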

License

BSD-3-Clause
