Just-Round

Supplementary code for "Just Round: Quantized Observation Spaces Enable Memory Efficient Learning of Dynamic Locomotion"

Installation

Install stable_baselines3 from https://github.com/DLR-RM/stable-baselines3
Install mujoco-py from https://github.com/openai/mujoco-py
Run python3 setup.py install or python3 setup.py develop

Example

PPO

import gym
from stable_baselines3 import PPO

from just_round import QuantizedRolloutBuffer

env = gym.make("Walker2d-v3")

# For PPO, need to explicitly initialize rollout_buffer
model = PPO("MlpPolicy", env, verbose=1)
model.rollout_buffer = QuantizedRolloutBuffer(
    model.n_steps,
    model.observation_space,
    model.action_space,
    device=model.device,
    gamma=model.gamma,
    gae_lambda=model.gae_lambda,
    n_envs=model.n_envs,
    dec=1,
)
model.learn(total_timesteps=100000)

SAC

import gym
from stable_baselines3 import SAC

from just_round import QuantizedReplayBuffer

env = gym.make("Walker2d-v3")

# For SAC, can make use of the replay_buffer_class initialization argument
model = SAC("MlpPolicy", env, verbose=1, replay_buffer_class=QuantizedReplayBuffer)
model.learn(total_timesteps=10000)

Citing

To cite this work in your research, please use the following bibtex:

@inproceedings{grossman23JustRound,
  author = {Grossman, Lev and Plancher, Brian},
  title = {Just Round: Quantized Observation Spaces Enable Memory Efficient Learning of Dynamic Locomotion},
  booktitle={IEEE International Conference on Robotics and Automation (ICRA)},
  address = {London, UK},
  month={May.},
  year = {2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
just_round		just_round
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Just-Round

Installation

Example

PPO

SAC

Citing

About

Releases

Packages

Contributors 2

Languages

License

A2R-Lab/Just-Round

Folders and files

Latest commit

History

Repository files navigation

Just-Round

Installation

Example

PPO

SAC

Citing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages