[Question] [Improvement] CUDA vectorized environment #772
Comments
Related issue: #314. With a high batch size, you can get some of the speedup of Isaac Gym just with a CudaVecEnv (a VecEnv that, underneath, calls Isaac Gym's already-parallel envs). To get the full speedup, the problem is converting all the buffers to CUDA (i.e., PyTorch) tensors and interconnecting them with Isaac Gym. That would not be a very backwards-compatible change, or it would at least require a bunch of code for switching between buffers; at the very least, the buffers would need to be PyTorch tensors instead of NumPy arrays.
Thank you for your answer. I was just wondering how to create that CudaVecEnv(VecEnv) class so that it can be applied to a generic environment whose system dynamics (i.e., the step function) are just some algebraic function (e.g., cart-pole or rocket landing).
Hmm, unfortunately there are no docs on creating VecEnvs, but you can copy the DummyVecEnv code and start modifying from there. The major difference from normal Gym envs is that a VecEnv works on batches: step() takes one action per sub-environment and returns batched observations, rewards and dones, and sub-environments are reset automatically when their episodes end.
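To make that concrete, here is a minimal, untested sketch of such a class, assuming a hypothetical `torch_env` object that already steps all of its sub-environments in one call and returns CUDA/PyTorch tensors (as an Isaac Gym task does). The `VecEnv` base class and its abstract methods are SB3's real API; everything about the wrapped env (its `device`, `num_envs`, auto-reset behaviour) is an assumption:

```python
import numpy as np
import torch

from stable_baselines3.common.vec_env import VecEnv


class CudaVecEnv(VecEnv):
    """Expose an already-batched GPU env to SB3 by converting tensors at the boundary."""

    def __init__(self, torch_env):
        # `torch_env` is hypothetical: it must expose num_envs, single-env
        # observation/action spaces, a device, and batched reset()/step().
        self.env = torch_env
        super().__init__(
            num_envs=torch_env.num_envs,
            observation_space=torch_env.observation_space,
            action_space=torch_env.action_space,
        )
        self._actions = None

    def reset(self):
        obs = self.env.reset()  # torch.Tensor on the GPU, shape (num_envs, *obs_shape)
        return obs.cpu().numpy()

    def step_async(self, actions):
        # SB3 hands over a NumPy batch of actions; move it to the env's device.
        self._actions = torch.as_tensor(actions, device=self.env.device)

    def step_wait(self):
        # The wrapped env is assumed to auto-reset finished sub-environments
        # (as Isaac Gym tasks do), which is what SB3 expects from a VecEnv.
        obs, rewards, dones, _extras = self.env.step(self._actions)
        # Copying back to the host every step is exactly the transfer
        # bottleneck discussed above.
        infos = [{} for _ in range(self.num_envs)]
        return obs.cpu().numpy(), rewards.cpu().numpy(), dones.cpu().numpy(), infos

    def close(self):
        pass

    # The remaining abstract methods are stubbed with minimal implementations.
    def get_attr(self, attr_name, indices=None):
        return [getattr(self.env, attr_name)] * self.num_envs

    def set_attr(self, attr_name, value, indices=None):
        setattr(self.env, attr_name, value)

    def env_method(self, method_name, *method_args, indices=None, **method_kwargs):
        return [getattr(self.env, method_name)(*method_args, **method_kwargs)]

    def env_is_wrapped(self, wrapper_class, indices=None):
        return [False] * self.num_envs

    def seed(self, seed=None):
        return [seed] * self.num_envs
```

Even with such an adapter, every step still crosses the host/device boundary twice (actions in, observations and rewards out); removing that copy entirely would require SB3's buffers to hold PyTorch tensors, which is the backwards-compatibility problem mentioned above.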
For Isaac Gym, you can take a look at this wrapper (the adapter is here) and their vec env implementation. You should also take a look at EnvPool (CPU only, but quite fast); we have a wrapper to use it with SB3: https://github.com/sail-sg/envpool/blob/master/examples/sb3_examples/ppo.py
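The linked ppo.py example also contains the small VecEnvWrapper-based adapter needed before handing the env to PPO; below is only a rough sketch of the raw EnvPool usage (task id and sizes are arbitrary, and exact return signatures can vary with the EnvPool/Gym versions):

```python
# Rough usage sketch of EnvPool's batched API (CPU, C++ thread pool); the full SB3
# integration, including the VecEnvWrapper-based adapter, is in the linked ppo.py example.
import numpy as np
import envpool

env = envpool.make("Pendulum-v1", env_type="gym", num_envs=16)
obs = env.reset()                                    # batched: shape (16, obs_dim)
actions = np.stack([env.action_space.sample() for _ in range(16)])
obs, rewards, dones, info = env.step(actions)        # everything comes back batched
print(obs.shape, rewards.shape, dones.shape)
```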
Closing, as some pointers have been given above.
Hi, thanks a lot for the well-documented Stable Baselines3. I am now using Isaac Gym Preview 4. May I ask if it is possible to give an example of wrapping IsaacGymEnvs into a VecEnv? I noticed this was mentioned before, and some tips were given in issue #772, but they seem to target Isaac Gym Preview 3. Could you give an example for Isaac Gym Preview 4?
I recently came across a paper about Isaac Gym by NVIDIA: https://arxiv.org/pdf/2108.10470.pdf
They suggest that massively parallelizing the environment computation on the GPU can improve RL training compared to mixed CPU-GPU architectures, which are bottlenecked by data transfers between host and device. NVIDIA claims that researchers can achieve the same level of success as OpenAI's supercomputer on a single GPU in about 10 hours.
The idea seems sound. A tentative implementation within Stable Baselines using Numba kernels has been reported in a short undergraduate project report:
https://www.sihao.dev/2021/05/21/increasing_sample_throughput_for_rl_environments_using_cuda/
Yet I suspect that this implementation may have some pitfalls, as the training results look strange/questionable.
Is anyone interested in exploring this idea and/or suggesting how to implement a "CudaVecEnv" correctly?
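For reference (this is not the report's code, just an illustrative sketch of the general idea): a batch of toy point-mass environments can be stepped entirely on the GPU with a Numba CUDA kernel while keeping the state buffers resident on the device, so data only moves to the host when the learner needs it. The Numba CUDA calls used here are real; the dynamics and reward are made up for illustration.

```python
# Illustrative sketch only: step a batch of toy point-mass environments on the GPU
# with a Numba CUDA kernel, keeping state buffers on the device between steps.
import numpy as np
from numba import cuda


@cuda.jit
def step_kernel(pos, vel, actions, rewards, dt):
    i = cuda.grid(1)                  # one thread per environment
    if i < pos.shape[0]:
        vel[i] += actions[i] * dt     # toy dynamics: force integrates into velocity
        pos[i] += vel[i] * dt
        rewards[i] = -abs(pos[i])     # toy reward: stay close to the origin


num_envs = 4096
dt = np.float32(0.01)

# Persistent device buffers: the whole point is to avoid per-step host<->device copies.
pos_d = cuda.to_device(np.zeros(num_envs, dtype=np.float32))
vel_d = cuda.to_device(np.zeros(num_envs, dtype=np.float32))
rew_d = cuda.device_array(num_envs, dtype=np.float32)

threads = 256
blocks = (num_envs + threads - 1) // threads

for _ in range(100):
    # Actions still come from the host here; in a full setup the policy would also
    # run on the GPU so that this copy disappears as well.
    actions_d = cuda.to_device(np.random.uniform(-1, 1, num_envs).astype(np.float32))
    step_kernel[blocks, threads](pos_d, vel_d, actions_d, rew_d, dt)

rewards = rew_d.copy_to_host()        # copy back only when the learner needs the data
print(rewards.mean())
```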