Dict Observation Space with MultiDiscrete Action Space Issue #1713
Labels
- custom gym env: Issue related to Custom Gym Env
- documentation: Improvements or additions to documentation
- help wanted: Help from contributors is welcomed
🐛 Bug
Using a Dict or Tuple observation space together with a MultiDiscrete action space causes PPO or A2C models to fail. Replacing either space with a Box resolves the issue. The environment passes the environment checker without any issue, but model setup then fails.
Code example
Relevant log output / Error message
System Info
Checklist