Patch for gym version 0.21.0 bug #224

jxtrbtk · 2023-02-27T10:27:00Z

I came across this bug from gym: openai/gym#2752
When time limit is triggered, the output is a bool instead of a dict like {'player_0': False, 'player_1': False}

This produces the following error randomly. It appears sooner or later during the training.

~/.conda/envs/lux_ai_s2/lib/python3.7/site-packages/gym/wrappers/time_limit.py in step(self, action)
16 self._elapsed_steps is not None
17 ), "Cannot call env.step() before calling reset()"
---> 18 observation, reward, done, info = self.env.step(action)
19 self._elapsed_steps += 1
20 if self._elapsed_steps >= self._max_episode_steps:

/tmp/ipykernel_13436/3060603165.py in step(self, action)
57 obs, _, done, info = self.env.step(action)
58 obs = obs[agent]
---> 59 done = done[agent]
60 # if type(done) == type({}): done = done[agent]
61 # elif type(done) == type(True): done = {agent: done, opp_agent: False}

TypeError: 'bool' object is not subscriptable

I sugest this patch to avoid that.

I came across this bug in gym: openai/gym#2752 When time limit is triggered, the output is a bool instead of a dict like {'player_0': False, 'player_1': False} This produces the following error randomly. It appears sooner or later during the training. ~/.conda/envs/lux_ai_s2/lib/python3.7/site-packages/gym/wrappers/time_limit.py in step(self, action) 16 self._elapsed_steps is not None 17 ), "Cannot call env.step() before calling reset()" ---> 18 observation, reward, done, info = self.env.step(action) 19 self._elapsed_steps += 1 20 if self._elapsed_steps >= self._max_episode_steps: /tmp/ipykernel_13436/3060603165.py in step(self, action) 57 obs, _, done, info = self.env.step(action) 58 obs = obs[agent] ---> 59 done = done[agent] 60 # if type(done) == type({}): done = done[agent] 61 # elif type(done) == type(True): done = {agent: done, opp_agent: False} TypeError: 'bool' object is not subscriptable I sugest this patch to avoid that.

Patch for gym version 0.21.0 bug

netlify · 2023-02-27T10:27:08Z

✅ Deploy Preview for lux-eye-s2 canceled.

Name	Link
🔨 Latest commit	`84e5058`
🔍 Latest deploy log	https://app.netlify.com/sites/lux-eye-s2/deploys/63fc85767b70520008bde07f

StoneT2000 · 2023-02-27T17:53:57Z

This is fixed in the latest version of Lux actually. The bug is that the time limit wrapper was around the inner most env instead of outer.

What version of lux are you on?

jxtrbtk · 2023-02-27T18:49:10Z

2.1.0
-> to me seems unchanged on that part since that version

StoneT2000 · 2023-02-28T01:45:07Z

@jxtrbtk please update to v2.1.9

jxtrbtk · 2023-02-28T13:14:30Z

It seems to be fixed after updating to 2.1.9. Can't be 100% sure as it was sometimes happening after a full night of training and millions of steps. But I have some settings where it was triggering always after only a dozen of minutes and that I used to reproduce this easily. And for this one, it's now passing OK.
Thanks !

jxtrbtk added 2 commits February 27, 2023 11:18

Merge pull request #1 from jxtrbtk/jxtrbtk-patch-1

84e5058

Patch for gym version 0.21.0 bug

StoneT2000 closed this Feb 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Patch for gym version 0.21.0 bug #224

Patch for gym version 0.21.0 bug #224

jxtrbtk commented Feb 27, 2023

netlify bot commented Feb 27, 2023 •

edited

Loading

StoneT2000 commented Feb 27, 2023

jxtrbtk commented Feb 27, 2023 •

edited

Loading

StoneT2000 commented Feb 28, 2023

jxtrbtk commented Feb 28, 2023

Patch for gym version 0.21.0 bug #224

Patch for gym version 0.21.0 bug #224

Conversation

jxtrbtk commented Feb 27, 2023

netlify bot commented Feb 27, 2023 • edited Loading

✅ Deploy Preview for lux-eye-s2 canceled.

StoneT2000 commented Feb 27, 2023

jxtrbtk commented Feb 27, 2023 • edited Loading

StoneT2000 commented Feb 28, 2023

jxtrbtk commented Feb 28, 2023

netlify bot commented Feb 27, 2023 •

edited

Loading

jxtrbtk commented Feb 27, 2023 •

edited

Loading