What @axb2035 wrote is mostly correct: the environment itself never truncates, i.e. it never stops an episode because of a time limit, which is why its step function returns a constant False in the truncated position.
However, when the environment is created with gym.make, a time limit wrapper is applied that can enforce a time limit and truncate the episode. For most MuJoCo environments, including HalfCheetah, this time limit is 1000 steps.
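To make the split between the environment and the wrapper concrete, here is a minimal sketch in plain Python. It does not use gym at all: `ToyEnv`, `ToyTimeLimit`, and `run_episode` are hypothetical names invented for illustration, and the wrapper is a simplified approximation of the idea, not the library's actual code.

```python
class ToyEnv:
    """Hypothetical stand-in for an environment that never truncates on its own."""

    def reset(self):
        self.state = 0
        return self.state, {}

    def step(self, action):
        self.state += action
        reward = float(action)
        terminated = False  # this toy task has no terminal state
        truncated = False   # the environment itself enforces no time limit
        return self.state, reward, terminated, truncated, {}


class ToyTimeLimit:
    """Simplified sketch of a time-limit wrapper (an assumption, not gym's code)."""

    def __init__(self, env, max_episode_steps):
        self.env = env
        self.max_episode_steps = max_episode_steps
        self.elapsed_steps = 0

    def reset(self):
        self.elapsed_steps = 0
        return self.env.reset()

    def step(self, action):
        obs, reward, terminated, truncated, info = self.env.step(action)
        self.elapsed_steps += 1
        # The wrapper, not the environment, flips truncated to True.
        if self.elapsed_steps >= self.max_episode_steps:
            truncated = True
        return obs, reward, terminated, truncated, info


def run_episode(env):
    """Step with a fixed action until the episode terminates or is truncated."""
    env.reset()
    steps = 0
    while True:
        _, _, terminated, truncated, _ = env.step(1)
        steps += 1
        if terminated or truncated:
            return steps, terminated, truncated


steps, terminated, truncated = run_episode(ToyTimeLimit(ToyEnv(), max_episode_steps=1000))
print(steps, terminated, truncated)  # 1000 False True
```

The bare `ToyEnv` always reports `truncated` as `False`, just like the hard-coded value in the question. Only the wrapped version ever ends an episode by truncation.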
Question
This is code directly copied from this repo for the halfcheetah environment v4, specifically this
That function returns `observation`, `reward`, `terminated`, and `info`, which all makes sense, but then there is that `False` that I don't understand. What's the purpose of that? What am I missing?