-
Notifications
You must be signed in to change notification settings - Fork 8.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Contact forces are zero in Ant-v2/v3 #1541
Comments
The same goes for the Humanoid environment - contact forces are zero in the case of MuJoCo 2.0 . The environment uses (v2 and v3) the
Due to this Humanoid and Ant reward structure differs between MuJoCo 2.0 and <2.0 . Is it intentional? |
Also, found this very frustrating. |
Thanks for reporting this bug! It looks like this could have been introduced by #1401 Can you guys verify that using |
Thanks for your response! Using I see two other options to not to lose MuJoCo updates:
|
I don't know if we are supporting two different versions of mujoco for any particular reason. I think I'd rather just stick to the old version of there is no compelling reason to upgrade. Any objections? |
I see several possible issues with this decision:
|
gym doesn't depend on any particular mujoco version, just the gym mujoco environments do, so this should not affect other uses of gym + mujoco 2.0. |
This means you can't use the gym-mujoco environments with mujoco 2.0 (unless you override the requirements) but they don't really work with mujoco 2.0 anyway. Users of Thanks for the bug report @pratikac! |
@christopherhesse Sorry for the late question, but I just want to make sure I understand the implication of research that has used MuJoCo 2.0 for these environments (Ant-v2, Hopper-v2, HalfCheetah-v2, etc). Does this issue mean that results and learning curves in papers which use those -v2 environments cannot be considered trusted, reliable, or interesting? |
I'd say that those results where clearly obtained under different conditions and so comparisons with others that use the previous versions of MuJoCo are pointless. However, comparisons between methods on the same environments should be as valid as other comparisons. In fact, these environments could be harder, since less information is provided to the agent. |
@tarod13 That's a good point. My main concern would be if we see a set of papers that show:
But, under MuJoCo 1.5, algorithm X is actually still better than Y. I guess this might be unavoidable, though. |
@tarod13 @DanielTakeshi |
@brickerino Yes, you are right. However, different versions of MuJoCo make the v2 and v3 environments provide different information. That's why I say that they are not comparable if the MuJoCo version is different and also why using MuJoCo 2.0 may result in ultimately harder environments, while they are actually the same environments. |
Refer to openai/gym#1541 for details.
The contact forces are all zero in the the MuJoCo Ant-v2/v3 environments. If one runs
the last 84 dimensions in the state out of 111 dimensions are always zero. These dimensions are the contact forces in ant_v3.py (https://github.com/openai/gym/blob/master/gym/envs/mujoco/ant_v3.py#L124). Since the ant is in contact with the ground, I believe there should be some contact forces that are non-zero. Can you please clarify if this is the expected behavior?
The text was updated successfully, but these errors were encountered: