-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CI: 9 tests failing with PyTorch 1.10(dev) #9280
Comments
I am a bit confused because this test seems to be testing Pytorch 1.10 but according to https://pytorch.org/docs/versions.html v1.10 is not released yet |
correct, we are building against the dev version to avoid any big breaking when the next version comes. |
@Borda I see, thanks! Update: after #9345 these are remaining failing tests:
|
Stack trace for
It seems quant_score is higher than it should be. When I run the test locally (Pytorch 1.9) quant_score = 0.5105 @Borda since you worked on this do you have any idea what might be the issue? If not should we check with Pytorch folks? |
as nothing changed with PL callback I would open parallel talk to Pytorch folks and Ill check it on PL side |
Sounds good, Pytorch issue created: pytorch/pytorch#64564 |
So as of latest commit on master we now have 30 tests failing in the Pytorch 1.10 env: https://github.com/PyTorchLightning/pytorch-lightning/runs/3571593663 Here is what I notice about this job:
Given this, what do people think about deleting this test? If this test is important, then we should make it required which will force us to keep it green, but I feel it is really harmful to have it constantly making our CI red, and don't think it is sustainable to keep fixing it all the time. cc @PyTorchLightning/core-contributors |
Testing nightly is useful to track these errors. However, you don't want to mark it as required to avoid upstream blocking us. Since GitHub doesn't allow having "informative" jobs, if we want to keep the CI green but not mark a nightly job as required, the only option I can think of is only enabling it when there's a release candidate available which should be more stable. |
we do not have this nightly versions as required 🐰 |
@carmocca This sounds like a good option! I just think it is very important that our CI is actually green, not that it is red but those who know it well know that it is effectively "green". But that any one visiting the repo, or a new contributor, sees that it is green. I think it makes a lot of sense to only enable the newest PT version when there's an RC available, which will be more stable as you said. Also this aligns well with the new release cycle which is synced with Pytorch's. So now since we won't be releasing PL1.5 until October 26, after PT1.10 is released, we can disable this test until closer to the release, and then focus on making it green if there are any outstanding issues. |
so shall we close this issue? :] |
yup! thanks for the help |
Discussed in #9273
Originally posted by daniellepintz September 2, 2021
I noticed in recent commits these 9 tests are consistently failing in the Conda 1.10 environment
(https://github.com/PyTorchLightning/pytorch-lightning/runs/3483032531)
Starting this thread to track this issue and try to resolve. Does anyone have any ideas? cc @Borda
The text was updated successfully, but these errors were encountered: