[TODO] Investigate equivalence tests #16497
Comments
Another one to add to this list: |
Thanks. @stas00 also reported this. I will take a look~ |
(just for the record) Among
(so > 25% to get close to |
@ydshieh I believe you can add the |
I am afraid I will completely forget this issue. But if this bothers you guys, OK for me. Thanks for the tip, I didn't know about it |
(add a lot of assignees just to make you informed and kept updated in the future. Don't hesitate to remove yourself if you think it's irrelevant)
Currently the PT/TF/Flax equivalence tests use 1e-5 as the tolerance for the absolute differences of outputs. We see that these tests fail with a non-negligible (although not carefully defined) frequency.
Create this page to track a list of models to investigate.
FlaxWav2Vec2ModelTest (2.2888184e-05 > 1e-5)
TFGPT2EncoderDecoderModelTest (0.001009281724691391 > 1e-3)
    (… 1e-3)
    5e-8: not very sure why this doesn't get much larger.
    encoder_extended_attention_mask: 1e-4 vs 1e-9.
TFViTMAEModelTest (1.013279e-05 > 1e-5)
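
For reference, the kind of check behind the numbers above can be sketched as follows. This is only an illustration of a max-absolute-difference comparison against a tolerance, not the actual test code: the helper names `flatten`, `max_abs_diff` and `check_equivalence` are hypothetical, and plain nested lists stand in for the framework outputs so that the snippet runs without PT/TF/Flax installed.

```python
def flatten(values):
    """Yield the scalar entries of an arbitrarily nested list/tuple."""
    for v in values:
        if isinstance(v, (list, tuple)):
            yield from flatten(v)
        else:
            yield float(v)


def max_abs_diff(a, b):
    """Largest element-wise absolute difference between two outputs."""
    flat_a = list(flatten(a))
    flat_b = list(flatten(b))
    if len(flat_a) != len(flat_b):
        raise ValueError("outputs have different numbers of elements")
    return max((abs(x - y) for x, y in zip(flat_a, flat_b)), default=0.0)


def check_equivalence(a, b, tol=1e-5):
    """Return (passed, diff): passed is True when diff does not exceed tol."""
    diff = max_abs_diff(a, b)
    return diff <= tol, diff


# (observed difference, tolerance) pairs copied from the list above.
reported = {
    "FlaxWav2Vec2ModelTest": (2.2888184e-05, 1e-5),
    "TFGPT2EncoderDecoderModelTest": (0.001009281724691391, 1e-3),
    "TFViTMAEModelTest": (1.013279e-05, 1e-5),
}

for name, (diff, tol) in reported.items():
    # A ratio just above 1 means the test failed by a narrow margin.
    print(f"{name}: diff / tol = {diff / tol:.3f}")
```

Looking at the ratio rather than the raw difference makes it easier to tell a marginal failure (slightly above the tolerance) from a genuine mismatch.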