-
Notifications
You must be signed in to change notification settings - Fork 27.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix missing test in torch_job
#33593
Conversation
cc @ArthurZucker for reference |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for having a look and finding the root cause 🙏
(the failing test seems related to #33533 cc @zucchini-nlp ) |
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
Those are a bit flaky, in no-cache settings. Since the weights are random, we can generate image tokens (it's not oov anymore) and then at some point fail to get enough image embeddings. Do you think we should overwrite those for VLMs for be always with cache? @gante imo, not a big deal, for me it never failed locally until I got to CI runs |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good catch, was missing some of them indeed!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👀 Thanks for catching and fixing this!
TL;DR
@zucchini-nlp If I got it right, the error is caused by a generation-time behavior that doesn't exist in pre-trained models. This reminds me of Whisper, which has a bunch of The correct long-term fix should then be to parameterize the model (as in the model class, not the tester) to have a |
OK, I will rebase once #33602 is merged. Should I also wait for a fix for tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_sample_generate_dict_output ? |
@ydshieh working on it now (cc @zucchini-nlp, who is off for the next few days 🤗 ) |
@ydshieh rebasing now should get rid of the red CI 🙏 |
3accbdb
to
e4d4429
Compare
fix missing tests Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
fix missing tests Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
fix missing tests Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
fix missing tests Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
What does this PR do?
Currently we have
and
(or any model test class)
plus
in CircleCI config.
So
torch_job
won't run tests which is marked asgenerate
, which are all tests as any model test class inherits fromGenerationTesterMixin
.This PR fixes it