
Make (TF) CI faster (test only a random subset of model classes) #24592

Merged · 3 commits · Jun 30, 2023

Conversation

@ydshieh (Collaborator) commented Jun 30, 2023

What does this PR do?

Daily CI currently takes 22h30m to run. @Rocketknight1 might have a way to bring it back down to 19-20 hours.

For some tests, let's test only a (random) subset of the model classes 🙏 .

Here is the timing of some very slow tests currently:

398.44s call     tests/models/bert/test_modeling_tf_bert.py::TFBertModelTest::test_xla_fit
275.59s call     tests/models/bert/test_modeling_tf_bert.py::TFBertModelTest::test_saved_model_creation_extended
217.84s call     tests/models/bert/test_modeling_tf_bert.py::TFBertModelTest::test_compile_tf_model
106.25s call     tests/models/bert/test_tokenization_bert_tf.py::BertTokenizationTest::test_saved_model
77.69s call     tests/models/bert/test_modeling_tf_bert.py::TFBertModelTest::test_onnx_runtime_optimize

and

352.31s call     tests/models/bart/test_modeling_tf_bart.py::TFBartModelTest::test_saved_model_creation_extended
272.56s call     tests/models/bart/test_modeling_tf_bart.py::TFBartModelTest::test_compile_tf_model
270.84s call     tests/models/bart/test_modeling_tf_bart.py::TFBartModelTest::test_xla_fit
132.59s call     tests/models/bart/test_modeling_tf_bart.py::TFBartModelTest::test_onnx_runtime_optimize
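The helper this PR introduces, `get_random_model_classes`, is referenced later in the thread but its code is not shown here. A minimal sketch of what such a random-subset helper could look like, assuming the convention that the first entry of `model_classes` is the base model (the name and signature below are illustrative, not the PR's actual code):

```python
import random

def get_random_model_classes(model_classes, num_classes=2):
    """Keep the base model (first entry) and sample the remaining
    classes at random, so head models still get occasional coverage."""
    # Illustrative sketch only; the real helper in the PR may differ.
    if len(model_classes) <= num_classes:
        return list(model_classes)
    head_classes = random.sample(list(model_classes[1:]), num_classes - 1)
    return [model_classes[0]] + head_classes
```

With four model classes and `num_classes=2`, this always tests the base model plus one randomly chosen model with a head.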

@ydshieh ydshieh changed the title Make (TF) CI faster (by testing only a subset of model classes) Make (TF) CI faster (test only a random subset of model classes) Jun 30, 2023
@ydshieh ydshieh requested review from Rocketknight1 and sgugger June 30, 2023 10:34
@HuggingFaceDocBuilderDev commented Jun 30, 2023

The documentation is not available anymore as the PR was closed or merged.

@sgugger (Collaborator) left a comment

Let's not take a random subset but the first two then. To test the base model and a model with head.

@Rocketknight1 (Member) commented Jun 30, 2023

Some of the very slow tests (like test_saved_model_creation_extended and test_xla_fit) only apply to a few models anyway - they're in test_modeling_tf_core.py, so they shouldn't have a big effect on the total test runtime. I might have a couple of ideas for speeding up test_compile_tf_model, though!

@ydshieh (Collaborator, Author) commented Jun 30, 2023

Let's not take a random subset but the first two then. To test the base model and a model with head.

Would it be ok to take the first one (base model) + a random other one with head?

@Rocketknight1 (Member) commented Jun 30, 2023

Also, I looked a bit closer at this PR and I'm actually a bit scared of some of the changes - in particular, test_pt_tf_model_equivalence is one of the most important tests and picks up lots of implementation problems in TF ports, so I don't want to reduce its coverage!

@ydshieh (Collaborator, Author) commented Jun 30, 2023

@Rocketknight1

But that test is not changed, i.e. it doesn't use get_random_model_classes introduced here. Nothing to fear 😆

@sgugger (Collaborator) commented Jun 30, 2023

Would it be ok to take the first one (base model) + a random other one with head?

I don't like randomness in tests as it makes them flaky.

@ydshieh (Collaborator, Author) commented Jun 30, 2023

Well, in this situation, I do prefer to keep a random head model:

  • We are reducing the number of model classes being tested because of the slow runtime. If we keep a fixed set of model classes, we are likely to miss failures in certain model heads. (And for the tests involved in this PR, all model classes currently pass; if not, probably just one or two fail.)

  • Only slow tests are involved --> no flakiness shown on CircleCI.

    • Sorry, I was wrong about this. But I can change it to apply only to slow tests.

WDYT if I make the changes only to slow tests?

@sgugger (Collaborator) commented Jun 30, 2023

I very much doubt we will have a failure on a model with head and not the others. With the randomness in the test, you won't be able to reproduce easily (and I don't see the test even printing the model class that failed) so I'd keep things reproducible. This is also on TensorFlow which has very low usage, so I don't think it's worth spending too much time over-engineering something.
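For contrast, the deterministic selection sgugger argues for (the base model plus the first model with a head, so any failure is reproducible) could be sketched as follows; the helper name is hypothetical and does not come from the PR:

```python
def get_first_model_classes(model_classes, num_classes=2):
    # Hypothetical helper: take a fixed, deterministic subset so any CI
    # failure can be reproduced exactly. By convention, entry 0 is the
    # base model and entry 1 is the first model with a task head.
    return list(model_classes[:num_classes])
```

Because the subset never changes between runs, a failing test always names the same model class, which addresses the reproducibility concern raised above.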

@ydshieh (Collaborator, Author) commented Jun 30, 2023

OKOK

@ydshieh (Collaborator, Author) commented Jun 30, 2023

@Rocketknight1 OK for you?

@Rocketknight1 (Member) left a comment

Yeah, I'm happy with this!

@ydshieh ydshieh merged commit 3441ad7 into main Jun 30, 2023
@ydshieh ydshieh deleted the save_our_poor_ci branch June 30, 2023 14:54