Making TF BART-like models XLA and AMP compliant #10191
Conversation
I succeeded in fixing Marian and Pegasus, and my first guess was the right one. I reworked how the embedding is created, and it now works with XLA_GPU. Of course, all the corresponding slow tests pass, and the weights are properly loaded.
(force-pushed from 0940def to dfb3d5e)
Great, LGTM!
```diff
- class TFMarianSinusoidalPositionalEmbedding(tf.keras.layers.Embedding):
+ class TFMarianSinusoidalPositionalEmbedding(tf.keras.layers.Layer):
```
Was this causing trouble for XLA?
Yes, more precisely XLA on GPU. With `tf.keras.layers.Embedding`, the weights are initialized on CPU while the model runs on GPU, and under XLA one device cannot access the other's memory. This happens because the embeddings are created in the `__init__` of the classes instead of in `build()`.
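For illustration, here is a minimal sketch of that pattern, assuming a standard sinusoidal table; the class and method names are hypothetical and simplified, not the PR's actual code. The key point is that the weight is created in `build()`, so its device placement is decided when the layer is first called rather than at construction time:

```python
import numpy as np
import tensorflow as tf


class SinusoidalPositionalEmbedding(tf.keras.layers.Layer):
    def __init__(self, num_positions: int, embedding_dim: int, **kwargs):
        super().__init__(**kwargs)
        self.num_positions = num_positions
        self.embedding_dim = embedding_dim  # assumed even for simplicity

    def build(self, input_shape):
        # Creating the weight here (instead of in __init__) means its device
        # placement is decided at first call, so under XLA_GPU the table lives
        # on the same device as the rest of the model.
        self.weight = self.add_weight(
            name="embeddings",
            shape=(self.num_positions, self.embedding_dim),
            trainable=False,
        )
        self.weight.assign(self._sinusoidal_table())
        super().build(input_shape)

    def _sinusoidal_table(self):
        # Standard transformer sin/cos position encoding table.
        position = np.arange(self.num_positions)[:, np.newaxis]
        div_term = np.exp(
            np.arange(0, self.embedding_dim, 2) * -(np.log(10000.0) / self.embedding_dim)
        )
        table = np.zeros((self.num_positions, self.embedding_dim), dtype=np.float32)
        table[:, 0::2] = np.sin(position * div_term)
        table[:, 1::2] = np.cos(position * div_term)
        return table

    def call(self, input_shape, past_key_values_length=0):
        # Return the position embeddings for the current sequence slice.
        seq_len = input_shape[1]
        positions = tf.range(past_key_values_length, past_key_values_length + seq_len)
        return tf.gather(self.weight, positions)
```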
```python
def test_saved_model_creation(self):
    # This test is too long (> 30s) and makes the CI fail
    pass
```
👍
Nice! Thanks a lot for fixing those!
Cool!
What does this PR do?
This PR makes the TF BART-like models compliant with AMP and XLA. The main issue for XLA was all the asserts: XLA does not support them (see the TF docs), so I had to disable them whenever the model is run in any mode other than eager.
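For illustration, a minimal sketch of that guard; the helper name is hypothetical, and only the `tf.executing_eagerly()` / `tf.debugging` pattern is the point. Since `tf.executing_eagerly()` returns False while a `tf.function` is being traced, the assert only fires in eager mode:

```python
import tensorflow as tf


def check_input_ids_within_bounds(input_ids, vocab_size):
    # Hypothetical helper: run the debugging assert only in eager mode,
    # since XLA-compiled graphs do not support these assert ops.
    if tf.executing_eagerly():
        tf.debugging.assert_less(
            input_ids,
            tf.cast(vocab_size, dtype=input_ids.dtype),
            message="input_ids must be strictly smaller than the vocabulary size",
        )
```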
TF Marian and Pegasus still have their XLA tests locked because they do not work with XLA_GPU. I need to investigate further to understand why. My first guess is that the TFXSinusoidalPositionalEmbedding class is the cause.
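For context, an XLA_GPU test of this kind essentially compiles the model's forward pass with XLA and runs it on GPU. A rough sketch follows; the checkpoint and invocation here are assumptions for illustration, and older TF versions spell `jit_compile` as `experimental_compile`:

```python
import tensorflow as tf
from transformers import MarianTokenizer, TFMarianMTModel

name = "Helsinki-NLP/opus-mt-en-de"  # example checkpoint, chosen for illustration
tokenizer = MarianTokenizer.from_pretrained(name)
model = TFMarianMTModel.from_pretrained(name)


@tf.function(jit_compile=True)  # force XLA compilation of the forward pass
def xla_forward(input_ids, decoder_input_ids):
    return model(input_ids, decoder_input_ids=decoder_input_ids).logits


inputs = tokenizer(["XLA test sentence."], return_tensors="tf")
# Feeding input_ids as decoder_input_ids is just to exercise the compiled graph.
logits = xla_forward(inputs["input_ids"], inputs["input_ids"])
```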