Add FlaxWhisperForAudioClassification model #23173

raghavanone · 2023-05-05T15:58:48Z

Fixes #21779

HuggingFaceDocBuilderDev · 2023-05-05T16:13:43Z

The documentation is not available anymore as the PR was closed or merged.

sgugger · 2023-05-05T16:14:36Z

cc @sanchit-gandhi

sgugger · 2023-05-05T16:15:03Z

The test failures are appearing on this one. Let's fix them and re-merge!

sanchit-gandhi · 2023-05-05T16:31:10Z

We need to make two changes following the updates in #22954! First, we need to assign the attribute gradient_checkpointing to the class FlaxWhisperForAudioClassificationModule, similar to what we do for FlaxWhisperForConditionalGeneration:

transformers/src/transformers/models/whisper/modeling_flax_whisper.py

Line 1176 in a5741d7

gradient_checkpointing: bool = False

We then need to forward self.gradient_checkpointing to the encoder:

-         self.encoder = FlaxWhisperEncoder(config=self.config, dtype=self.dtype)
+         self.encoder = FlaxWhisperEncoder(config=self.config, dtype=self.dtype, gradient_checkpointing=self.gradient_checkpointing)

This will facilitate gradient checkpointing for the module!

raghavanone · 2023-05-05T17:15:25Z

@sgugger @sanchit-gandhi Done, all tests pass!

sgugger

Thanks a lot!

sanchit-gandhi · 2023-05-09T15:54:48Z

src/transformers/models/whisper/modeling_flax_whisper.py

@@ -1512,6 +1512,7 @@ def update_inputs_for_generation(self, model_outputs, model_kwargs):
 class FlaxWhisperForAudioClassificationModule(nn.Module):
    config: WhisperConfig
    dtype: jnp.dtype = jnp.float32
+    gradient_checkpointing: bool = False

    def setup(self) -> None:
        self.encoder = FlaxWhisperEncoder(config=self.config, dtype=self.dtype)


Hey @raghavanone! Sorry I didn't get the chance to re-review the last changes before the merge. There's one small change we need in this line to forward the gradient checkpointing attribute to the encoder (see #23173 (comment)):

Suggested change

-         self.encoder = FlaxWhisperEncoder(config=self.config, dtype=self.dtype)
+         self.encoder = FlaxWhisperEncoder(config=self.config, dtype=self.dtype, gradient_checkpointing=self.gradient_checkpointing)

Would you like to open a new PR to add this one line?

Yes, will do.

* Add FlaxWhisperForAudioClassification model * Add models to init * Add models to init * Fix copies * Fix automapping * Fix failing test

ydshieh · 2023-06-07T10:01:12Z

tests/models/whisper/test_modeling_whisper.py

@@ -1430,7 +1430,7 @@ def __init__(
        hidden_dropout_prob=0.1,
        attention_probs_dropout_prob=0.1,
        max_position_embeddings=20,
-        max_source_positions=30,
+        max_source_positions=1500,


Hi @raghavanone @sanchit-gandhi

I am wondering why we increased these values so much?

Whisper works intrinsically on 30s inputs (3000 log-mel spectrogram frames, which the encoder's stride-2 convolution reduces to 1500 positions, hence max_source_positions=1500).

We could use a shorter context window (e.g. 30s -> 15s); we would just need to initialise the weights accordingly.
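As a rough check of the arithmetic, here is a small sketch that derives the number of encoder positions from the audio duration. It assumes Whisper's default feature-extraction settings (16 kHz sampling rate, hop length of 160 samples) and a single stride-2 convolution in the encoder; the helper name `encoder_positions` is hypothetical:

```python
def encoder_positions(
    seconds: float,
    sampling_rate: int = 16_000,  # assumed Whisper default
    hop_length: int = 160,        # assumed Whisper default (10 ms per frame)
    conv_stride: int = 2,         # assumed stride of the downsampling convolution
) -> int:
    """Return the encoder sequence length for an input of the given duration."""
    num_samples = int(seconds * sampling_rate)
    mel_frames = num_samples // hop_length
    return mel_frames // conv_stride


full_window = encoder_positions(30)  # the standard 30s context
half_window = encoder_positions(15)  # the shorter 15s context mentioned above
```

Under these assumptions a 30s window gives 3000 mel frames and 1500 encoder positions, and a 15s window gives 750 positions, which is the value `max_source_positions` would need for the shorter context.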

Fixed in #24105

raghavanone added 5 commits May 5, 2023 21:26

Add FlaxWhisperForAudioClassification model

539be94

Add models to init

ffc29c6

Add models to init

c181c77

Fix copies

f2e1431

Fix automapping

a5741d7

Fix failing test

058ec2a

sgugger approved these changes May 5, 2023

sgugger merged commit 312b104 into huggingface:main May 5, 2023

sanchit-gandhi reviewed May 9, 2023

This was referenced May 11, 2023

Add gradient_checkpointing parameter to FlaxWhisperEncoder. #23299

Closed

Add gradient_checkpointing parameter to FlaxWhisperEncoder #23300

Merged

ydshieh reviewed Jun 7, 2023
