Added a new Spectrogram layer based on Conv1D operations, supporting GPU-parallelization and fine-tuning #20313
Conversation
Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). View this failed invocation of the CLA check for more information. For the most up-to-date status, view the checks section at the bottom of the pull request.
Codecov Report
Attention: Patch coverage is …
Additional details and impacted files
@@ Coverage Diff @@
## master #20313 +/- ##
==========================================
+ Coverage 78.81% 78.87% +0.06%
==========================================
Files 512 513 +1
Lines 49058 49233 +175
Branches 9033 9075 +42
==========================================
+ Hits 38665 38834 +169
- Misses 8529 8532 +3
- Partials 1864 1867 +3
Flags with carried forward coverage won't be shown.
Interesting feature, thanks for the PR!
What would be the main use cases for this? And what is the usage pattern in terms of what initializer to use, when to set `trainable = False`, etc.? Can you add a simple tutorial that demonstrates the value?
@keras_export("keras.layers.Spectrogram")
class Spectrogram(layers.Layer):
"spectrogram" is a bit generic, maybe there could be a more specific name?
I renamed it to `STFTSpectrogram`, which is more specific.
However, I aimed for this to be extended in later PRs to also include the Mel-Spectrogram, LogMel-Spectrogram, and MFCCs. These are all audio-based spectrograms, unlike the layer I just committed, which is more generic for time-series signals in general. Supporting those output modes would require extra computations at the end of the `__call__` function.
If all of these variations end up in one layer in the future, then maybe the name `Spectrogram` is better, since it is more generic. However, if that would be too monolithic and the variations should be handled in a new layer (maybe inheriting from the current one), then I think the current name `STFTSpectrogram` is sufficient.
What do you think? Should I use `STFTSpectrogram` or `Spectrogram`? (Keeping in mind the possible future extension to Mel-Spectrograms and MFCCs.)
This layer comes with its default initializer. The main two use cases are: …
I can craft an example using this layer. There are mainly three reasons for this contribution: …
K. W. Cheuk, H. Anderson, K. Agres and D. Herremans, "nnAudio: An On-the-Fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolutional Neural Networks," IEEE Access, vol. 8, pp. 161981-162003, 2020.
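(Not part of the PR itself, but for context: a minimal sketch of the Conv1D-based STFT idea described in the nnAudio paper above. The `frame_length`, `frame_step`, and Hann window below are arbitrary illustrative choices, not the layer's defaults.)

```python
import numpy as np
import keras

frame_length = 256  # assumed window size, for illustration only
frame_step = 128    # assumed hop size
num_bins = frame_length // 2 + 1

# Windowed DFT basis: real (cosine) and imaginary (sine) parts, Hann-weighted.
n = np.arange(frame_length)
k = np.arange(num_bins)[:, None]
window = np.hanning(frame_length)
real_basis = (np.cos(2 * np.pi * k * n / frame_length) * window).T.astype("float32")
imag_basis = (-np.sin(2 * np.pi * k * n / frame_length) * window).T.astype("float32")

def dft_conv(basis):
    # One Conv1D per component. Frozen here; leaving it trainable would let the
    # time-frequency basis be fine-tuned together with the rest of the model.
    layer = keras.layers.Conv1D(
        filters=num_bins,
        kernel_size=frame_length,
        strides=frame_step,
        use_bias=False,
        trainable=False,
    )
    layer.build((None, None, 1))
    layer.set_weights([basis[:, None, :]])  # kernel shape: (kernel_size, channels, filters)
    return layer

real_conv = dft_conv(real_basis)
imag_conv = dft_conv(imag_basis)

signal = np.random.randn(1, 16000, 1).astype("float32")  # (batch, time, channels)
magnitude = keras.ops.sqrt(real_conv(signal) ** 2 + imag_conv(signal) ** 2)
print(magnitude.shape)  # (batch, frames, num_bins)
```

Freezing the kernels reproduces a plain STFT front-end; leaving them trainable is what enables the fine-tuning use case mentioned above.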
Thanks for the update!
)
output = spectrogram_layer(input_signal)
Since this layer is fairly niche, please illustrate its usage with detailed code examples (with descriptions of what they do and what they'd be used for), e.g. "different modes of output", "non trainable init vs fine-tuning", etc.
The user should be able to read the docstring and understand: what is this layer for? when would I need it? how do I use it?
I have now added three code examples demonstrating the different use cases. Also check the more comprehensive tutorial I added here.
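(A hypothetical sketch of the "non-trainable init vs fine-tuning" pattern discussed above. The constructor arguments shown (`frame_length`, `frame_step`, `mode`) are assumptions for illustration; check the layer's docstring for the actual signature.)

```python
import keras

inputs = keras.Input(shape=(16000, 1))  # e.g. one second of 16 kHz mono audio

# (a) Frozen front-end: behaves like a fixed STFT feature extractor.
#     Argument names are assumed here, for illustration only.
frozen_spec = keras.layers.STFTSpectrogram(
    frame_length=256, frame_step=128, mode="log", trainable=False
)

# (b) Fine-tunable front-end: the STFT kernels start from the default
#     initialization but are updated during training.
tunable_spec = keras.layers.STFTSpectrogram(
    frame_length=256, frame_step=128, mode="log", trainable=True
)

features = frozen_spec(inputs)  # swap in `tunable_spec` to fine-tune the front-end
pooled = keras.layers.GlobalAveragePooling1D()(features)
outputs = keras.layers.Dense(10, activation="softmax")(pooled)
model = keras.Model(inputs, outputs)
model.summary()
```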
Looking good -- thank you for the contribution!
Added a Spectrogram layer that computes several modes of spectrograms (such as the magnitude, log-magnitude, and power spectral density). It aims to serve as an official, standardized layer for computing spectrograms as part of the model. The computations are based on Conv1D operations, which makes them parallelizable when running the model on GPUs. Also, since the computations are based on trainable kernels (which, like any other kernel, can be set to be trainable or not), further fine-tuning of the spectrogram computation is possible.
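(As a rough illustration, not the PR's actual code: how the output modes named above relate to one another, given real and imaginary STFT components such as those produced by the Conv1D kernels.)

```python
import numpy as np
import keras

def spectrogram_modes(real_part, imag_part, eps=1e-6):
    """Illustrative relation between the modes. A proper PSD would additionally
    normalize by window energy and sampling rate; that is omitted here."""
    power = real_part ** 2 + imag_part ** 2          # power per frequency bin
    magnitude = keras.ops.sqrt(power)                # "magnitude" mode
    log_magnitude = keras.ops.log(magnitude + eps)   # "log-magnitude" mode
    return {"magnitude": magnitude, "log": log_magnitude, "psd": power}

# Arbitrary shapes for demonstration: (batch, frames, frequency bins).
re = np.random.randn(1, 124, 129).astype("float32")
im = np.random.randn(1, 124, 129).astype("float32")
print({name: tuple(out.shape) for name, out in spectrogram_modes(re, im).items()})
```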