Added quantisation layer, commitment loss, and associated unit tests #29
Conversation
Overall a good start. Some questions / suggestions.
Looks good! I like the way you structured the code, and adding the reshaping utils will come in handy as well.
Please include in the Test Plan:
Sorry, I probably didn't make the purpose of the Test Plan clear: you can just run the tests related to your changes, e.g.,
Thanks for the nice work! Going through the threads in this PR, I can see a few points to follow up on:
- Learning of the codebook needs to be implemented, either by adding a VQ loss or as a moving average of the encoder outputs.
- Re-design the commitment loss (or the VQVAE loss as a whole) and clean up the API of the quantization layer.
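As a reference for the second codebook-learning option mentioned above, here is a minimal sketch of an exponential-moving-average (EMA) codebook update. All names, shapes, and the decay value are illustrative assumptions, not the PR's actual API:

```python
import torch
import torch.nn.functional as F

# Hypothetical sizes: 8 codebook entries of dimension 4
decay = 0.99
codebook = torch.randn(8, 4)        # (num_embeddings, embedding_dim)
ema_counts = torch.ones(8)          # running usage count per entry
ema_sums = codebook.clone()         # running sum of assigned encoder outputs


def ema_update(z_e: torch.Tensor, indices: torch.Tensor) -> None:
    """Move each codebook entry toward the mean of the encoder outputs
    assigned to it, instead of training it with a VQ loss."""
    one_hot = F.one_hot(indices, num_classes=8).float()   # (N, 8)
    ema_counts.mul_(decay).add_(one_hot.sum(0), alpha=1 - decay)
    ema_sums.mul_(decay).add_(one_hot.t() @ z_e, alpha=1 - decay)
    codebook.copy_(ema_sums / ema_counts.unsqueeze(1))


# Usage: indices would come from nearest-neighbor assignment of z_e
z_e = torch.randn(16, 4)
indices = torch.cdist(z_e, codebook).argmin(dim=1)
ema_update(z_e, indices)
print(codebook.shape)  # torch.Size([8, 4])
```

Because the codebook is updated by these running statistics rather than by gradients, no codebook loss term is needed in this variant.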
Also I have a question for you:
- The commitment loss is computed with the quantized vectors detached before they are returned by forward, as in the OpenAI and Google examples. In your current implementation, the quantized vectors are detached afterwards (inside CommitmentLoss). Do you think the two steps are commutative?
Let me know your thoughts on these observations and whether anything might be missing.
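To make the detach question above concrete, here is a small sketch comparing the two orderings. The tensor names and the CommitmentLoss-style helper are illustrative assumptions, not the PR's actual code:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
z_e = torch.randn(4, 8, requires_grad=True)  # encoder output
z_q = torch.randn(4, 8, requires_grad=True)  # quantized vectors

# Variant A: detach before the loss, i.e. forward() returns z_q.detach()
loss_a = F.mse_loss(z_e, z_q.detach())


# Variant B: pass z_q through and detach inside the loss module
def commitment_loss(encoded: torch.Tensor, quantised: torch.Tensor) -> torch.Tensor:
    return F.mse_loss(encoded, quantised.detach())


loss_b = commitment_loss(z_e, z_q)

# The loss value itself is identical either way, since detach() does not
# change values, only gradient flow.
print(torch.allclose(loss_a, loss_b))  # True
```

The practical difference is not the loss value but what the rest of the model receives: if forward returns an already-detached tensor, no downstream term can propagate gradients through the quantized vectors, whereas detaching only inside the loss leaves that path open elsewhere.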
Next step: click "import to fbsource" button below and wait for the internal CI to finish, fix any issues as needed.
I think as long as
@RdoubleA has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
Summary:
The Vector Quantized Variational Autoencoder (VQVAE) is widely used to generate multimodal data (audio, image, or video samples). A well-known model built on this idea is OpenAI's DALL-E, which generates images from text captions. The core component of the VQVAE is the quantisation layer, a discrete embedding table that maps encoder outputs onto a finite set of codebook vectors. Here, this layer is implemented as its own module, along with the commitment loss, which is used to keep the encoder outputs close to the codebook.
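To illustrate the core mechanism described in the summary, here is a minimal, self-contained sketch of a quantisation layer with a straight-through gradient estimator. The class name, constructor arguments, and shapes are hypothetical and do not reflect the PR's actual API:

```python
import torch
from torch import nn


class Quantisation(nn.Module):
    """Sketch of a vector quantisation layer: maps each input vector
    to its nearest codebook embedding (illustrative, not the PR's API)."""

    def __init__(self, num_embeddings: int, embedding_dim: int):
        super().__init__()
        self.embedding = nn.Embedding(num_embeddings, embedding_dim)

    def forward(self, z: torch.Tensor) -> torch.Tensor:
        # z: (batch, embedding_dim) flattened encoder outputs
        distances = torch.cdist(z, self.embedding.weight)  # (batch, num_embeddings)
        indices = distances.argmin(dim=1)
        quantised = self.embedding(indices)
        # Straight-through estimator: forward uses the codebook vectors,
        # backward passes gradients through to the encoder unchanged
        return z + (quantised - z).detach()


torch.manual_seed(0)
layer = Quantisation(num_embeddings=8, embedding_dim=4)
out = layer(torch.randn(3, 4))
print(out.shape)  # torch.Size([3, 4])
```

The straight-through trick is what lets the encoder train through the non-differentiable argmin lookup; the codebook itself is then learned separately, e.g. via a VQ loss or an EMA update as discussed in the review comments.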
Test plan:
Unit tests for the new modules are included in this PR and are run with pytest:
pytest -vv test/modules/losses/test_commitment.py
pytest -vv test/modules/layers/test_quantisation.py