Implement CachedMultiHeadAttention layer #7882

pforderique · 2023-07-31T15:06:01Z

Implements the CachedMultiHeadAttention layer from Keras-NLP attention layers.

Depends on #7875 (MultiHeadAttention implementation)
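
For readers unfamiliar with the idea, here is a minimal sketch of what a key/value cache does during step-by-step decoding. It is not the code from this PR and does not use the TensorFlow.js API: it is a single-head version on plain arrays, and every name in it (`KVCache`, `createCache`, `cachedAttentionStep`) is hypothetical. It also assumes that attention covers only positions up to the current index, whereas a real layer would typically rely on an attention mask for that.

```typescript
// Hypothetical single-head sketch of cached attention on plain arrays.
// None of these names come from the PR or from TensorFlow.js.
type Vec = number[];

interface KVCache {
  keys: Vec[];   // one key vector per position, preallocated
  values: Vec[]; // one value vector per position, preallocated
}

// Preallocate zero-filled key and value slots for `maxLength` positions.
function createCache(maxLength: number, dim: number): KVCache {
  const zeros = (): Vec[] =>
    Array.from({length: maxLength}, () => new Array<number>(dim).fill(0));
  return {keys: zeros(), values: zeros()};
}

function dot(a: Vec, b: Vec): number {
  return a.reduce((sum, x, i) => sum + x * b[i], 0);
}

// Numerically stable softmax over a list of scores.
function softmax(scores: number[]): number[] {
  const max = Math.max(...scores);
  const exps = scores.map(s => Math.exp(s - max));
  const total = exps.reduce((a, b) => a + b, 0);
  return exps.map(e => e / total);
}

// Writes the new key/value at `index`, then attends over positions 0..index.
// Earlier keys and values are read from the cache instead of being recomputed.
function cachedAttentionStep(
    cache: KVCache, index: number, query: Vec, key: Vec, value: Vec): Vec {
  cache.keys[index] = key.slice();
  cache.values[index] = value.slice();

  const scale = 1 / Math.sqrt(query.length);
  const scores =
      cache.keys.slice(0, index + 1).map(k => dot(query, k) * scale);
  const weights = softmax(scores);

  // Weighted sum of the cached value vectors.
  const out = new Array<number>(value.length).fill(0);
  weights.forEach((w, pos) => {
    cache.values[pos].forEach((v, d) => {
      out[d] += w * v;
    });
  });
  return out;
}
```

Each decoding step calls `cachedAttentionStep` with the next index, so the per-step cost grows with the number of cached positions rather than requiring keys and values for the whole prefix to be recomputed.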

Linchenn

LGTM!

mattsoulanille

LGTM!

pforderique and others added 20 commits July 13, 2023 09:50

Add spec for multi-head attention

ea472e2

Merge branch 'main' into spec-transformer

20f8358

Add CachedMultiHeadAttention cache

41a105e

Fix typos

6e78ffc

Lint

01d9e2e

Add Transformer Decoder spec

8f08c19

lint

4713c4e

Add Einsum spec

37aca1a

lint

2a6d929

Remove unused type declaration

6dcb7a0

Merge branch 'main' into spec-transformer

db6fc8d

Move helper functions outside EinsumDense class

e589817

Implement Einsum Dense

9bafba5

Address comments

4428cf1

Merge branch 'main' into einsum-dense-impl

871dc34

Implement MHA Layer

9e54a15

Add masked softmax support

acb83e2

Merge branch 'main' into mha-impl

89d4f62

Merge branch 'main' into mha-impl

97c16bf

Add CMHA impl and tests

516bd4f

pforderique marked this pull request as ready for review July 31, 2023 16:33

pforderique requested review from mattsoulanille and Linchenn July 31, 2023 16:33

pforderique and others added 2 commits July 31, 2023 16:40

Fix typo

360a841

Merge branch 'master' into cmha-impl

dce07e4

pforderique mentioned this pull request Aug 4, 2023

Implement TransformerDecoder layer #7890

Merged

Merge branch 'main' into cmha-impl

c5487a0

Linchenn approved these changes Aug 7, 2023

mattsoulanille approved these changes Aug 7, 2023

tfjs-layers/src/layers/advanced_activations.ts (outdated, resolved)

pforderique enabled auto-merge (squash) August 7, 2023 22:00

Merge branch 'main' into cmha-impl

1867f47

pforderique merged commit 428ab7c into tensorflow:master Aug 8, 2023

pforderique deleted the cmha-impl branch August 8, 2023 17:02
