-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[gpt2bb spec] TransformerDecoder #7842
Conversation
/** | ||
* Query `Tensor` of shape `(B, T, dim)`. | ||
*/ | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Missing query?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the review! This was done purposely since I'm using query
as technically the first argument in call
(just renamed inputs
to query
here), but wanted to add this spec about it somewhere. This pattern happens in other layers too. Is there another way I can include this query
doc?
I didn't want to include query
here and then just have an unused first parameter to call
. Let me know what you think - thanks!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
@Linchenn Please take a look when you get a chance. Thanks! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
Adds the spec for the
TransformerDecoder
including any dependent layers.EinsumDense
TransformerDecoder