-
Notifications
You must be signed in to change notification settings - Fork 254
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add XLNetBackbone
#928
Add XLNetBackbone
#928
Conversation
Still WIP. |
Hey @mattdangerw & @chenmoneygithub Turns out that addition of XLNet needs MultiHeadRelativeAttention and TwoStreamRelativeAttention apart from relative encoding, transformerXL layers. Should I break this PR up into multiple ones? I'm not able to make entire model work correctly, just yet. |
@shivance timely question! We actually have a contribution guide for models about to land here. #820 Could you try following that and give us feedback? Re the overall stucture, lets keep everything in a We also should make sure we have the full forward pass matching correctly before we get too deep into review on style, etc. You can usually try making a colab for starters that aligns the forward pass with a reference implementation and share that here! |
@shivance, let me know when this is ready for review! |
Really Sorry for slacking off here, am not able to find time cuz of personal reasons. Will follow up soon. |
No hurry, take your time! Take care! |
Hi @shivance can I please continue this PR? |
Hi @shivance thanks for quick reply, I would love to continue your work. How about I fork your branch and push the commits on your repo(to shivance/keras-nlp_branch...) and you merge them, that way we both will share credits? or if you want I can copy the existing code and make a new PR. What do you say? |
I think if I add you as collaborator on my fork, and then when you push to
my branch, the same PR should get updated without getting closed.
…On Mon, 5 Jun, 2023, 00:47 Susnato Dhar, ***@***.***> wrote:
Hi @shivance <https://github.com/shivance> thanks for quick reply, I
would love to continue your work. How about I fork your branch and push the
commits on your repo(to shivance/keras-nlp_branch...) and you merge them,
that way we both will share credits? or if you want I can copy the existing
code and make a new PR. What do you say?
—
Reply to this email directly, view it on GitHub
<#928 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AMK2NO5D7TXLOOIO572OOITXJTNLNANCNFSM6AAAAAAWIKP6E4>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
then please add me as a collaborator @shivance . |
@susnato just gave! |
Hi @mattdangerw, I am currently working on this integration. I have one small doubt - |
Hi @mattdangerw @abheesht17 , In xlnet implementation there are some optional arguments such as Should I then define the call method in the backbone?(That would make this backbone an exception) or should I do a workaround(eg, instead of passing None we can pass -1e+9)? |
Hi @shivance I am sorry but I think it's better to open a new PR. Since you created this PR I need to tag you all time to ask for changing description, to make it ready for review from draft or asking reviews from maintainers, and also you might get bothered about getting tagged all the time, so I am making a new PR, so please close this one also please do not worry, I am going to continue from your commits. |
Closes #753
Referring XLNetBase, TransformerXL, Two stream Attention