-
Notifications
You must be signed in to change notification settings - Fork 27.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Extending Encoder Decoder to GPT-2 #4961
Comments
It's on the roadmap :-) |
Thank you! Look forward to it :) |
Hi - I've actually been working on this myself the past couple days, should I submit a PR when finished? |
That'd be great! |
Will do - likely sometime this week. |
@djw1809 Any update on the PR? :) |
@patrickvonplaten Hello Patrick, I am watching with much interest EncodeDecoder from transformers :) . Any updates on supporting GPT2 with EncodeDecoder ? |
Got sidetracked with other research - coming back to it in several days,
working on my end, just need to play nice with the rest of the repo.
…On Tue, Jul 7, 2020 at 3:32 PM Mihai Ilie ***@***.***> wrote:
@patrickvonplaten <https://github.com/patrickvonplaten> Hello Patrick, I
am watching with much interest EncodeDecoder from transformers :) . Any
updates on supporting GPT2 with EncodeDecoder ?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#4961 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AG3PODZXYPBB33F4CBSNZLDR2OO7VANCNFSM4N4QTZQA>
.
--
Dylan Weber, Research Assistant | PhD Candidate
School of Math and Statistical Sciences
WXLR642/BYENG593 Arizona State University
|
@djw1809 - also feel free to already open a PR with unfinished code yet so that I can take a look early on and help you :-) |
Working on it now. Also linking this PR: #4483 |
@patrickvonplaten Hello Patrick. |
Hey @Squire-tomsk, I see what you mean - this would mean to add a new config param for each model that has cross-attention...is this common practice? Would be great if you could open a new issue for that :-) |
Done #6645 |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
Adding GPT2 initialization for EncoderDecoder model as pointed out in the issue below.
Originally posted by @patrickvonplaten in #4517 (comment)
The text was updated successfully, but these errors were encountered: