Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for BigBird #143

Open
tanmaylaud opened this issue Apr 8, 2021 · 3 comments · May be fixed by #425
Open

Add support for BigBird #143

tanmaylaud opened this issue Apr 8, 2021 · 3 comments · May be fixed by #425
Labels
enhancement New feature or request

Comments

@tanmaylaud
Copy link

🌟 New adapter setup

Model description

Big Bird is a new model available in huggingface ( efficient transformer)

Open source status

  • [ X] the model implementation is available: (give details)
  • [ X] the model weights are available: (give details)
@tanmaylaud tanmaylaud added the enhancement New feature or request label Apr 8, 2021
@AmirAktify
Copy link

I might attempt this one next since we use it in production, unless anyone else wants to pick it up?

I know the T5 attempt was pretty unwieldy/slow on my part, but I think I learned a fair bit about the codebase and it should hopefully go smoother next time.

It would be interesting to see if one could get the adapters to interact with the sparse attention mechanism non-trivially, but that would be ambitious.

@calpt
Copy link
Member

calpt commented Oct 19, 2021

Hey @AmirAktify, would be great if you could implement this, there's no one from our side currently working on it.

Also, thanks again for your help on the T5 implementation which is finally released now and where you did most of the work. We're always happy for any help we can get from the community, therefore really appreciating your contributions 😃

@h4ste
Copy link

h4ste commented Aug 3, 2022

Any progress on this? I can probably help with implementation.

@AmirAktify AmirAktify removed their assignment Aug 19, 2022
@calpt calpt linked a pull request Oct 13, 2022 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants