Port SwinTransformer3d from torchmultimodal #6499

Closed
oke-aditya opened this issue Aug 25, 2022 · 1 comment · Fixed by #6521

Comments

@oke-aditya
Contributor

🚀 The feature

The main idea is to port the SwinTransformer3d model from torchmultimodal to torchvision.

We need to keep the nuances and code structure of torchvision in mind.

https://github.com/facebookresearch/multimodal/blob/main/torchmultimodal/modules/encoders/swin_transformer_3d_encoder.py

https://github.com/facebookresearch/multimodal/blob/main/examples/omnivore/LoadOriginalPretrainedWeightAndCompare.ipynb

We need to port the implementation as well as the weights.
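As a rough illustration of the weight-porting step, here is a minimal sketch of remapping checkpoint keys from one module layout to another. The key prefixes (`encoder.`, `patch_embed.`) and the helper names are hypothetical and are not taken from torchmultimodal or torchvision; the real mapping depends on how the ported module is structured. Values are plain placeholders here, whereas in practice they would be tensors.

```python
from typing import Dict, List, Mapping, Tuple, TypeVar

T = TypeVar("T")


def remap_state_dict_keys(
    state_dict: Mapping[str, T], prefix_map: Mapping[str, str]
) -> Dict[str, T]:
    """Return a copy of state_dict with key prefixes renamed per prefix_map.

    prefix_map is a hypothetical {old_prefix: new_prefix} table.
    """
    # Try longer prefixes first so the most specific rule wins.
    rules = sorted(prefix_map.items(), key=lambda kv: -len(kv[0]))
    remapped: Dict[str, T] = {}
    for key, value in state_dict.items():
        new_key = key
        for old, new in rules:
            if key.startswith(old):
                new_key = new + key[len(old):]
                break
        if new_key in remapped:
            raise KeyError(f"two source keys map to the same target key: {new_key}")
        remapped[new_key] = value
    return remapped


def diff_keys(ported: Mapping[str, T], target_keys: List[str]) -> Tuple[List[str], List[str]]:
    """Return (missing, unexpected) key lists relative to the target model."""
    missing = sorted(set(target_keys) - set(ported))
    unexpected = sorted(set(ported) - set(target_keys))
    return missing, unexpected


# Hypothetical source checkpoint and target key layout, for illustration only.
source = {
    "encoder.patch_embed.proj.weight": "w0",
    "encoder.norm.weight": "w1",
}
ported = remap_state_dict_keys(source, {"encoder.": ""})
missing, unexpected = diff_keys(ported, ["patch_embed.proj.weight", "norm.weight"])
```

With real models, the remapped dictionary would then be passed to `load_state_dict` on the ported module, and the outputs of the original and ported models compared on the same input (for example with `torch.testing.assert_close`), as the linked notebook does for the original weights.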

Motivation, pitch

The idea is to first port SwinTransformer3dV1 along with its weights. Once that is done, we can consider adding a SwinTransformer3dV2 (no such paper or implementation exists, but it may bring benefits similar to those seen in the 2D case).

Alternatives

No response

Additional context

Additionally, in a discussion with @YosuaMichael, it came up that the paper mentions SwinTransformerV2 can be used for object detection tasks. If possible, we should explore that, but only after the items above are finished.

@oke-aditya
Contributor Author

Just a quick update: I have started working on this. (Sadly, my technical knowledge needed a bit of a refresher, thanks to Java and other tech work.) I have read through the ViT and SwinTransformer papers. I will go through the video variant over the next 2 days, verify the implementation, and open a PR.
