Port SwinTransformer3d from torchmultimodal #6499

oke-aditya · 2022-08-25T19:22:00Z

🚀 The feature

The main idea is to port the SwinTransformer3d model from torchmultimodal to torchvision.

We need to keep in mind the nuances and code structure of torchvision.

https://github.com/facebookresearch/multimodal/blob/main/torchmultimodal/modules/encoders/swin_transformer_3d_encoder.py

https://github.com/facebookresearch/multimodal/blob/main/examples/omnivore/LoadOriginalPretrainedWeightAndCompare.ipynb

We need to port the implementation as well as the weights.
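As a rough illustration of the weight-porting step: checkpoints from one codebase usually need their state-dict keys renamed before they load into a re-implemented model. The sketch below is only an assumption about what such a step might look like. The function names `remap_state_dict_keys` and `missing_and_unexpected`, and every key name in the example, are hypothetical and are not taken from torchmultimodal or torchvision. It works on plain dictionaries so it can run without PyTorch.

```python
from typing import Dict, Iterable, List, Mapping, Tuple, TypeVar

T = TypeVar("T")


def remap_state_dict_keys(
    state_dict: Mapping[str, T], prefix_map: Mapping[str, str]
) -> Dict[str, T]:
    """Return a copy of state_dict with key prefixes renamed (hypothetical helper).

    The longest matching prefix wins; keys that match no prefix are kept as-is.
    """
    # Try longer prefixes first so the most specific rule is applied.
    ordered = sorted(prefix_map.items(), key=lambda kv: len(kv[0]), reverse=True)
    remapped: Dict[str, T] = {}
    for key, value in state_dict.items():
        new_key = key
        for old_prefix, new_prefix in ordered:
            if key.startswith(old_prefix):
                new_key = new_prefix + key[len(old_prefix):]
                break
        if new_key in remapped:
            raise ValueError(f"key collision after remapping: {new_key!r}")
        remapped[new_key] = value
    return remapped


def missing_and_unexpected(
    remapped: Mapping[str, T], expected_keys: Iterable[str]
) -> Tuple[List[str], List[str]]:
    """Compare remapped keys with the keys the target model expects."""
    expected = set(expected_keys)
    got = set(remapped)
    return sorted(expected - got), sorted(got - expected)


# Hypothetical key names, for illustration only.
source = {"encoder.patch_embed.weight": "w0", "encoder.norm.bias": "b0"}
ported = remap_state_dict_keys(source, {"encoder.": ""})
print(ported)
print(missing_and_unexpected(ported, ["patch_embed.weight", "norm.bias", "head.weight"]))
```

With real checkpoints, the remapped dictionary would then be passed to the new model's `load_state_dict`, and the outputs of the original and ported models compared on the same input (for example with `torch.testing.assert_close`), which is roughly what the comparison notebook linked above does for the original weights.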

Motivation, pitch

The idea is to first port SwinTransformer3dV1 and port its weights successfully. Once that is done, we can think about adding SwinTransformer3dV2 (there is no such paper or implementation, but it may bring benefits as it did in the 2D case).

Alternatives

No response

Additional context

Additionally, from a discussion with @YosuaMichael: the paper also mentions that SwinTransformerV2 can be used for object detection tasks. If possible, we should explore this, but only after the work above is finished.

oke-aditya · 2022-08-28T18:37:13Z

Just a quick update: I have started working on this. (Sadly, my technical knowledge needed a bit of a refresher, thanks to Java and other tech work.) I have read through the ViT and SwinTransformer papers. I will go through the video variant over the next 2 days, verify the implementation, and open a PR.

datumbox mentioned this issue Aug 26, 2022

[RFC] Batteries Included - Phase 3 #6323

Open

16 tasks

oke-aditya mentioned this issue Aug 30, 2022

Add Video SwinTransformer #6521

Merged

YosuaMichael closed this as completed in #6521 Nov 17, 2022
