Add `Video Swin Transformer` Model #2262

innat · 2023-12-23T20:16:15Z

Short Description

Video Swin Transformer is a pure transformer based video modeling algorithm, attained top accuracy on the major video recognition benchmarks.

Papers

https://arxiv.org/abs/2106.13230
published in 2021, Cited by 1154 (until now).

Existing Implementations

PyTorch (official): https://github.com/SwinTransformer/Video-Swin-Transformer
TorchVision : https://pytorch.org/vision/main/models/video_swin_transformer.html
Keras 2: https://github.com/innat/VideoSwin.
Keras 3: https://github.com/innat/VideoSwin/tree/feat_kerasv3

Other Information

divyashreepathihalli · 2023-12-27T09:03:26Z

@innat Thanks for filing the issue! Are you interested in contributing?

innat · 2023-12-30T11:08:36Z

@divyashreepathihalli

@innat Thanks for filing the issue! Are you interested in contributing?

Unfortunately I don't have long bandwidth to keep working on this feature, (I've noticed there are many pending PR). Therefore, unless there is a high-priority inclusion of this feature in kerascv's current roadmap, I am willing to offer guidance to any contributor interested. Thank you for your understanding.

simeetnayan81 · 2024-01-03T03:57:06Z

Hey @innat @divyashreepathihalli. This project seems interesting and I wish to contribute. Will require some guidance too since I am new to Keras codebase.

innat · 2024-01-05T12:56:07Z

@simeetnayan81
As you're new to keras-cv, first take a look how they iimplemented backbone and image classification task. According to that, you may start adding video swin as backbone and create video classifier as high level task. For model implementation in keras-v3, please check the first post.

divyashreepathihalli · 2024-01-05T18:51:17Z

Thank you @simeetnayan81 for your interest and thank you @innat for your help! The team currently does not have bandwidth for this. We appreciate the help!!

ID6109 · 2024-01-06T00:18:02Z

Hey @innat @divyashreepathihalli! I'd love to add this model to the codebase. I have prior experience with handling the models implemented in KerasCV as well. Thanks!

innat · 2024-01-06T10:13:18Z

@ID6109 @simeetnayan81
Thank you both. Feel free to start working. You guys can collaborate to each other. Check out the resource I shared in the first post.

Note, unlike image model which only have imagenet weight currently, video mdoels often comes with pretrained weight for mutliple dataset, i.e. kinetrics, something something. Also, their rescaling can be different. At first, you don't need to worry about weight, just start adding backbone and high level classifier.

divyashreepathihalli · 2024-01-09T21:25:10Z

Created a branch - https://github.com/keras-team/keras-cv/tree/video-swin-transformer
please collaborate on this branch and then we can open a PR to master from here.

innat · 2024-03-15T18:26:48Z

@divyashreepathihalli
Is it acceptable to take guideline from practitioner?

sachinprasadhs added type:feature stat:contributions welcome labels Jan 5, 2024

innat mentioned this issue Jan 27, 2024

Video swin #2319

Closed

5 tasks

innat mentioned this issue Mar 5, 2024

Add Video Swin Transformer #2369

Merged

5 tasks

divyashreepathihalli assigned tirthasheshpatel Mar 19, 2024

divyashreepathihalli closed this as completed in #2369 Apr 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `Video Swin Transformer` Model #2262

Add `Video Swin Transformer` Model #2262

innat commented Dec 23, 2023 •

edited

Loading

divyashreepathihalli commented Dec 27, 2023

innat commented Dec 30, 2023 •

edited

Loading

simeetnayan81 commented Jan 3, 2024 •

edited

Loading

innat commented Jan 5, 2024

divyashreepathihalli commented Jan 5, 2024

ID6109 commented Jan 6, 2024

innat commented Jan 6, 2024

divyashreepathihalli commented Jan 9, 2024

innat commented Mar 15, 2024 •

edited

Loading

Add Video Swin Transformer Model #2262

Add Video Swin Transformer Model #2262

Comments

innat commented Dec 23, 2023 • edited Loading

divyashreepathihalli commented Dec 27, 2023

innat commented Dec 30, 2023 • edited Loading

simeetnayan81 commented Jan 3, 2024 • edited Loading

innat commented Jan 5, 2024

divyashreepathihalli commented Jan 5, 2024

ID6109 commented Jan 6, 2024

innat commented Jan 6, 2024

divyashreepathihalli commented Jan 9, 2024

innat commented Mar 15, 2024 • edited Loading

Add `Video Swin Transformer` Model #2262

Add `Video Swin Transformer` Model #2262

innat commented Dec 23, 2023 •

edited

Loading

innat commented Dec 30, 2023 •

edited

Loading

simeetnayan81 commented Jan 3, 2024 •

edited

Loading

innat commented Mar 15, 2024 •

edited

Loading