
Updated paper on the latest model (video understanding, etc.) #38

Open
thecooltechguy opened this issue May 5, 2024 · 4 comments

@thecooltechguy

Congrats on adding support for video understanding to VILA, looks super cool!

Just curious, is there an updated or new paper with more technical details on how improved video understanding was added to the VILA model?

Thanks!

@Lyken17
Collaborator

Lyken17 commented May 7, 2024

Hi @thecooltechguy

The main benefit comes from training-data improvements during pre-training.

We are working on technical papers and plan to reveal more details once ready :)

@hkunzhe

hkunzhe commented May 10, 2024

@Lyken17 Great work! Looking forward to the technical paper!

@hkunzhe

hkunzhe commented May 23, 2024

@Lyken17 Hi, I noticed that the paper was updated a few days ago, but it still does not mention the video-understanding capability. After comparing VILA's initial submission with version 1.5, I found that the pre-training data only added ShareGPT4V, while video-related datasets such as Shot2Story and ShareGPT4Video were added at the SFT stage. Moreover, the model was switched from Llama 2 + CLIP to Llama 3 + SigLIP/InternViT. Could you elaborate on these changes in more detail?

@Lyken17
Collaborator

Lyken17 commented Jun 21, 2024

We will release the arXiv paper sometime in July. Stay tuned :)

gheinrich pushed a commit to gheinrich/VILA that referenced this issue Dec 16, 2024