Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

🚀 [Release Notes] 2024.10 #116

Open
lixin4ever opened this issue Oct 26, 2024 · 0 comments
Open

🚀 [Release Notes] 2024.10 #116

lixin4ever opened this issue Oct 26, 2024 · 0 comments
Labels
enhancement New feature or request good first issue Good for newcomers

Comments

@lixin4ever
Copy link
Contributor

Although we encountered several unexpected difficulties (like the lack of computing resources and manpower) in the past few months, we are constantly maintaining this repo and trying to deliver some new stuff to the community. In this release (202410), we provide two new models:

  1. VideoLLaMA2.1-7B-16F
    • Supercharging VideoLLaMA2 with SigLIP and Qwen2
    • Training VideoLLaMA2 on more textual data (largely from Magpie and ALLaVA) to enhance the instruction following capability
    • Improved results on almost all of the benchmarks
Model Egoschema Perception-Test MVBench VideoMME MSVC (Caption) ActivityNet-QA
VideoLLaMA2-7B-16F 51.7 51.4 54.6 47.9/50.3 2.53/2.59 50.2/3.3
VideoLLaMA2.1-7B-16F 53.1 54.9 57.3 54.9/56.4 2.87/2.81 53.0/3.4
  1. VideoLLaMA2.1-7B-AV
    • Trained from VideoLLaMA2.1-7B-16F
    • Included more audio-visual joint training data (from AVInstruct) and more pure-text data
    • Improved training recipes (e.g., we found that smaller batch sizes in audio-related training always give better results)
@lixin4ever lixin4ever added enhancement New feature or request good first issue Good for newcomers labels Oct 26, 2024
@clownrat6 clownrat6 pinned this issue Oct 30, 2024
@clownrat6 clownrat6 changed the title Release notes at 202410 🚀 [Release Notes] 2024.10 Oct 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

1 participant