Added a Gradio UI for multi-modal inferencing using Llama 3.2 Vision/ #718

Merged on Oct 28, 2024 (17 commits)

himanshushukla12 commented Oct 8, 2024

What does this PR do?

This PR adds a Gradio UI for multi-modal inference with Llama 3.2 Vision models. The UI has a chatbox-like interface: users upload an image and generate descriptive text from a prompt, with adjustable generation parameters (top-k, max tokens, temperature, and top-p) for controlling the output.
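As a rough illustration of how the four adjustable parameters could reach the model, here is a minimal sketch (not the code from this PR). The helper names `build_generation_kwargs` and `build_demo`, the `describe_image` callback, and all slider ranges and defaults are assumptions made for the example. It also uses a plain `gr.Interface` for brevity rather than the chatbox-style layout described above.

```python
def build_generation_kwargs(max_tokens=256, temperature=0.7, top_k=50, top_p=0.9):
    """Map UI slider values onto keyword arguments for `model.generate`."""
    if max_tokens < 1:
        raise ValueError("max_tokens must be at least 1")
    if top_k < 1:
        raise ValueError("top_k must be at least 1")
    if not 0.0 < top_p <= 1.0:
        raise ValueError("top_p must be in the interval (0, 1]")
    if temperature < 0:
        raise ValueError("temperature must be non-negative")

    if temperature == 0:
        # A temperature of zero is treated here as a request for greedy decoding.
        return {"max_new_tokens": int(max_tokens), "do_sample": False}

    return {
        "max_new_tokens": int(max_tokens),
        "do_sample": True,
        "temperature": float(temperature),
        "top_k": int(top_k),
        "top_p": float(top_p),
    }


def build_demo(describe_image):
    """Wire an image, a prompt, and four sliders to a hypothetical callback.

    `describe_image(image, prompt, max_tokens, temperature, top_k, top_p)`
    is assumed to return the generated text as a string.
    """
    import gradio as gr  # imported lazily so the helper above has no dependencies

    return gr.Interface(
        fn=describe_image,
        inputs=[
            gr.Image(type="pil", label="Image"),
            gr.Textbox(label="Prompt"),
            gr.Slider(minimum=1, maximum=2048, value=256, step=1, label="Max tokens"),
            gr.Slider(minimum=0.0, maximum=2.0, value=0.7, step=0.1, label="Temperature"),
            gr.Slider(minimum=1, maximum=100, value=50, step=1, label="Top-k"),
            gr.Slider(minimum=0.05, maximum=1.0, value=0.9, step=0.05, label="Top-p"),
        ],
        outputs=gr.Textbox(label="Response"),
    )
```

Inside the callback, the dictionary would typically be unpacked into the generation call, for example `model.generate(**inputs, **build_generation_kwargs(...))`, and the app started with `build_demo(describe_image).launch()`.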

Additionally, this PR:

Integrates the transformers and accelerate libraries for efficient model loading and inference.
Implements memory management for releasing GPU resources after inference.
Adds support for Hugging Face tokens to authenticate and access Llama models.
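The three points above can be sketched roughly as follows. This is an illustrative outline under stated assumptions, not the PR's actual code: the `--hf_token` flag name, the fallback to the `HF_TOKEN` environment variable, and the helper names are all hypothetical, and the model ID is left to the caller.

```python
import argparse
import gc
import os


def resolve_hf_token(argv=None):
    """Read the token from a (hypothetical) --hf_token flag, else from HF_TOKEN."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--hf_token", default=None, help="Hugging Face access token")
    args, _unknown = parser.parse_known_args(argv)
    token = args.hf_token or os.environ.get("HF_TOKEN")
    if not token:
        raise SystemExit("A Hugging Face token is required to download Llama models.")
    return token


def load_model(model_id, token):
    """Load a Llama 3.2 Vision checkpoint and its processor."""
    import torch
    from transformers import AutoProcessor, MllamaForConditionalGeneration

    model = MllamaForConditionalGeneration.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,
        device_map="auto",  # relies on accelerate to place weights on devices
        token=token,
    )
    processor = AutoProcessor.from_pretrained(model_id, token=token)
    return model, processor


def release_gpu_memory():
    """Collect garbage and clear the CUDA cache; returns True if the cache was cleared.

    The caller must drop its own references first (e.g. `del model, processor`),
    otherwise the tensors are still reachable and cannot be freed.
    """
    gc.collect()
    try:
        import torch
    except ImportError:
        return False
    if torch.cuda.is_available():
        torch.cuda.empty_cache()
        return True
    return False
```

A typical call sequence would be `token = resolve_hf_token()`, then `model, processor = load_model(model_id, token)`, and after inference `del model, processor` followed by `release_gpu_memory()`.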

@init27 init27 left a comment

Thanks for the super fast PR! I left some requests

@himanshushukla12
Contributor Author

@init27 I made the changes you asked for. Please check and let me know; I'll be happy to improve it further.

Modified readme for new code for passing token via argument
Used small case "g" in gradio
@himanshushukla12
Contributor Author

@init27 I've added the changes you asked for. Please check.

@himanshushukla12
Contributor Author

@init27 please let me know if anything else is required.
I'm waiting for your response 😄

@himanshushukla12
Contributor Author

@init27 can you check the PR? I renamed the file too.

@init27 init27 left a comment

Looks great, thanks for the PR!

init27 commented Oct 27, 2024

@himanshushukla12 sorry for missing this, looks great!

Can you please make sure the CI/CD is green and merge?

@himanshushukla12
Contributor Author

@init27 please check, everything is working fine.

@init27 init27 merged commit 2e99c1e into meta-llama:main Oct 28, 2024
4 checks passed