MobileLlama3: Llama3 on Mobile

This repository is an implementation of quantizing and converting the Llama3-8B-Instruct model weights and deploying it on Android for on-device inference.

Pipeline:

Demo Output:

Resources:

Colab notebook to quantize and convert Llama3-8B-Instruct model
HuggingFace repository for Llama3-8B-Instruct converted weights.
Medium blog for step-by-step implementation to deploy Llama-3-8B-Instruct on Android.
Medium blog to set up environment on Google Cloud Platform VM instance.
Install the APK directly.

Citation

@software{mlc-llm,
    author = {MLC team},
    title = {{MLC-LLM}},
    url = {https://github.com/mlc-ai/mlc-llm},
    year = {2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
mobile-llama3		mobile-llama3
LICENSE		LICENSE
Llama3_on_Mobile.ipynb		Llama3_on_Mobile.ipynb
README.md		README.md
llama3_icon.png		llama3_icon.png
mobile-llama3-pipeline.png		mobile-llama3-pipeline.png
mobilellama3.apk		mobilellama3.apk
mobilellama3.gif		mobilellama3.gif

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MobileLlama3: Llama3 on Mobile

Pipeline:

Demo Output:

Resources:

Citation

About

Releases

Packages

Languages

License

NSTiwari/Llama3-on-Mobile

Folders and files

Latest commit

History

Repository files navigation

MobileLlama3: Llama3 on Mobile

Pipeline:

Demo Output:

Resources:

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages