Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] Nightly or Weekly Android apk build #2194

Open
EwoutH opened this issue Apr 22, 2024 · 7 comments
Open

[Feature Request] Nightly or Weekly Android apk build #2194

EwoutH opened this issue Apr 22, 2024 · 7 comments
Labels
feature request New feature or request

Comments

@EwoutH
Copy link
Contributor

EwoutH commented Apr 22, 2024

🚀 Feature

Currently and Android APK is provided on https://llm.mlc.ai/docs/deploy/android.html. However, it's already a few months old and not updated on a regular schedule.

It would be great to have a nightly or weekly build of the Android APK, available on an accessible place

If needed, there could be separate versions for Adreno and Mali GPUs.

Motivation

With a nightly or weekly build, users can always grab the latest version and know when a new version will be build after a feature merge.

@tqchen
Copy link
Contributor

tqchen commented Apr 22, 2024

Thanks for th suggestion, deploying to mobile would involve more checks, so likely we are going to follow a release schedule in future and not doing nightly for apks

@EwoutH
Copy link
Contributor Author

EwoutH commented Apr 22, 2024

A (predictable) release schedule would also be great. For example, it looks like the Android APK was updated 3 days ago, but there is no version number or changelog available. This makes difficult to see if certain functionality is supported or not.

@sebastienbo
Copy link

sebastienbo commented Apr 25, 2024

I agree with Ewouth.
You guys have found a hole in the market, you guys have the only android application that can run LLM's on a phone.
I use phi-2, and wow this is great. Phi-3 just came out and that one is even much better then Llama 2 70b in just 2GB of data (incredible if this could work on a phone , especially now that snapdragon has a NPU onboard to offload LLM's -> much faster token generation and consumes less battery)

@tqchen
Copy link
Contributor

tqchen commented May 11, 2024

we are movig towards a JIT based SDK system in https://llm.mlc.ai/docs/deploy/android.html which hopefully allows latest features to be incorporated by directly re-runing the mlc_llm package.

We likely wont do nightly apk build in near term but hope the SDK plus a apk release after the Engine migration can help people to build apps around, please also stay tuned for future MLCEngine updates #2217 .

@sebastienbo
Copy link

sebastienbo commented May 11, 2024 via email

@tqchen
Copy link
Contributor

tqchen commented May 11, 2024

@sebastienbo sorry just wan to clarify, we would like to host APK for stable releases :) e.g. with major models, i just mention that in near term it would be harder for nightly

@sebastienbo
Copy link

sebastienbo commented May 13, 2024

@sebastienbo sorry just wan to clarify, we would like to host APK for stable releases :) e.g. with major models, i just mention that in near term it would be harder for nightly

Cool thank you

When do you expect a APK with phi 3?
How can we recognise it? Because your current APK hosted on GitHub does not have a version name in its file name

Ps: why is it necessary to download a new apk for each new model? Wouldn't it be easier to have an externally updated list? It would update the internal list. That way you can add new models just by adding them to the externally hosted list. And the user would just press a refresh button to see all the supported models (the list) that would also solve the problem of accidently deleting a model

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants