feat: embedded model configurations, add popular model examples, refactoring #1532

mudler · 2024-01-01T20:54:12Z

This adds docs around the setups by sharing URL and shorthands in the code (hardcoded for now) plus adds embedded model configs directly from the code. ~~It also re-adds back #1522 that was missed in #1506~~

In the embedded/models/ are defined all the yaml config files that can be run via arg command line directly by calling local-ai <model-file>. For instance, running local-ai llava now automatically sets up llava from the embedded/models/llava.yaml file, likewise, specifying a mapping in the embedded/model_library.yaml file will map a short-hand model to a full URL as already supported by #1522

Refactors the downloader code into its own package, and creates a new embedded package that allows to embed model YAML configuration file directly from the source. I've added llava and mistral for now, but planning to add popular models as well with an easy "one click" UX experience.

It is also retouching a bit some part of the docs, tangentially related to #1416

~~Todo: instead of hardcoding urls would be nicer to embed the files directly during build time~~ This is done now

Not really related to the change, this PR introduces few optimizations to the CI:

removes some duplicate image building jobs
caches gRPC builds across GHA runs

netlify · 2024-01-01T20:54:16Z

✅ Deploy Preview for localai canceled.

Name	Link
🔨 Latest commit	`9f9c5ce`
🔍 Latest deploy log	https://app.netlify.com/sites/localai/deploys/65986e164cb01b0008d237ef

mudler · 2024-01-04T13:19:50Z

docs/content/getting_started/_index.en.md

+
+| Model | Docker command |
+| --- | --- |
+| phi-2 | ```docker run -p 8080:8080 -ti --rm quay.io/go-skynet/local-ai:{{< version >}}-cublas-cuda11-core phi-2``` |


thinking again, better to have maybe llama-cpp-phi-2, vllm-phi-2, etc..

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

embedded/models/mistral-openorca.yaml

Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

….1 by renovate (#16987) This PR contains the following updates: | Package | Update | Change | |---|---|---| | [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) | patch | `v2.4.0-cublas-cuda11-ffmpeg-core` -> `v2.4.1-cublas-cuda11-ffmpeg-core` | --- > [!WARNING] > Some dependencies could not be looked up. Check the Dependency Dashboard for more information. --- ### Release Notes <details> <summary>mudler/LocalAI (docker.io/localai/localai)</summary> ### [`v2.4.1`](https://togithub.com/mudler/LocalAI/releases/tag/v2.4.1) [Compare Source](https://togithub.com/mudler/LocalAI/compare/v2.4.0...v2.4.1)  ##### What's Changed ##### Exciting New Features 🎉 - feat: embedded model configurations, add popular model examples, refactoring by [@mudler](https://togithub.com/mudler) in [https://github.com/mudler/LocalAI/pull/1532](https://togithub.com/mudler/LocalAI/pull/1532) ##### Other Changes - ⬆️ Update docs version mudler/LocalAI by [@localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1546](https://togithub.com/mudler/LocalAI/pull/1546) - ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1547](https://togithub.com/mudler/LocalAI/pull/1547) - docs: improve getting started by [@mudler](https://togithub.com/mudler) in [https://github.com/mudler/LocalAI/pull/1553](https://togithub.com/mudler/LocalAI/pull/1553) **Full Changelog**: mudler/LocalAI@v2.4.0...v2.4.1 </details> --- ### Configuration 📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone Europe/Amsterdam, Automerge - At any time (no schedule defined). 🚦 **Automerge**: Enabled. ♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox. 🔕 **Ignore**: Close this PR and you won't be reminded about this update again. --- - [ ] If you want to rebase/retry this PR, check this box --- This PR has been generated by [Renovate Bot](https://togithub.com/renovatebot/renovate).

….1 by renovate (truecharts#16987) This PR contains the following updates: | Package | Update | Change | |---|---|---| | [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) | patch | `v2.4.0-cublas-cuda11-ffmpeg-core` -> `v2.4.1-cublas-cuda11-ffmpeg-core` | --- > [!WARNING] > Some dependencies could not be looked up. Check the Dependency Dashboard for more information. --- ### Release Notes <details> <summary>mudler/LocalAI (docker.io/localai/localai)</summary> ### [`v2.4.1`](https://togithub.com/mudler/LocalAI/releases/tag/v2.4.1) [Compare Source](https://togithub.com/mudler/LocalAI/compare/v2.4.0...v2.4.1)  ##### What's Changed ##### Exciting New Features 🎉 - feat: embedded model configurations, add popular model examples, refactoring by [@&truecharts#8203;mudler](https://togithub.com/mudler) in [https://github.com/mudler/LocalAI/pull/1532](https://togithub.com/mudler/LocalAI/pull/1532) ##### Other Changes - ⬆️ Update docs version mudler/LocalAI by [@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1546](https://togithub.com/mudler/LocalAI/pull/1546) - ⬆️ Update ggerganov/llama.cpp by [@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1547](https://togithub.com/mudler/LocalAI/pull/1547) - docs: improve getting started by [@&truecharts#8203;mudler](https://togithub.com/mudler) in [https://github.com/mudler/LocalAI/pull/1553](https://togithub.com/mudler/LocalAI/pull/1553) **Full Changelog**: mudler/LocalAI@v2.4.0...v2.4.1 </details> --- ### Configuration 📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone Europe/Amsterdam, Automerge - At any time (no schedule defined). 🚦 **Automerge**: Enabled. ♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox. 🔕 **Ignore**: Close this PR and you won't be reminded about this update again. --- - [ ] If you want to rebase/retry this PR, check this box --- This PR has been generated by [Renovate Bot](https://togithub.com/renovatebot/renovate).

mudler added the enhancement New feature or request label Jan 3, 2024

mudler commented Jan 4, 2024

View reviewed changes

mudler force-pushed the docs_update branch 2 times, most recently from 7a4eec1 to 99598a7 Compare January 5, 2024 14:56

mudler mentioned this pull request Jan 5, 2024

[Refactor]: Core/API Split #1506

Merged

mudler changed the title ~~docs: various updates, add popular model examples~~ feat: various updates, add popular model examples Jan 5, 2024

mudler added 10 commits January 5, 2024 18:12

move downloader out

38bd221

separate startup functions for preloading configuration files

9822cb4

docs: add popular model examples

17a4643

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

shorteners

283c676

Add llava

a47ffbf

Add mistral-openorca

a9d0ab6

Better link to build section

3e19d99

docs: update

7dfae7c

fixup

c37420c

Drop code dups

f104f39

mudler force-pushed the docs_update branch from 5eccc33 to f104f39 Compare January 5, 2024 17:25

Minor fixups

3c58089

mudler commented Jan 5, 2024

View reviewed changes

embedded/models/mistral-openorca.yaml Outdated Show resolved Hide resolved

mudler and others added 4 commits January 5, 2024 18:52

Apply suggestions from code review

6c65176

Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

ci: try to cache gRPC build during tests

3b5fa1a

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

ci: do not build all images for tests, just necessary

9c35ff8

ci: cache gRPC also in release pipeline

21dc249

mudler changed the title ~~feat: various updates, add popular model examples~~ feat: embedded model configurations, add popular model examples Jan 5, 2024

mudler changed the title ~~feat: embedded model configurations, add popular model examples~~ feat: embedded model configurations, add popular model examples, refactoring, docs updates Jan 5, 2024

fixes

315de23

mudler changed the title ~~feat: embedded model configurations, add popular model examples, refactoring, docs updates~~ feat: embedded model configurations, add popular model examples, refactoring Jan 5, 2024

Update model_preload_test.go

9f9c5ce

Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

mudler merged commit 09e5d90 into master Jan 5, 2024

mudler deleted the docs_update branch January 5, 2024 22:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: embedded model configurations, add popular model examples, refactoring #1532

feat: embedded model configurations, add popular model examples, refactoring #1532

mudler commented Jan 1, 2024 •

edited

Loading

netlify bot commented Jan 1, 2024 •

edited

Loading

mudler Jan 4, 2024

feat: embedded model configurations, add popular model examples, refactoring #1532

feat: embedded model configurations, add popular model examples, refactoring #1532

Conversation

mudler commented Jan 1, 2024 • edited Loading

netlify bot commented Jan 1, 2024 • edited Loading

✅ Deploy Preview for localai canceled.

mudler Jan 4, 2024

Choose a reason for hiding this comment

mudler commented Jan 1, 2024 •

edited

Loading

netlify bot commented Jan 1, 2024 •

edited

Loading