feat: embedded model configurations, add popular model examples, refactoring #1532

Merged: mudler merged 17 commits into master from docs_update on Jan 5, 2024

@mudler (Owner) commented Jan 1, 2024

This adds docs for the setups that can be started from a URL or from a shorthand defined in the code (hardcoded for now), and adds embedded model configs directly in the code. It also re-adds #1522, which was missed in #1506.

All YAML config files defined in embedded/models/ can be run directly from the command line by calling local-ai <model-file>. For instance, running local-ai llava now automatically sets up llava from the embedded/models/llava.yaml file. Likewise, a mapping in the embedded/model_library.yaml file maps a shorthand model name to a full URL, as already supported by #1522.
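The lookup described above could be sketched roughly as follows. This is only an illustration, not the code from this PR. The function `resolveModelArg`, the `Resolution` type, the lookup order (embedded config first, then shorthand library, then pass-through), and the `example-model` entry with its URL are all assumptions made for the example. The maps stand in for the contents of embedded/models/ and embedded/model_library.yaml.

```go
package main

import (
	"fmt"
	"strings"
)

// embeddedModels stands in for the YAML files under embedded/models/.
// The contents are placeholders, not the real configurations.
var embeddedModels = map[string]string{
	"llava":   "name: llava\n",
	"mistral": "name: mistral\n",
}

// modelLibrary stands in for embedded/model_library.yaml (shorthand -> full URL).
// The entry below is a made-up example.
var modelLibrary = map[string]string{
	"example-model": "https://example.com/models/example-model.yaml",
}

// Resolution describes how a command-line argument was interpreted.
type Resolution struct {
	Kind   string // "embedded", "library", or "passthrough"
	Target string // embedded file path, full URL, or the argument unchanged
}

// resolveModelArg applies the lookup order assumed in this sketch:
// embedded config first, then the shorthand library, then pass-through.
func resolveModelArg(arg string) Resolution {
	name := strings.TrimSuffix(arg, ".yaml")
	if _, ok := embeddedModels[name]; ok {
		return Resolution{Kind: "embedded", Target: "embedded/models/" + name + ".yaml"}
	}
	if url, ok := modelLibrary[arg]; ok {
		return Resolution{Kind: "library", Target: url}
	}
	// Anything else (for example a full URL) is left for the caller to handle.
	return Resolution{Kind: "passthrough", Target: arg}
}

func main() {
	for _, arg := range []string{"llava", "example-model", "https://example.com/custom.yaml"} {
		r := resolveModelArg(arg)
		fmt.Printf("%s -> %s (%s)\n", arg, r.Target, r.Kind)
	}
}
```

Keeping the resolution step separate from downloading and loading makes it easy to test the shorthand table on its own.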

Refactors the downloader code into its own package and creates a new embedded package that embeds model YAML configuration files directly from the source. I've added llava and mistral for now, and I plan to add more popular models with an easy "one click" UX.
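A minimal sketch of what an embedded-config package might look like, assuming it is built on Go's standard `embed` and `io/fs` packages (the PR text does not confirm this). The function names `listModels` and `readModelConfig` and the `models/` layout are hypothetical. `fstest.MapFS` is used as a stand-in for an `embed.FS` so the example runs without any files on disk.

```go
package main

import (
	"fmt"
	"io/fs"
	"path"
	"strings"
	"testing/fstest"
)

// In a real build the file system would likely come from the embed package:
//
//	//go:embed models/*.yaml
//	var embeddedFS embed.FS
//
// fstest.MapFS is used here only so the sketch runs without files on disk.
// The YAML contents are placeholders.
var demoFS fs.FS = fstest.MapFS{
	"models/llava.yaml":   &fstest.MapFile{Data: []byte("name: llava\n")},
	"models/mistral.yaml": &fstest.MapFile{Data: []byte("name: mistral\n")},
}

// listModels returns the names of all embedded configs, without the .yaml suffix.
func listModels(fsys fs.FS) ([]string, error) {
	entries, err := fs.ReadDir(fsys, "models")
	if err != nil {
		return nil, err
	}
	var names []string
	for _, e := range entries {
		if !e.IsDir() && strings.HasSuffix(e.Name(), ".yaml") {
			names = append(names, strings.TrimSuffix(e.Name(), ".yaml"))
		}
	}
	return names, nil
}

// readModelConfig returns the raw YAML for one embedded model.
func readModelConfig(fsys fs.FS, name string) ([]byte, error) {
	return fs.ReadFile(fsys, path.Join("models", name+".yaml"))
}

func main() {
	names, err := listModels(demoFS)
	if err != nil {
		panic(err)
	}
	for _, n := range names {
		data, err := readModelConfig(demoFS, n)
		if err != nil {
			panic(err)
		}
		fmt.Printf("%s: %d bytes of YAML\n", n, len(data))
	}
}
```

Because both helpers accept an `fs.FS`, the same code works with an `embed.FS` in the binary and with an in-memory file system in tests.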

It also retouches some parts of the docs, tangentially related to #1416.

Todo: instead of hardcoding URLs, it would be nicer to embed the files directly at build time. (This is done now.)

Not directly related to the change, this PR also introduces a few CI optimizations:

  • removes some duplicate image building jobs
  • caches gRPC builds across GHA runs

netlify bot commented Jan 1, 2024

Deploy Preview for localai canceled.

| Name | Link |
| --- | --- |
| 🔨 Latest commit | 9f9c5ce |
| 🔍 Latest deploy log | https://app.netlify.com/sites/localai/deploys/65986e164cb01b0008d237ef |

@mudler mudler added the enhancement New feature or request label Jan 3, 2024

| Model | Docker command |
| --- | --- |
| phi-2 | ```docker run -p 8080:8080 -ti --rm quay.io/go-skynet/local-ai:{{< version >}}-cublas-cuda11-core phi-2``` |
Owner Author

Thinking again, it may be better to use backend-prefixed names such as llama-cpp-phi-2, vllm-phi-2, etc.

@mudler mudler force-pushed the docs_update branch 2 times, most recently from 7a4eec1 to 99598a7 Compare January 5, 2024 14:56
@mudler changed the title from "docs: various updates, add popular model examples" to "feat: various updates, add popular model examples" Jan 5, 2024
mudler and others added 4 commits January 5, 2024 18:52
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
@mudler changed the title from "feat: various updates, add popular model examples" to "feat: embedded model configurations, add popular model examples" Jan 5, 2024
@mudler changed the title from "feat: embedded model configurations, add popular model examples" to "feat: embedded model configurations, add popular model examples, refactoring, docs updates" Jan 5, 2024
@mudler changed the title from "feat: embedded model configurations, add popular model examples, refactoring, docs updates" to "feat: embedded model configurations, add popular model examples, refactoring" Jan 5, 2024
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
@mudler mudler merged commit 09e5d90 into master Jan 5, 2024
@mudler mudler deleted the docs_update branch January 5, 2024 22:16
truecharts-admin referenced this pull request in truecharts/public Jan 8, 2024
….1 by renovate (#16987)

This PR contains the following updates:

| Package | Update | Change |
|---|---|---|
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) | patch | `v2.4.0-cublas-cuda11-ffmpeg-core` -> `v2.4.1-cublas-cuda11-ffmpeg-core` |

---

> [!WARNING]
> Some dependencies could not be looked up. Check the Dependency
Dashboard for more information.

---

### Release Notes

<details>
<summary>mudler/LocalAI (docker.io/localai/localai)</summary>

### [`v2.4.1`](https://togithub.com/mudler/LocalAI/releases/tag/v2.4.1)

[Compare
Source](https://togithub.com/mudler/LocalAI/compare/v2.4.0...v2.4.1)

<!-- Release notes generated using configuration in .github/release.yml
at master -->

##### What's Changed

##### Exciting New Features 🎉

- feat: embedded model configurations, add popular model examples,
refactoring by [@mudler](https://togithub.com/mudler) in
[https://github.com/mudler/LocalAI/pull/1532](https://togithub.com/mudler/LocalAI/pull/1532)

##### Other Changes

- ⬆️ Update docs version mudler/LocalAI by
[@localai-bot](https://togithub.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1546](https://togithub.com/mudler/LocalAI/pull/1546)
- ⬆️ Update ggerganov/llama.cpp by
[@localai-bot](https://togithub.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1547](https://togithub.com/mudler/LocalAI/pull/1547)
- docs: improve getting started by
[@mudler](https://togithub.com/mudler) in
[https://github.com/mudler/LocalAI/pull/1553](https://togithub.com/mudler/LocalAI/pull/1553)

**Full Changelog**:
mudler/LocalAI@v2.4.0...v2.4.1

</details>

---

### Configuration

📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone
Europe/Amsterdam, Automerge - At any time (no schedule defined).

🚦 **Automerge**: Enabled.

♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the
rebase/retry checkbox.

🔕 **Ignore**: Close this PR and you won't be reminded about this update
again.

---

- [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check
this box

---

This PR has been generated by [Renovate
Bot](https://togithub.com/renovatebot/renovate).

GabrielBarzen referenced this pull request in GabrielBarzen/charts Feb 2, 2024
….1 by renovate (truecharts#16987)
