A Google Chrome extension that runs an LLM in the extension background service worker to support the Magda web application frontend chatbot.
Thanks to the MLC LLM & WebLLM projects. By leveraging the WebGPU API, this extension enables web applications to run an LLM engine within the web browser.
On request from the web application frontend code, the extension downloads the requested LLM model (around a 2GB download, depending on the model), caches it locally, and runs it in an extension background service worker. The service worker only remains "active" while web pages that require it remain open.
The extension has not been published to Google Chrome Web Store yet.
To install, you need to follow the manual steps below:
- 1> Download a release version from the Releases page of this repo, e.g. `magda-llm-service-worker-extension-v1.0.0.zip`
- 2> Unzip the downloaded zip file
- 3> Open the URL `chrome://extensions` in a Chrome tab to open the Chrome extension management UI.
- 4> Tick the "Developer mode" toggle in the top-right corner.
- 5> Click the "Load unpacked" button in the top-left corner and select the previously downloaded & unzipped folder.
If you want to restrict LLM model access to specified domains, modify the `manifest.json` in the unzipped folder after step 2 above, as shown in the sketch below. Set the `externally_connectable.matches` array to your preferred domain(s), e.g. `"matches": ["https://*.mydomain.com/*"]`. See here for more details.
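For reference, the relevant portion of `manifest.json` would then look roughly like the sketch below; the domain pattern is an illustrative placeholder and should be replaced with your own domain(s):

```json
{
  "externally_connectable": {
    "matches": ["https://*.mydomain.com/*"]
  }
}
```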
- Install Node.js & yarn
- Run `yarn install` at the project root
- Run `yarn build` to build. You can find the built files in the `dist` folder.
The WebGPU API can be accessed from web pages directly without the help of extensions. However, in Google Chrome, when the setting "Clear cookies and site data when you close all windows" is selected, there is only around 300MB of local storage available, which is not sufficient to store a 7B LLM model locally. Moreover, on organisation-managed devices (e.g. company laptops), this setting is often turned on and cannot be changed by the user. See this issue for more details.
As we can request the `unlimitedStorage` permission via this extension, the LLM model can always be downloaded and cached locally without requiring users to change any browser settings.
When you install this extension, it will request the following permissions:
- `"unlimitedStorage"`: allows the extension to save the LLM model locally so that it doesn't need to be re-downloaded after the web page is closed.
  - The cached local data will be removed when you remove this extension from Google Chrome.
- `externally_connectable`: allows web applications to connect to the extension background service worker from a web page via long-lived connections (see the sketch after this list).
  - By default, the extension's `manifest.json` allows connections from any domain. You can modify the `manifest.json` to restrict access for your own project. See the "How to install" section for more details.
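As a rough illustration of the long-lived connection approach, a web page on an allowed domain could open a port to the extension roughly as follows. This is a minimal sketch only: the extension ID, port name, and message payload below are illustrative placeholders, not the actual protocol used by the Magda frontend.

```typescript
// Runs in the web page (not in the extension), on a domain allowed by
// externally_connectable.matches. Requires the Chrome API typings
// (e.g. @types/chrome) to compile.

// Placeholder: substitute the real ID of the installed extension.
const EXTENSION_ID = "abcdefghijklmnopabcdefghijklmnop";

// Open a long-lived connection to the extension's background service worker.
const port = chrome.runtime.connect(EXTENSION_ID, { name: "llm" });

// Listen for messages coming back from the service worker.
port.onMessage.addListener((message) => {
  console.log("Message from extension service worker:", message);
});

// Hypothetical request payload; the actual message format is defined by
// the extension and the Magda frontend code.
port.postMessage({ type: "ping" });
```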
Please find the list of supported LLM models here: https://mlc.ai/models