[WASM] Update prebuilt wasms to version 0.2.39 #123

CharlieFRuan · 2024-05-29T10:02:24Z

Compiled at:

MLC-LLM at mlc-ai/mlc-llm@dc091e7
TVM at https://github.com/CharlieFRuan/tvm/tree/fix-0528-vectorize
- Equivalent to main branch apache/tvm@b598f28

Main changes include:

Grammar (i.e. json mode) support for Llama3
Post process of tokens done with MLC's runtime WASM
New models:
- Phi3-mini-4k
- Hermes-2-Pro-Llama-3-8B
- Qwen1.5-1.8B
- StableLM-2-zephyr_1.6B

…433) This PR updates models to v0.2.39 compiled with mlc-ai/binary-mlc-llm-libs#123 The main change is the new MLC-LLM runtime, which supports grammar (i.e. json mode) for Llama3. - Hence we now read in field `tokenizer_info` (or deprecated `token_table_postproc_method`) from `mlc-chat-config.json` when post processing token table for Grammar - If neither is available, we use the default `byte_fallback` New prebuilt models introduced: - Phi3-mini-4k - Hermes-2-Pro-Llama-3-8B - Qwen1.5-1.8B - StableLM-2-zephyr_1.6B Updates on examples: - json-mode and json-schema now use Llama3 to demonstrate - Function calling inside json-schema now uses `Hermes-2-Pro-Llama-3-8B` instead of `Hermes-2-Pro-Mistral`

### Changes Main changes include: - New prebuilt models: - Phi3-mini - StableLM-2-zephyr-1.6B - Qwen1.5-1.8B - Hermes2-Pro-Llama-3-8B to prebuilt models - Updates on `ModelRecord` fields - For detail see: #435 - Update all WASMs - For detail see: #433 - Update all WASMs to v0.2.39 - Support grammar for Llama3, hence update examples/json-mode to use `Llama3` and `Hermes2-pro-Llama3-8B` for function calling in `examples/json-schema` - Use `loglevel` package: - For details see #427 - Fix `index.js.map` issue for Vite - #420 - Enhance error handling and ServiceWorker ### TVMjs TVMjs compiled at apache/tvm@71f7af7 - Main changes include: - apache/tvm#17031 - apache/tvm#17028 - apache/tvm#17021 ### WASM version - All wasms updated to 0.2.39 via mlc-ai/binary-mlc-llm-libs#123 for new MLC-LLM runtime (mainly grammar)

…lc-ai#433) This PR updates models to v0.2.39 compiled with mlc-ai/binary-mlc-llm-libs#123 The main change is the new MLC-LLM runtime, which supports grammar (i.e. json mode) for Llama3. - Hence we now read in field `tokenizer_info` (or deprecated `token_table_postproc_method`) from `mlc-chat-config.json` when post processing token table for Grammar - If neither is available, we use the default `byte_fallback` New prebuilt models introduced: - Phi3-mini-4k - Hermes-2-Pro-Llama-3-8B - Qwen1.5-1.8B - StableLM-2-zephyr_1.6B Updates on examples: - json-mode and json-schema now use Llama3 to demonstrate - Function calling inside json-schema now uses `Hermes-2-Pro-Llama-3-8B` instead of `Hermes-2-Pro-Mistral`

### Changes Main changes include: - New prebuilt models: - Phi3-mini - StableLM-2-zephyr-1.6B - Qwen1.5-1.8B - Hermes2-Pro-Llama-3-8B to prebuilt models - Updates on `ModelRecord` fields - For detail see: mlc-ai#435 - Update all WASMs - For detail see: mlc-ai#433 - Update all WASMs to v0.2.39 - Support grammar for Llama3, hence update examples/json-mode to use `Llama3` and `Hermes2-pro-Llama3-8B` for function calling in `examples/json-schema` - Use `loglevel` package: - For details see mlc-ai#427 - Fix `index.js.map` issue for Vite - mlc-ai#420 - Enhance error handling and ServiceWorker ### TVMjs TVMjs compiled at apache/tvm@71f7af7 - Main changes include: - apache/tvm#17031 - apache/tvm#17028 - apache/tvm#17021 ### WASM version - All wasms updated to 0.2.39 via mlc-ai/binary-mlc-llm-libs#123 for new MLC-LLM runtime (mainly grammar)

[WASM] Update prebuilt wasms to version 0.2.39

87bde2b

CharlieFRuan force-pushed the pr-wasm-0_2_39 branch from 52452ec to 87bde2b Compare May 29, 2024 10:16

CharlieFRuan marked this pull request as ready for review May 29, 2024 21:32

CharlieFRuan merged commit d87aa9e into mlc-ai:main May 29, 2024

CharlieFRuan mentioned this pull request May 29, 2024

[Models] Add Phi3-mini, StableLM 1.6B, Qwen 1.8B, update MLC runtime mlc-ai/web-llm#433

Merged

CharlieFRuan mentioned this pull request May 30, 2024

[Version] Bump version to 0.2.39, update prebuilt WASMs mlc-ai/web-llm#436

Merged

This was referenced Jun 4, 2024

[WASM] Add Mistral v0.3 and TinyLlama v1.0 #124

Merged

[WASM] Add prebuilt for QWen2 #125

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WASM] Update prebuilt wasms to version 0.2.39 #123

[WASM] Update prebuilt wasms to version 0.2.39 #123

CharlieFRuan commented May 29, 2024 •

edited

Loading

[WASM] Update prebuilt wasms to version 0.2.39 #123

[WASM] Update prebuilt wasms to version 0.2.39 #123

Conversation

CharlieFRuan commented May 29, 2024 • edited Loading

CharlieFRuan commented May 29, 2024 •

edited

Loading