
fix: Support for Non-OpenAI Models in Token Trimming #1605

Merged: 36 commits from tcm-trimTokens into develop on Jan 3, 2025

Conversation

tcm390
Collaborator

@tcm390 tcm390 commented Dec 31, 2024

Related:
#1439
#1565

Needs testing: #1439 (comment)

@tcm390 tcm390 marked this pull request as draft December 31, 2024 12:16
@tcm390 tcm390 changed the title trim tokens Support for Non-OpenAI Models in Token Trimming Dec 31, 2024
@monilpat
Collaborator

THANK YOU for doing this! It has been wrong since the beginning lol so much appreciated! This is an important fix!

@tcm390 tcm390 marked this pull request as ready for review December 31, 2024 20:44
@monilpat monilpat changed the title Support for Non-OpenAI Models in Token Trimming fix: Support for Non-OpenAI Models in Token Trimming Jan 1, 2025
@tcm390
Collaborator Author

tcm390 commented Jan 3, 2025

> tried testing with defaults for the new settings got:
>
> Error processing transcribed text: Error: Unknown model
>     at getEncodingNameForModel (file:///root/test-prs/pr1605/packages/plugin-node/dist/index.js:10984:19)
>     at encodingForModel (file:///root/test-prs/pr1605/packages/plugin-node/dist/index.js:11014:24)
>     at _TokenizationService.truncateTiktoken (file:///root/test-prs/pr1605/packages/plugin-node/dist/index.js:11055:26)
>     at _TokenizationService.trimTokens (file:///root/test-prs/pr1605/packages/plugin-node/dist/index.js:11036:29)
>     at generateMessageResponse (file:///root/test-prs/pr1605/packages/core/dist/index.js:2556:41)
>     at VoiceManager2._generateResponse (file:///root/test-prs/pr1605/packages/client-discord/dist/index.js:3178:32)
>     at VoiceManager2.handleUserMessage (file:///root/test-prs/pr1605/packages/client-discord/dist/index.js:3084:48)
>     at async VoiceManager2.processTranscription (file:///root/test-prs/pr1605/packages/client-discord/dist/index.js:3029:17)
>     at async Timeout._onTimeout (file:///root/test-prs/pr1605/packages/client-discord/dist/index.js:2961:17)
>
> with one of my discord character files:
> "modelProvider": "anthropic",
> this probably needs to work out of the box
> marking as a draft so it doesn't get merged

Yeah, since tiktoken does not support Anthropic models, you need to specify TOKENIZER_MODEL and TOKENIZER_TYPE in the env. However, I think I should fall back to a hardcoded default if they aren't specified or the tokenizer fails. 🤔
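For reference, a minimal .env sketch of these settings; the `tiktoken` value for TOKENIZER_TYPE is an assumption based on the default tokenizer named in this thread, not taken from the PR diff:

```
# Hypothetical .env entries; both variable names come from this PR, the values are illustrative.
TOKENIZER_MODEL=gpt-4o
TOKENIZER_TYPE=tiktoken
```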

> Yes, should have default values.

Updated. Now, if TOKENIZER_MODEL and TOKENIZER_TYPE aren't specified, trimTokens falls back to a hardcoded tiktoken tokenizer with gpt-4-mini as the model.
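To make the fallback concrete, here is a minimal sketch of the behavior described above, not the PR's actual code. It assumes js-tiktoken's encodingForModel/encode/decode API (the same calls visible in the stack trace) and a library version whose model list includes gpt-4o; keeping the tail of the context is also an assumption:

```ts
// Minimal sketch of the env-driven fallback (not the PR's exact code).
import { encodingForModel, type TiktokenModel } from "js-tiktoken";

const DEFAULT_MODEL: TiktokenModel = "gpt-4o"; // hardcoded fallback model

export function trimTokens(context: string, maxTokens: number): string {
    // Prefer the configured model; otherwise use the hardcoded default.
    const configured = process.env.TOKENIZER_MODEL as TiktokenModel | undefined;
    for (const model of [configured, DEFAULT_MODEL]) {
        if (!model) continue;
        try {
            const enc = encodingForModel(model);
            const tokens = enc.encode(context);
            // Keep the most recent tokens; older context is dropped first.
            return tokens.length <= maxTokens
                ? context
                : enc.decode(tokens.slice(-maxTokens));
        } catch {
            // encodingForModel throws "Unknown model" for unsupported names
            // (the error seen above); fall through to the default model.
        }
    }
    return context; // unreachable in practice: DEFAULT_MODEL is always known
}
```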

@shakkernerd
Member

> Updated. Now, if TOKENIZER_MODEL and TOKENIZER_TYPE aren't specified, trimTokens falls back to a hardcoded tiktoken tokenizer with gpt-4-mini as the model.

Great!

@tcm390
Collaborator Author

tcm390 commented Jan 3, 2025

I'm not sure we want to keep it as a service, though. If we can resolve the onnxruntime-node issue, maybe we could move it into core.

monilpat
monilpat previously approved these changes Jan 3, 2025
Collaborator

@monilpat monilpat left a comment


LGTM - thanks for the thoughtful solution. It could use more thorough testing, given how many call sites changed, but it fundamentally looks good. It could, and probably should, be moved into core as well.

We should consider adding another wrapper, trimTokensByModelClass(context, modelClass), which under the hood resolves the model and its maxInputTokens, and using it in generateObject etc. wherever trimming is currently based on model defaults rather than a configured value.
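A hedged sketch of that suggestion; the ModelClass values and getModelSettings are hypothetical stand-ins for however the runtime exposes per-class model config, not the repo's actual API:

```ts
// Hypothetical sketch of the suggested wrapper; not code from this PR.
type ModelClass = "small" | "medium" | "large";

interface ModelSettings {
    name: string;           // provider-specific model name
    maxInputTokens: number; // context budget for this model class
}

// Stand-in for a real lookup against the provider's model table.
declare function getModelSettings(modelClass: ModelClass): ModelSettings;

// Stand-in for the trimTokens discussed in this PR.
declare function trimTokens(context: string, maxTokens: number): string;

// Resolves the model's maxInputTokens from the model class, so call sites
// like generateObject don't trim against hardcoded model defaults.
export function trimTokensByModelClass(
    context: string,
    modelClass: ModelClass
): string {
    const { maxInputTokens } = getModelSettings(modelClass);
    return trimTokens(context, maxInputTokens);
}
```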

Two review threads on packages/plugin-node/src/services/tokenizer.ts (outdated, resolved).
@tcm390
Collaborator Author

tcm390 commented Jan 3, 2025

> It could, and probably should, be moved into core as well.

Updated: moved it to core.

@tcm390
Collaborator Author

tcm390 commented Jan 3, 2025

Test Scenarios:

  1. Unset Variables: Verified behavior when TOKENIZER_MODEL and TOKENIZER_TYPE are not set.
  2. Incorrect Configuration (Both Invalid): Tested with invalid values for both TOKENIZER_MODEL and TOKENIZER_TYPE.
  3. Partially Incorrect Configuration: Tested with an invalid TOKENIZER_MODEL but a valid TOKENIZER_TYPE.
  4. Correct Configuration: Tested with valid values for both TOKENIZER_MODEL and TOKENIZER_TYPE.

Results:
All scenarios functioned as expected:

  1. Unset Variables: The default tiktoken tokenizer was used with "gpt-4o" for token truncation.
  2. Incorrect Configuration (Both Invalid): Fell back to the default tiktoken tokenizer and "gpt-4o" for token truncation.
  3. Partially Incorrect Configuration: Rough truncation was performed using an estimated 4 characters per token (see the sketch after this list).
  4. Correct Configuration: Tokens were truncated successfully with the provided TOKENIZER_MODEL and TOKENIZER_TYPE.
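A minimal sketch of the rough fallback from result 3; only the 4-characters-per-token estimate comes from the test notes above, and keeping the tail of the context is an assumption:

```ts
// Character-based fallback when no usable tokenizer is available.
const CHARS_PER_TOKEN = 4; // rough estimate from the test results above

export function roughTrim(context: string, maxTokens: number): string {
    const maxChars = maxTokens * CHARS_PER_TOKEN;
    // Keep the most recent characters, mirroring token-based trimming.
    return context.length <= maxChars ? context : context.slice(-maxChars);
}
```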

@shakkernerd shakkernerd marked this pull request as ready for review January 3, 2025 14:50
Member

@shakkernerd shakkernerd left a comment


This is well done.
Great work!

@shakkernerd shakkernerd merged commit bf6ef96 into develop Jan 3, 2025
7 checks passed
@shakkernerd shakkernerd deleted the tcm-trimTokens branch January 3, 2025 15:14