fix: use Gemini response metadata for token counting #11226

totsukash · 2024-11-30T05:13:05Z

Summary

Improved token counting for Gemini models by utilizing response metadata. Previously, the system used GPT-2 tokenizer which resulted in inaccurate token counts for both prompt and completion. Now, the system first attempts to get token counts from Gemini's response metadata, falling back to manual calculation only when metadata is unavailable. This change ensures accurate token counting and improves efficiency by leveraging native Gemini functionality.

Tip

Close issue syntax: Fixes #<issue number> or Resolves #<issue number>, see documentation for more details.

Screenshots

Before:	After:
...	...

Checklist

Important

Please review the checklist below before submitting your pull request.

This change requires a documentation update, included: Dify Document
I understand that this PR may be closed in case there was no previous discussion or issues. (This doesn't apply to typos!)
I've added a test for each change that was introduced, and I tried as much as possible to make a single atomic change.
I've updated the documentation accordingly.
I ran dev/reformat(backend) and cd web && npx lint-staged(frontend) to appease the lint gods

fix: use Gemini response metadata for token counting

bbcb4de

dosubot bot added the size:XS This PR changes 0-9 lines, ignoring generated files. label Nov 30, 2024

crazywoola approved these changes Nov 30, 2024

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Nov 30, 2024

crazywoola merged commit 594666e into langgenius:main Nov 30, 2024
5 checks passed

totsukash deleted the fix/gemini-token-count branch November 30, 2024 09:46

totsukash mentioned this pull request Dec 17, 2024

feat: use Gemini response metadata for token counting #11743

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: use Gemini response metadata for token counting #11226

fix: use Gemini response metadata for token counting #11226

totsukash commented Nov 30, 2024 •

edited

Loading

fix: use Gemini response metadata for token counting #11226

fix: use Gemini response metadata for token counting #11226

Conversation

totsukash commented Nov 30, 2024 • edited Loading

Summary

Screenshots

Checklist

totsukash commented Nov 30, 2024 •

edited

Loading