feat: HuggingFaceAPIChatGenerator add token `usage` data #8375

vblagoje · 2024-09-17T08:12:47Z

Why:

Adds token usage metadata to responses from HuggingFaceAPIChatGenerator. usage dictionary response meta field has the following two keys prompt_tokens and completion_tokens matching OpenAI format in token counting.

This feature, i.e. OpenAI token usage format compatibility, aside from chat generators interchangeability benefits, is needed for full support of Langfuse GENERATION token usage renderings in traces. See https://github.com/deepset-ai/haystack-private/issues/82 for more details.

What:

Added a new usage meta field with the keys prompt_tokens and completion_tokens to HuggingFaceAPIChatGenerator.
Modified the streaming and non-streaming response generation code to include the new usage information in the message metadata.

How can it be used:

# Get the usage information from the first reply
usage_info = response["replies"][0].meta["usage"]
prompt_tokens_used = usage_info["prompt_tokens"]
completion_tokens_used = usage_info["completion_tokens"]

print(f"Prompt tokens used: {prompt_tokens_used}, Completion tokens used: {completion_tokens_used}")

How did you test it:

Updated unit tests are included to ensure the presence of the usage meta field and its contained prompt_tokens and completion_tokens keys in the reply messages.
Tests cover non-streaming and streaming scenarios, ensuring compatibility with different API types.

Notes for the reviewer:

After this PR is merged ensure https://github.com/deepset-ai/haystack-private/issues/82 is closed as well
Support for HuggingFaceAPIGenerator hasn't been added intentionally as it is, as other non-chat generators, in the process of deprecation and eventual removal.

coveralls · 2024-09-17T08:24:52Z

Pull Request Test Coverage Report for Build 10957361582

Warning: This coverage report may be inaccurate.

This pull request's base commit is no longer the HEAD commit of its target branch. This means it includes changes from outside the original pull request, including, potentially, unrelated coverage changes.

For more information on this, see Tracking coverage changes with pull request builds.
To avoid this issue with future PRs, see these Recommended CI Configurations.
For a quick fix, rebase this PR at GitHub. Your next report should be accurate.

Details

0 of 0 changed or added relevant lines in 0 files are covered.
11 unchanged lines in 3 files lost coverage.
Overall coverage increased (+0.09%) to 90.358%

Files with Coverage Reduction	New Missed Lines	%
components/classifiers/zero_shot_document_classifier.py	3	91.07%
utils/filters.py	3	96.91%
components/evaluators/llm_evaluator.py	5	95.08%

Totals
Change from base Build 10899485113:	0.09%
Covered Lines:	7338
Relevant Lines:	8121

💛 - Coveralls

vblagoje · 2024-09-17T14:17:56Z

Please don't review this PR unless you are @anakin87

anakin87

👍

Ensure HuggingFaceAPIChatGenerator has token usage data

595db41

github-actions bot added the topic:tests label Sep 17, 2024

vblagoje added the ignore-for-release-notes PRs with this flag won't be included in the release notes. label Sep 17, 2024

vblagoje added 2 commits September 17, 2024 10:40

Merge branch 'main' into hf_api_token_counting

02802ca

Add reno note

1113f5d

vblagoje removed the ignore-for-release-notes PRs with this flag won't be included in the release notes. label Sep 17, 2024

github-actions bot added the type:documentation Improvements on the docs label Sep 17, 2024

vblagoje changed the title ~~WIP: Ensure HuggingFaceAPIChatGenerator has token usage data~~ feat: Add HuggingFaceAPI(Chat)Generator token 'usage' data Sep 17, 2024

vblagoje changed the title ~~feat: Add HuggingFaceAPI(Chat)Generator token 'usage' data~~ feat: Add HuggingFaceAPI(Chat)Generator token usage data Sep 17, 2024

vblagoje requested review from anakin87 and dfokina September 17, 2024 14:10

vblagoje marked this pull request as ready for review September 17, 2024 14:11

vblagoje requested review from a team as code owners September 17, 2024 14:11

vblagoje requested review from shadeMe and removed request for a team September 17, 2024 14:11

vblagoje removed request for a team and shadeMe September 17, 2024 14:18

Fix release note

a3dbf0d

vblagoje changed the title ~~feat: Add HuggingFaceAPI(Chat)Generator token usage data~~ feat: HuggingFaceAPIChatGenerator add token usage data Sep 20, 2024

anakin87 approved these changes Sep 23, 2024

View reviewed changes

vblagoje merged commit 09b9574 into main Sep 23, 2024
18 checks passed

vblagoje deleted the hf_api_token_counting branch September 23, 2024 13:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: HuggingFaceAPIChatGenerator add token `usage` data #8375

feat: HuggingFaceAPIChatGenerator add token `usage` data #8375

vblagoje commented Sep 17, 2024 •

edited

Loading

coveralls commented Sep 17, 2024 •

edited

Loading

vblagoje commented Sep 17, 2024

anakin87 left a comment

feat: HuggingFaceAPIChatGenerator add token usage data #8375

feat: HuggingFaceAPIChatGenerator add token usage data #8375

Conversation

vblagoje commented Sep 17, 2024 • edited Loading

Why:

What:

How can it be used:

How did you test it:

Notes for the reviewer:

coveralls commented Sep 17, 2024 • edited Loading

Pull Request Test Coverage Report for Build 10957361582

Warning: This coverage report may be inaccurate.

Details

💛 - Coveralls

vblagoje commented Sep 17, 2024

anakin87 left a comment

Choose a reason for hiding this comment

feat: HuggingFaceAPIChatGenerator add token `usage` data #8375

feat: HuggingFaceAPIChatGenerator add token `usage` data #8375

vblagoje commented Sep 17, 2024 •

edited

Loading

coveralls commented Sep 17, 2024 •

edited

Loading