
fix: Tokens cannot be obtained from the model dialogue #2326

Merged
merged 1 commit into main on Feb 19, 2025

Conversation

shaohuzhang1 (Contributor)

fix: Tokens cannot be obtained from the model dialogue


f2c-ci-robot bot commented Feb 19, 2025

Adding the "do-not-merge/release-note-label-needed" label because no release-note block was detected; please follow our release note process to remove it.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@@ -121,6 +118,9 @@ def _stream(
                 generation_chunk.text, chunk=generation_chunk, logprobs=logprobs
             )
             is_first_chunk = False
+            # custom code: capture token usage metadata from the streamed chunk
+            if generation_chunk.message.usage_metadata is not None:
+                self.usage_metadata = generation_chunk.message.usage_metadata
             yield generation_chunk

     def _create_chat_result(self,
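For reference, LangChain's usage_metadata is a dict-like structure with input_tokens, output_tokens, and total_tokens, and in streaming it typically arrives on the final chunk. A minimal sketch of how a caller might read the value captured by the added lines once the stream is exhausted (the chat_model variable and the attribute read are illustrative assumptions, not code from this PR):

for chunk in chat_model.stream("Hello"):
    print(chunk.content, end="", flush=True)

# After the generator is exhausted, the custom code above has stored the
# final chunk's usage_metadata on the model instance:
usage = chat_model.usage_metadata
print(usage)  # e.g. {'input_tokens': 5, 'output_tokens': 42, 'total_tokens': 47}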
shaohuzhang1 (Contributor, Author)
The provided code snippet is part of an implementation for handling streaming responses from a chat model, specifically within the _stream method. Here's a breakdown with notes on potential improvements:

Key Issues Identified:

  1. Repeated Stream Option Setting: The stream option (the stream keyword argument) is set in multiple places without checking whether it was already defined.
  2. Redundant Usage Metadata Handling: There is redundant logic around usage metadata retrieval.

Potential Improvements:

  1. Single Stream Option Check:
    Ensure the stream option is set only once, ideally during initialization or just before initiating the stream. This reduces ambiguity and potential bugs.

# Single check for stream option
if not kwargs.get('stream', False):
    kwargs["stream"] = True

  2. Avoid Redundant Usage Metadata Retrieval:
    Remove unnecessary checks and assignments around usage metadata; fetching it more than once risks overwriting previously stored values.

# Custom code removed for clarity

  3. Enhance Error Handling (Optional):
    While not directly addressed in this snippet, consider adding error handling for edge cases such as invalid inputs or timeouts when fetching response chunks; a sketch follows this list.
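A minimal sketch of what such error handling might look like, wrapping the stream in a retry loop (the retry count, backoff, and the choice of TimeoutError are assumptions for illustration, not from this PR):

import time

def stream_with_retry(chat_model, messages, max_retries=2):
    # Hypothetical wrapper: retries the whole stream on a timeout.
    # Note: a retry restarts the stream from the beginning, so callers
    # may see repeated chunks; deduplicate downstream if that matters.
    for attempt in range(max_retries + 1):
        try:
            yield from chat_model.stream(messages)
            return
        except TimeoutError:
            if attempt == max_retries:
                raise
            time.sleep(2 ** attempt)  # simple exponential backoff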

Suggested Changes:

Here’s how you could refactor the function based on these guidelines:

from typing import Any, Iterator, List, Optional, Type

from langchain_core.callbacks import BaseCallbackHandler, CallbackManagerForLLMRun
from langchain_core.messages import AIMessageChunk, BaseMessage, BaseMessageChunk
from langchain_core.outputs import ChatGenerationChunk


def _stream(
    self,
    messages: List[BaseMessage],
    stop: Optional[List[str]] = None,
    callbacks: Optional[List[BaseCallbackHandler]] = None,
    verbose: bool = False,
    use_cache: bool = True,  # assuming there's a need for caching
    llm_backend: Optional[str] = None,
    run_manager: Optional[CallbackManagerForLLMRun] = None,
    **kwargs: Any,
) -> Iterator[ChatGenerationChunk]:
    """
    Set default stream options and initiate the streaming response.
    """
    if llm_backend == "azure":
        # pop() avoids a KeyError when the key is absent
        kwargs.pop("stream", None)

    # Ensure the stream option is set exactly once
    kwargs["stream"] = kwargs.get("stream", False)

    if kwargs["stream"]:
        # Additional setup for streaming can go here
        pass

    payload = self._get_request_payload(messages, stop=stop, use_cache=use_cache, **kwargs)
    default_chunk_class: Type[BaseMessageChunk] = AIMessageChunk
    base_generation_info = {}

    # Rest of the code remains mostly unchanged


# Example usage (assuming `client` is a configured chat model instance;
# stream() is synchronous here, so a plain for loop is used)
for chunk in client.stream("Hello"):
    print(chunk.content, end="")

Conclusion:

By managing the stream option in one place and avoiding redundant operations on usage metadata, we improve both the readability and the robustness of the _stream method while keeping the implementation consistent.


f2c-ci-robot bot commented Feb 19, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@shaohuzhang1 shaohuzhang1 merged commit a06c5c0 into main Feb 19, 2025
4 checks passed
@shaohuzhang1 shaohuzhang1 deleted the pr@main@fix_chat_tokens branch February 19, 2025 04:06