
Logprobs and loglikelihood #72

Open
wants to merge 1 commit into base: main

Conversation

Red-Caesar

Update docs with logprobs and loglikelihood.

Also adds a notebook for running completions with these parameters.

@@ -167,7 +173,8 @@ Parameters
- **content** *(string)*: The actual text content of the chat completion.
- **function_call** _(object or null)_: An optional field that may contain information about a function call made within the message. It's usually `null` in standard responses.
- **delta** *(object or null)*: An optional field that can contain additional metadata about the message, typically `null`.
- **finish_reason** _(string)_: The reason why the message generation was stopped, such as reaching the maximum length (`"length"`).
- **logprobs** _(object)_: An object representing the token, its log probability and the most probable tokens to this one.
Contributor

"the most probable tokens to this one" what does this mean?


@Red-Caesar - Are you saying it returns the other tokens that were most likely to be selected? I.e. if "cat" was chosen, but "dog" and "mouse" were the 2nd and 3rd most likely tokens, those would be included in the output?

Author

Yes, they will be included in the response if we set top_logprobs > 1. For example, suppose we ask: "Create a story about a cat".
The response will be in the following format:

"choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": " In a quaint, cobblestone town,",
        "tool_calls": null
      },
      "logprobs": {
        "content": [
          {
            "token": " In",
            "logprob": -0.7101921439170837,
            "bytes": null,
            "top_logprobs": [
              {
                "token": " In",
                "logprob": -0.7101921439170837,
                "bytes": null
              },
              {
                "token": " Once",
                "logprob": -1.9485827684402466,
                "bytes": null
              }
            ]
          },
          .... other tokens
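For reference, here is a minimal sketch of how such a response can be requested through the OpenAI client. The logprobs and top_logprobs parameters are the standard chat-completions arguments; ENDPOINT and OCTOAI_TOKEN are the same placeholders used in the snippet further down this thread.

import openai

# ENDPOINT and OCTOAI_TOKEN are placeholders for the service URL and API key.
client = openai.OpenAI(base_url=ENDPOINT + "/v1", api_key=OCTOAI_TOKEN)

completion = client.chat.completions.create(
    model="mistral-7b-instruct",
    messages=[{"role": "user", "content": "Create a story about a cat"}],
    logprobs=True,    # return a log probability for each generated token
    top_logprobs=2,   # also return the 2 most likely alternatives per position
)

# Walk the per-token records shown in the JSON above.
for entry in completion.choices[0].logprobs.content:
    alts = ", ".join(f"{alt.token!r}: {alt.logprob:.3f}" for alt in entry.top_logprobs)
    print(f"{entry.token!r} ({entry.logprob:.3f}) -> {alts}")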

@Red-Caesar
Author

I'm a bit concerned about loglikelihood: users going through the OpenAI API may have difficulty sending this parameter correctly, because it has to be sent in a specific format:

import openai

client = openai.OpenAI(
    base_url=ENDPOINT + "/v1",  # ENDPOINT and OCTOAI_TOKEN defined elsewhere
    api_key=OCTOAI_TOKEN,
)

completion = client.chat.completions.create(
    model="mistral-7b-instruct",
    messages=[
        {
            "role": "user",
            "content": "Create a story about a cat",
        }
    ],
    # loglikelihood is not a standard OpenAI parameter, so it must be
    # passed through extra_body to end up in the request JSON.
    extra_body={"loglikelihood": True},
)

So, should we write more about this? If so, I'm not sure where in the documentation it would be best to do so.
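One option would be to show the raw HTTP equivalent next to the SDK snippet, which makes it clear why extra_body is needed: the OpenAI client rejects unknown keyword arguments, so the non-standard field has to be merged into the request JSON. A rough sketch, using the same placeholder ENDPOINT and OCTOAI_TOKEN as above:

import requests

# Same placeholder ENDPOINT and OCTOAI_TOKEN as in the snippet above.
response = requests.post(
    ENDPOINT + "/v1/chat/completions",
    headers={"Authorization": f"Bearer {OCTOAI_TOKEN}"},
    json={
        "model": "mistral-7b-instruct",
        "messages": [{"role": "user", "content": "Create a story about a cat"}],
        "loglikelihood": True,  # non-standard field, sent as a top-level key
    },
)
print(response.json())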

@Red-Caesar
Author

@BenHamm @devonbrown50
