[Bug]: Python SDK introduces double-quotes #2209

jbaron · 2024-12-02T10:32:30Z

Language

Python

Version

latest

Description

When using the Python teams-ai library, the input messages receive additional double-quotes. This doesn't happen with the initial input, but it does happen with subsequent inputs.

The action is still correct and has just the plain string, but somewhere in the process of converting the action into messages it goes wrong.

So for example, when I enter great in Teams, I see '"great"' back in the message history (so it has additional double-quotes).

This makes the LLM also use additional quotes in the response or format snippets badly.

When debugging the JavaScript version, this behavior was NOT observed.

Reproduction Steps

1. Enter first prompt (1 + 1) and wait for result
2. Enter second prompt (great)
3. During debugging observe that the message history has additional double-quotes around great.
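
For illustration, the symptom in step 3 matches what happens when a plain string is passed through a JSON encoder. This is a minimal sketch using only the standard library; it assumes the history entry is JSON-encoded somewhere along the way, and it does not call teams-ai itself.

```python
import json

# Hypothetical second prompt from the reproduction steps.
user_input = "great"

# JSON-encoding a plain string wraps it in double-quotes.
encoded = json.dumps(user_input, ensure_ascii=False)

print(repr(user_input))
print(repr(encoded))
```

The encoded value is two characters longer than the input, which is the extra pair of quotes seen in the message history.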

jbaron · 2024-12-02T15:13:18Z

The issue seems to be in the to_string function that is used: it doesn't check for plain strings and, as a result, applies JSON or YAML encoding to them.

def to_string(tokenizer: Tokenizer, value: Any, as_json: bool = False) -> str:
    """
    Converts a value to a string representation.
    Dates are converted to ISO strings and Objects are converted to JSON or YAML,
    whichever is shorter.

    Args:
        tokenizer (Tokenizer): The tokenizer object used for encoding.
        value (Any): The value to be converted.
        as_json (bool, optional): Flag indicating whether to return the value as JSON string.
        Defaults to False.

    Returns:
        str: The string representation of the value.
    """
    if value is None:
        return ""

    if hasattr(value, "isoformat") and callable(value.isoformat):
        # Used when the value is a datetime object
        return value.isoformat()
    value = todict(value)

    if as_json:
        return json.dumps(value, default=lambda o: o.__dict__, ensure_ascii=False)

    # Return shorter version of object
    yaml_str = yaml.dump(value, allow_unicode=True)
    json_str = json.dumps(value, default=lambda o: o.__dict__, ensure_ascii=False)
    if len(tokenizer.encode(yaml_str)) < len(tokenizer.encode(json_str)):
        return yaml_str

    return json_str

If the value is already a string, it should be returned as-is and not JSON- or YAML-encoded.
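
A minimal sketch of that idea, not the library's actual fix: the hypothetical to_string_sketch below adds an isinstance check before any encoding. To stay self-contained it leaves out the tokenizer, the todict call, and the YAML/JSON length comparison from the function above, and always falls back to JSON.

```python
import json
from datetime import datetime
from typing import Any


def to_string_sketch(value: Any) -> str:
    """Simplified, hypothetical variant of to_string with a plain-string guard."""
    if value is None:
        return ""

    # Proposed guard: plain strings are returned unchanged, so no quotes are added.
    if isinstance(value, str):
        return value

    if hasattr(value, "isoformat") and callable(value.isoformat):
        # Used when the value is a datetime object
        return value.isoformat()

    # Assumption: JSON only; the original also compares against a YAML encoding.
    return json.dumps(value, default=lambda o: o.__dict__, ensure_ascii=False)


plain = to_string_sketch("great")
obj = to_string_sketch({"a": 1})
date = to_string_sketch(datetime(2024, 12, 2))
print(repr(plain), repr(obj), repr(date))
```

With the guard in place, the second prompt comes back as great without quotes, while dicts and dates are still converted as before.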

Nivedipa-MSFT · 2024-12-02T16:50:35Z

@jbaron - Thank you for your inquiry about your Teams app development issue! We will check and update you soon.

sayali-MSFT · 2024-12-03T07:32:18Z

Hello @jbaron, thank you for your patience! We have reported this as a bug for further investigation.

We will keep you updated as soon as we receive further information. Thank you for bringing this to our attention!

corinagum · 2024-12-09T21:56:22Z

Investigating

jbaron added the bug label Dec 2, 2024

corinagum added the Python label Dec 9, 2024

corinagum self-assigned this Dec 9, 2024
