fuzzy match gives the wrong answer in eval #139

Open
cheng-tan opened this issue May 21, 2024 · 1 comment

@cheng-tan

For task 361, our agent gave this answer:

Order number 170 is Canceled, order number 189 is Pending

The evaluator uses fuzzy match and scored the answer as wrong:

        "eval_types": [
            "string_match"
        ],
        "reference_answers": {
            "fuzzy_match": [
                "170: cancelled",
                "189: pending"
            ]
        },
        "reference_url": "",
        "program_html": [],
        "string_note": "",
        "reference_answer_raw_annotation": "170: cancelled, 189: pending"
    },
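For reference, the prediction differs from the reference answers only in phrasing and US/UK spelling ("Order number 170 is Canceled" vs "170: cancelled"), which is exactly the kind of variation a fuzzy matcher is supposed to tolerate. The toy normalization below is my own illustration, not the harness's code, and it only makes that point concrete:

```python
import re

def normalize(s: str) -> str:
    # Toy normalization, not the harness's logic: lowercase, unify the
    # US/UK spelling of "cancel(l)ed", and keep only word/number tokens.
    s = s.lower().replace("canceled", "cancelled")
    return " ".join(re.findall(r"[a-z0-9]+", s))

pred = "Order number 170 is Canceled, order number 189 is Pending"
refs = ["170: cancelled", "189: pending"]

# After normalization, every token of each reference answer appears in
# the prediction, so the answer plausibly "matches" both references.
ok = all(
    all(tok in normalize(pred).split() for tok in normalize(r).split())
    for r in refs
)
```

Under this crude token check the answer covers both reference items, which suggests the failure lies in how the evaluator invokes the matcher rather than in the answer itself.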
@minghchen

In my opinion, line 165 of 'StringEvaluator' in evaluation_harness.evaluator_router should be revised to:

assert isinstance(value, list)
score *= self.fuzzy_match(
    ref=" ".join(value), pred=pred, intent=intent
)

The original code compares each individual item in the 'fuzzy_match' list with the prediction, but the prediction should be compared with the list as a whole.
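To make the two strategies concrete, here is a minimal sketch. The function names and the containment-based matcher are my own stand-ins, not the harness's actual code (the real fuzzy matcher is more sophisticated); the sketch only contrasts per-item scoring with scoring the joined list:

```python
from typing import Callable, List

# (ref, pred) -> score in [0.0, 1.0]; stand-in for the harness's matcher.
Matcher = Callable[[str, str], float]

def score_per_item(refs: List[str], pred: str, matcher: Matcher) -> float:
    # Behavior described in the issue: each reference item is matched
    # against the prediction on its own, and the scores are multiplied,
    # so a single strict per-item mismatch zeroes the whole task.
    score = 1.0
    for value in refs:
        score *= matcher(value, pred)
    return score

def score_joined(refs: List[str], pred: str, matcher: Matcher) -> float:
    # Proposed revision: join the reference items and ask the matcher a
    # single question about the complete answer.
    assert isinstance(refs, list)
    return matcher(" ".join(refs), pred)

# Toy matcher: case-insensitive substring containment.
contains: Matcher = lambda ref, pred: 1.0 if ref.lower() in pred.lower() else 0.0
```

Joining first means the matcher judges the prediction against the full expected answer in one call, instead of requiring every individual item to pass on its own.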
