Skip to content
This repository has been archived by the owner on Nov 3, 2023. It is now read-only.

[WizInt, K2R] Additional knowledge-related eval metrics on WizInt #4193

Merged
merged 2 commits into from
Nov 17, 2021

Conversation

mojtaba-komeili
Copy link
Contributor

Patch description
Added a couple of extra metrics to WizInt to evaluate the generated response's F1 with the best possible match in the documents and sentences:

  • knowledge_f1_max_retrieved_sentences: max F1 between response and any of the retrieved sentences.
  • knowledge_f1_max_retrieved_docs: max F1 between response and any of the retrieved docs.

Right now we only compare to selected docs and sentences.

Testing steps
Checking if they appear during eval.

Screen Shot 2021-11-17 at 3 42 27 PM

@mojtaba-komeili mojtaba-komeili merged commit 4779018 into main Nov 17, 2021
@mojtaba-komeili mojtaba-komeili deleted the max-f1 branch November 17, 2021 22:05
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants