added evaluation custom criteria #30

oindrillac · 2024-01-22T16:22:19Z

This PR expands the evaluation metrics with custom LangChain evaluation criteria, which use an LLM in the backend to score model-generated outputs against both custom and pre-defined criteria.

In this iteration, for demonstration purposes, we add criteria such as grammatical correctness, helpfulness and descriptiveness.

Based on the actual requirements, these criteria can be updated.
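
To illustrate the idea, here is a minimal, dependency-free sketch of criteria-based scoring. It is not the code in this PR and not LangChain's implementation (there, a criteria evaluator is typically obtained via `load_evaluator("criteria", criteria=...)`). The criteria descriptions, `build_prompt`, `score_output`, and `stub_judge` are all hypothetical names and wording chosen for illustration; `stub_judge` stands in for the real LLM call.

```python
from typing import Callable, Dict

# Hypothetical descriptions: the PR names these criteria but not their wording.
CUSTOM_CRITERIA: Dict[str, str] = {
    "grammatical_correctness": "Is the output free of grammatical errors?",
    "helpfulness": "Is the output helpful to a reader of the API documentation?",
    "descriptiveness": "Does the output explain the code in sufficient detail?",
}


def build_prompt(criterion: str, description: str, prediction: str) -> str:
    """Format one yes/no grading question for the judge model."""
    return (
        f"Criterion: {criterion}: {description}\n"
        f"Submission:\n{prediction}\n"
        "Answer Y if the submission meets the criterion, otherwise N."
    )


def score_output(
    prediction: str,
    judge: Callable[[str], str],
    criteria: Dict[str, str] = CUSTOM_CRITERIA,
) -> Dict[str, int]:
    """Ask the judge about each criterion and map its Y/N verdict to 1/0."""
    scores: Dict[str, int] = {}
    for name, description in criteria.items():
        verdict = judge(build_prompt(name, description, prediction))
        scores[name] = 1 if verdict.strip().upper().startswith("Y") else 0
    return scores


def stub_judge(prompt: str) -> str:
    """Stand-in for an LLM call: fails only the descriptiveness criterion."""
    return "N" if prompt.startswith("Criterion: descriptiveness") else "Y"


if __name__ == "__main__":
    print(score_output("Returns the list of users.", stub_judge))
```

Updating the criteria then amounts to editing the dictionary; the scoring loop stays unchanged.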

Discussions on evaluation metrics can be found here: https://docs.google.com/document/d/1I2ea71M6E8SJBYe9k1Gzw2lfshppikwf4ndKtlEClNk/edit?usp=sharing

More information on LangChain custom evaluation criteria can be found here: https://python.langchain.com/docs/guides/evaluation/string/criteria_eval_chain

oindrillac · 2024-01-22T16:22:36Z

Updated App: https://ui-api-doc-gen.apps.platform-sts.pcbk.p1.openshiftapps.com/

aakankshaduggal

/lgtm

added custom criteria

4826914

oindrillac requested a review from aakankshaduggal January 22, 2024 16:22

aakankshaduggal approved these changes Jan 25, 2024

aakankshaduggal merged commit 6d05625 into redhat-et:main Jan 25, 2024
0 of 2 checks passed
