Update binarization to be individual params #40168

nagkumar91 · 2025-03-20T22:11:12Z

Description

Please add an informative description that covers that changes made by the pull request and link all relevant issues.

If an SDK is being regenerated based on a new swagger spec, a link to the pull request containing these swagger spec changes has been included above.

All SDK Contribution checklist:

The pull request does not introduce [breaking changes]
CHANGELOG is updated for new features, bug fixes or other significant changes.
I have read the contribution guidelines.

General Guidelines and Best Practices

Title of the pull request is clear and informative.
There are a small number of commits, each of which have an informative message. This means that previously merged commits do not appear in the history of the PR. For more information on cleaning up the commits in your PR, see this page.

Testing Guidelines

Pull request includes test coverage for the included changes.

remove required keys

Pull Request Overview

This PR refactors evaluators to accept individual threshold parameters instead of using a single dictionary-based threshold, improving parameter clarity and type safety.

Updated QAEvaluator to have separate parameters for groundedness, relevance, coherence, fluency, similarity, and f1_score thresholds.
Updated ContentSafetyEvaluator to accept individual thresholds for violence, sexual content, self-harm, and hate/unfairness evaluations.
Updated RougeScoreEvaluator, sample usage, and tests to use individual threshold parameters.

Reviewed Changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_qa/_qa.py	Replaced dictionary-based thresholds with individual parameters and updated type checking.
sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_content_safety/_content_safety.py	Modified threshold parameter to individual thresholds with type checking for int.
sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_rouge/_rouge.py	Updated threshold parameters to individual floats with corresponding type checks.
sdk/evaluation/azure-ai-evaluation/samples/evaluation_samples_threshold.py	Updated usage examples for QAEvaluator and RougeScoreEvaluator to reflect individual thresholds.
sdk/evaluation/azure-ai-evaluation/tests/unittests/test_evaluators/test_threshold_behavior.py	Updated tests to use individual threshold parameters instead of dictionaries.

Comments suppressed due to low confidence (1)

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_qa/_qa.py:82

Consider enforcing a stricter type check for 'f1_score_threshold' (e.g., ensure it is a float) rather than accepting an int, to be consistent with its documented type.

for name, value in [

azure-sdk · 2025-03-20T22:30:47Z

API change check

APIView has identified API level changes in this PR and created following API reviews.

azure-ai-evaluation

nagkumar91 and others added 30 commits October 1, 2024 14:51

Update task_query_response.prompty

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

7de6367

remove required keys

Update task_simulate.prompty

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

Loading
Loading status checks…

f288b34

Update task_query_response.prompty

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

Loading
Loading status checks…

2a4b6f7

Update task_simulate.prompty

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

Loading
Loading status checks…

c8ce251

Merge branch 'Azure:main' into main

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

Loading
Loading status checks…

4522ae4

Merge branch 'Azure:main' into main

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

Loading
Loading status checks…

32e9c1d

Merge branch 'Azure:main' into main

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

Loading
Loading status checks…

76df69d

Merge branch 'Azure:main' into main

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

Loading
Loading status checks…

aeddcb4

Merge branch 'Azure:main' into main

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

Loading
Loading status checks…

65a759c

Fix the api_key needed

Loading
Loading status checks…

e4cdd30

Merge branch 'Azure:main' into main

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

e3ab026

Merge branch 'Azure:main' into main

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

4fb09c4

Merge branch 'Azure:main' into main

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

e71a52d

Merge branch 'Azure:main' into main

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

87166b3

Update for release

Loading
Loading status checks…

b478651

Black fix for file

Loading
Loading status checks…

8e5a264

Merge branch 'Azure:main' into main

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

2077d6d

Merge branch 'Azure:main' into main

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

3ab59c8

Add original text in global context

3a80606

Update test

Loading
Loading status checks…

6768f9a

Update the indirect attack simulator

f7cc4bb

Black suggested fixes

07eb466

Update simulator prompty

Loading
Loading status checks…

942bfd5

Merge branch 'main' into main

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

Loading
Loading status checks…

2d4c376

Update adversarial scenario enum to exclude XPIA

Loading
Loading status checks…

98cad97

Update changelog

d510316

Black fixes

Loading
Loading status checks…

742943e

Remove duplicate import

Loading
Loading status checks…

12e0615

Fix the mypy error

Loading
Loading status checks…

de32b50

Mypy please be happy

Loading
Loading status checks…

4b64132

nagkumar91 and others added 21 commits January 28, 2025 11:56

Merge branch 'Azure:main' into main

Loading
Loading status checks…

c37b6c5

Merge branch 'Azure:main' into main

246ab9b

Merge branch 'Azure:main' into main

4767587

Merge branch 'Azure:main' into main

f7e6089

Merge branch 'Azure:main' into main

5b45900

Merge branch 'Azure:main' into main

b394fe2

Merge branch 'Azure:main' into main

54602fe

Merge branch 'Azure:main' into main

ff36631

Merge branch 'Azure:main' into main

f3e1850

Merge branch 'Azure:main' into main

16173c3

Merge branch 'Azure:main' into main

Loading
Loading status checks…

f856210

Merge branch 'Azure:main' into main

602a2e1

Merge branch 'Azure:main' into main

747c0db

Merge branch 'Azure:main' into main

7741608

Merge branch 'Azure:main' into main

5e36ddf

Merge branch 'Azure:main' into main

648d45b

Merge branch 'Azure:main' into main

b37ba2a

Merge branch 'Azure:main' into main

3782341

Merge branch 'Azure:main' into main

35682be

Merge branch 'Azure:main' into main

c8dd420

Update the threshold to be individual parameters

Loading
Loading status checks…

d225e2c

Copilot bot review requested due to automatic review settings March 20, 2025 22:11

nagkumar91 requested a review from a team as a code owner March 20, 2025 22:11

github-actions bot added the Evaluation label Mar 20, 2025

Copilot AI reviewed Mar 20, 2025

View reviewed changes

Remove higher is better as a public ref

Loading
Loading status checks…

d20e915

w-javed approved these changes Mar 24, 2025

View reviewed changes

w-javed merged commit 8d3bb37 into Azure:main Mar 24, 2025
19 checks passed

nagkumar91 deleted the task/update_binarization branch March 25, 2025 01:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update binarization to be individual params #40168

Update binarization to be individual params #40168

nagkumar91 commented Mar 20, 2025 •

edited

Loading

azure-sdk commented Mar 20, 2025

Update binarization to be individual params #40168

Update binarization to be individual params #40168

Conversation

nagkumar91 commented Mar 20, 2025 • edited Loading

Description

All SDK Contribution checklist:

General Guidelines and Best Practices

Testing Guidelines

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

azure-sdk commented Mar 20, 2025

nagkumar91 commented Mar 20, 2025 •

edited

Loading