Fix/multiclass recall macro avg ignore index #2710

rittik9 · 2024-09-01T20:46:31Z

What does this PR do?

Fixes #2441

Details

Was this discussed/agreed via a Github issue? (no need for typos and docs improvements)
Did you read the contributor guideline, Pull Request section?
Did you make sure to update the docs?
Did you write any new necessary tests?

Did you have fun?

Yes

Issue:

The root of the problem seems to be that the ignore_index information is not being properly propagated to the final averaging step i.e. the _adjust_weights_safe_divide function doesn't know that which class should be ignored.

Solution:

To address this issue, I updated the code to ensure that the ignore_index information is preserved throughout the entire process, making sure it is correctly passed through all intermediate steps up to the final averaging stage i.e. _adjust_weights_safe_divide function .
Updated the _adjust_weights_safe_divide function to accept an additional ignore_index parameter, which is passed through the _precision_recall_reduce function, called in the compute method of the MulticlassRecall class. This change adjusts the weights in the _adjust_weights_safe_divide function, setting the weight of the ignored class to 0.

📚 Documentation preview 📚: https://torchmetrics--2710.org.readthedocs.build/en/2710/

Borda

looks good, can we add also test for this case...

rittik9 · 2024-09-02T22:17:10Z

looks good, can we add also test for this case...

Sure

pending on adding test

rittik9 · 2024-09-06T14:56:36Z

@Borda What do I have to modify?

Borda · 2024-09-11T12:56:46Z

@rittik9 mind checking the changed docstest values and whether it is correct?

…_index is specified

…ghtning-AI#2441)

src/torchmetrics/functional/classification/accuracy.py

src/torchmetrics/utilities/compute.py

src/torchmetrics/functional/classification/precision_recall.py

Borda · 2024-10-31T21:10:05Z

tests/unittests/classification/test_precision_recall.py

@@ -661,6 +661,37 @@ def test_corner_case():
    assert res == 1.0


+def test_multiclass_recall_ignore_index():


Seems we are already testing various ignore_index with reference metric so if we had it wrong this did not pass already... it is possible that we also have a bug in the reference metric?
cc: @SkafteNicki

looking to the code and the ignore index is already applied in _multilabel_stat_scores_format which reduces the preds/target size the same way as the reference metric so calling it with null weights in fact ignores additional index

The problem is we are using sklearn's recall_score as a reference for our unittests. So even if in _reference_sklearn_precision_recall_multiclass() function we are using remove_ignore_index function for removing those predictions whose real values are ignore_index class before passing it to recall_score function, it does not matter. Because whenever average='macro' sklearn's recall_score will always return mean cosidering the total no. of classes (as we are passing all the classes in recall_score() function's labels argument). That is the reason why unittests failed in the first place. I think we need to fix the unittests to take care of ignore_index using sklearn's recall_score() function's labels argument. I've prepared a notebook for explanation. cc:@Borda.

DimitrisMantas · 2024-11-04T22:55:36Z

Just to chime in, I think this issue is present in pretty much all metrics that make use of _adjust_weights_safe_divide.

I see this PR fixes, some of them, but others, such as JaccardIndex are left as is.

… wrong answer when ignore_index is specified

CHANGELOG.md

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>

rittik9 requested review from SkafteNicki, Borda, justusschock and stancld as code owners September 1, 2024 20:46

github-actions bot added the topic: Classif label Sep 1, 2024

Borda reviewed Sep 2, 2024

View reviewed changes

rittik9 marked this pull request as draft September 2, 2024 19:12

rittik9 marked this pull request as ready for review September 2, 2024 22:15

rittik9 requested a review from Borda September 4, 2024 11:26

Borda previously approved these changes Sep 4, 2024

View reviewed changes

Borda self-requested a review September 6, 2024 09:10

Borda added the bug / fix Something isn't working label Sep 6, 2024

Borda approved these changes Sep 9, 2024

View reviewed changes

mergify bot added has conflicts and removed has conflicts labels Sep 9, 2024

justusschock approved these changes Sep 10, 2024

View reviewed changes

mergify bot added has conflicts and removed has conflicts labels Sep 10, 2024

Borda force-pushed the master branch from 96ceda0 to f12e7af Compare September 11, 2024 15:10

github-actions bot added topic: Audio topic: Nominal labels Sep 11, 2024

rittik9 and others added 4 commits September 11, 2024 17:18

Fix: Corrected MulticlassRecall macro average calculation when ignore…

176711d

…_index is specified

style: format code to comply with pre-commit hooks

df36d0f

test: Add test for MulticlassRecall with ignore_index+macro (fixes Li…

0773bab

…ghtning-AI#2441)

chlog

78177ac

Borda force-pushed the master branch from 04d8193 to 78177ac Compare September 11, 2024 15:19

mergify bot added has conflicts and removed has conflicts labels Oct 23, 2024

Borda requested a review from baskrahmer October 25, 2024 07:52

mergify bot added has conflicts and removed has conflicts labels Oct 29, 2024

Borda enabled auto-merge (squash) October 31, 2024 11:58

Borda reviewed Oct 31, 2024

View reviewed changes

src/torchmetrics/functional/classification/accuracy.py Outdated Show resolved Hide resolved

Borda reviewed Oct 31, 2024

View reviewed changes

src/torchmetrics/utilities/compute.py Outdated Show resolved Hide resolved

Borda reviewed Oct 31, 2024

View reviewed changes

src/torchmetrics/functional/classification/precision_recall.py Outdated Show resolved Hide resolved

Borda mentioned this pull request Oct 31, 2024

fix macro when ignore_index is set #2163

Draft

4 tasks

Borda requested changes Oct 31, 2024

View reviewed changes

rittik9 marked this pull request as draft November 1, 2024 20:31

auto-merge was automatically disabled November 1, 2024 20:31
Pull request was converted to draft

rittik9 marked this pull request as ready for review November 2, 2024 00:05

rittik9 marked this pull request as draft November 2, 2024 00:16

rittik9 marked this pull request as ready for review November 2, 2024 00:47

rittik9 marked this pull request as draft November 2, 2024 01:22

fix:Reference Metric in multiclass pecision recall unittests provides…

259c4bd

… wrong answer when ignore_index is specified

rittik9 force-pushed the master branch from aa7fb52 to 259c4bd Compare November 24, 2024 11:15

rittik9 added 3 commits November 24, 2024 19:57

refactor: compute.py

447031e

Merge branch 'master' into master

42e395e

modify _reference_sklearn_precision_recall_multiclass

58c0070

Borda reviewed Nov 25, 2024

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

rittik9 and others added 6 commits November 25, 2024 20:47

Update CHANGELOG.md

7b1a09f

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>

Merge branch 'master' into master

f200da4

Merge branch 'master' into master

70b91c6

Pass down ignore_index

bf1c29f

Set weights only for the classes axis

930fba3

Update precision_recall.py

7e95696

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix/multiclass recall macro avg ignore index #2710

Fix/multiclass recall macro avg ignore index #2710

rittik9 commented Sep 1, 2024 •

edited

Loading

Borda left a comment

rittik9 commented Sep 2, 2024

rittik9 commented Sep 6, 2024

Borda commented Sep 11, 2024

Borda Oct 31, 2024

Borda Oct 31, 2024

rittik9 Nov 3, 2024 •

edited

Loading

DimitrisMantas commented Nov 4, 2024

		@@ -661,6 +661,37 @@ def test_corner_case():
		assert res == 1.0


		def test_multiclass_recall_ignore_index():

Fix/multiclass recall macro avg ignore index #2710

Are you sure you want to change the base?

Fix/multiclass recall macro avg ignore index #2710

Conversation

rittik9 commented Sep 1, 2024 • edited Loading

What does this PR do?

Did you have fun?

Issue:

Solution:

Borda left a comment

Choose a reason for hiding this comment

rittik9 commented Sep 2, 2024

rittik9 commented Sep 6, 2024

Borda commented Sep 11, 2024

Borda Oct 31, 2024

Choose a reason for hiding this comment

Borda Oct 31, 2024

Choose a reason for hiding this comment

rittik9 Nov 3, 2024 • edited Loading

Choose a reason for hiding this comment

DimitrisMantas commented Nov 4, 2024

rittik9 commented Sep 1, 2024 •

edited

Loading

rittik9 Nov 3, 2024 •

edited

Loading