Binary group fairness metrics #1404

Merged: 102 commits into Lightning-AI:master on Mar 4, 2023

Conversation

@AndresAlgaba (Contributor) commented Dec 21, 2022

What does this PR do?

This PR initiates the addition of observational group fairness metrics. The general idea of these metrics is to compare the model's outputs across different groups created by the protected attribute(s) under evaluation.

As a first step, I implemented two common group fairness metrics, demographic parity and equal opportunity, for binary classification problems. For demographic parity, we compare the positivity rates and, for equal opportunity, the true positive rates. In the case of more than two groups, we use the largest disparity, i.e., the lowest rate divided by the highest.

In this initial proposal, I build on the stat_scores and add some logic to compute the positivity and true positive rates for groups. There are still several issues, but I believe that this proposal allows for a clearer discussion. For example, I'm unsure whether the _binary_groups_stat_scores should be part of stat_scores or group_fairness. Overall, I tried to follow the style and logic of the other classification metrics as closely as possible.
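
As a rough illustration of the idea (not the PR's final API; the function names below are made up and assume hard binary predictions and a single integer-encoded protected attribute), the per-group rates and their ratios could be computed like this:

    import torch

    def _binary_group_stats(preds: torch.Tensor, target: torch.Tensor, groups: torch.Tensor):
        """Confusion-matrix counts (tp, fp, tn, fn) per group of the protected attribute."""
        stats = []
        for g in groups.unique():
            p, t = preds[groups == g], target[groups == g]
            tp = ((p == 1) & (t == 1)).sum()
            fp = ((p == 1) & (t == 0)).sum()
            tn = ((p == 0) & (t == 0)).sum()
            fn = ((p == 0) & (t == 1)).sum()
            stats.append((tp, fp, tn, fn))
        return stats

    def demographic_parity_ratio(preds, target, groups):
        # Positivity rate per group: (tp + fp) / n, then the lowest rate divided by the highest.
        stats = _binary_group_stats(preds, target, groups)
        rates = torch.stack([(tp + fp) / (tp + fp + tn + fn) for tp, fp, tn, fn in stats])
        return rates.min() / rates.max()

    def equal_opportunity_ratio(preds, target, groups):
        # True positive rate per group: tp / (tp + fn), then the lowest rate divided by the highest.
        stats = _binary_group_stats(preds, target, groups)
        rates = torch.stack([tp / (tp + fn) for tp, fp, tn, fn in stats])
        return rates.min() / rates.max()

For example, preds = torch.tensor([1, 0, 1, 1]), target = torch.tensor([1, 0, 0, 1]), groups = torch.tensor([0, 0, 1, 1]) gives per-group positivity rates of 0.5 and 1.0, hence a demographic parity ratio of 0.5.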

I will also add documentation and appropriate testing at a later stage, once the big decisions on the design of the API for group fairness metrics are settled. There are also several potential extensions, such as including additional classification metrics for detecting discrimination and adding similar logic to regression-based metrics.

I leave it as a draft for now. @Borda

Before submitting

  • Was this discussed/approved via a GitHub issue? (not needed for typo fixes and docs improvements)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure to update the docs?
  • Did you write any new necessary tests?

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

@Borda added this to the v0.12 milestone Dec 21, 2022
@Borda (Member) commented Dec 23, 2022

Hi @stancld or @lucadiliello, could you please assist here with this PR?

@lucadiliello (Contributor) left a comment

Hello, thanks for your contribution! I added some suggestions to improve code speed. I solved a similar problem when grouping metrics by indexes, so if you need help, feel free to ask!

@codecov (bot) commented Dec 23, 2022

Codecov Report

Merging #1404 (d0f7b17) into master (7821012) will increase coverage by 0%.
The diff coverage is 72%.

Additional details and impacted files
@@           Coverage Diff           @@
##           master   #1404    +/-   ##
=======================================
  Coverage      88%     88%            
=======================================
  Files         223     225     +2     
  Lines       11708   11860   +152     
=======================================
+ Hits        10289   10466   +177     
+ Misses       1419    1394    -25     

@AndresAlgaba (Contributor, Author):

I have added documentation.

@stancld (Contributor) commented Feb 28, 2023

@Borda @justusschock Do you have any clue why our tests are failing with some old configurations? 🤔

@Borda (Member) commented Feb 28, 2023

@Borda @justusschock Do you have any clue why our tests are failing with some old configurations? 🤔

I think I remember this one; you are passing None somewhere instead of the expected int:

TypeError: '<' not supported between instances of 'int' and 'NoneType'

I would suggest recreating this env and debugging locally... to do so, you can use:
https://github.com/Lightning-AI/metrics/blob/50388cfb167ba323fcc407a89d83f2aec0dfb171/.github/actions/pull-caches/action.yml#L33
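
For reference, that traceback is what Python raises when an int is compared against a value that is still None; a tiny, hypothetical reproduction (the variable name is made up, not the actual call site in this PR):

    num_groups = None  # an optional argument that was never resolved to an int

    try:
        print(2 < num_groups)
    except TypeError as err:
        print(err)  # '<' not supported between instances of 'int' and 'NoneType'

    # The usual fix is an explicit guard before comparing.
    if num_groups is not None and 2 < num_groups:
        print("more than two groups")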

@mergify bot added ready and removed ready labels Mar 1, 2023
auto-merge was automatically disabled March 1, 2023 23:51

Merge queue setting changed

@stancld (Contributor) commented Mar 2, 2023

@Borda @justusschock Do you have any clue why our tests are failing with some old configurations? 🤔

I think I remember this one; you are passing None somewhere instead of the expected int:

TypeError: '<' not supported between instances of 'int' and 'NoneType'

I would suggest recreating this env and debugging locally... to do so, you can use:

https://github.com/Lightning-AI/metrics/blob/50388cfb167ba323fcc407a89d83f2aec0dfb171/.github/actions/pull-caches/action.yml#L33

Cool, thanks for the advice. I've set up the docker and will try to figure out what's going on there.

@AndresAlgaba (Contributor, Author):

@Borda @justusschock Do you have any clue why our tests are failing with some old configurations? 🤔

I think I remember this one; you are passing None somewhere instead of the expected int:

TypeError: '<' not supported between instances of 'int' and 'NoneType'

I would suggest recreating this env and debugging locally... to do so, you can use:
https://github.com/Lightning-AI/metrics/blob/50388cfb167ba323fcc407a89d83f2aec0dfb171/.github/actions/pull-caches/action.yml#L33

Cool, thanks for the advice. I've set up the docker and will try to figure out what's going on there.

Hey @stancld, I'm currently doing the same :D Let me know if I can be of any help! It indeed seems to be some interplay with the fairlearn library.

@stancld (Contributor) commented Mar 2, 2023

@Borda fairlearn declares support for Python 3.8 and newer. What about skipping tests for python==3.7 then? It fails within the library regardless of the pandas/numpy version installed with Python 3.7.

setuptools.setup(
    name=fairlearn.__name__,
    version=fairlearn.__version__,
    author="Miroslav Dudik, Richard Edgar, Adrin Jalali, Roman Lutz, Michael Madaio, Hilde Weerts",
    author_email="fairlearn-internal@python.org",
    description="A Python package to assess and improve fairness of machine learning models.",
    long_description=long_description,
    long_description_content_type="text/markdown",
    url="https://github.com/fairlearn/fairlearn",
    packages=setuptools.find_packages(),
    python_requires=">=3.8",
    install_requires=install_requires,
    extras_require=extras_require,
    classifiers=[
        "Programming Language :: Python :: 3.8",
        "Programming Language :: Python :: 3.9",
        "Programming Language :: Python :: 3.10",
        "License :: OSI Approved :: MIT License",
        "Operating System :: OS Independent",
        "Development Status :: 3 - Alpha",
    ],
    include_package_data=True,
    zip_safe=False,
)

Referenced fairlearn commits: https://github.com/fairlearn/fairlearn/commit/4f8dddad9fe24a914db4ffd3a2f699e3248c46c5 and https://github.com/fairlearn/fairlearn/commit/1cbf29a45da00c5eb0ee68c5cc9e53314d229821

cc: @AndresAlgaba

(Btw, security support for Python 3.7 ends in 3 months, so in the future we can try to replace 3.7 support with the newer 3.11 in general.)
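
For illustration, a version gate along these lines could be expressed as a module-level pytest marker; this is only a sketch under the assumption that the fairlearn-based reference tests live in one module, and the names are illustrative rather than the PR's actual test code:

    import sys

    import pytest

    # Skip the whole test module on Python 3.7, since fairlearn declares python_requires>=3.8.
    pytestmark = pytest.mark.skipif(
        sys.version_info < (3, 8), reason="fairlearn requires Python >= 3.8"
    )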

@mergify bot added the ready label Mar 3, 2023
@stancld (Contributor) commented Mar 3, 2023

Hi @AndresAlgaba, would you please check why the make-docs job is failing? 🙏 Otherwise, everything looks good now :]

@stancld (Contributor) commented Mar 3, 2023

@AndresAlgaba -- Sorry, I actually found that one link for InfoLM seems to be broken.

@Borda enabled auto-merge (squash) March 3, 2023 21:55
@Borda merged commit 7c885d0 into Lightning-AI:master Mar 4, 2023
@Borda (Member) commented Apr 18, 2023

@AndresAlgaba thank you, and apologies that it took so long...

@AndresAlgaba (Contributor, Author):

@Borda, my pleasure! And thanks to the entire team for all the help :D
