Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AU-GOOD for target distribution #33

Closed
RaulFD-creator opened this issue Sep 23, 2024 · 0 comments
Closed

AU-GOOD for target distribution #33

RaulFD-creator opened this issue Sep 23, 2024 · 0 comments
Assignees
Labels
enhancement New feature or request

Comments

@RaulFD-creator
Copy link
Collaborator

Motivation
Final step of the pipeline relies on given a set of results expressed as a dictionary of {threshold: metrics_dict}, a target value to calculate the AU-GOOD with, a target dataset, and the set of similarity metrics used to calculate the partitions. It then calculates the similarity between the original data and the target dataset.

Possible implementation

  1. Calculate similarities between query and target distributions, using already implemented functions.
  2. Create a histogram with the same min and max value as the partitions, and with the same step for the number of bins.
  3. Normalise the histogram (counts / counts.sum())
  4. To get the AU-GOOD, perform dot product between normalise counts and values. This is equivalent to sum(a*b), which is the finite form of the integral AU-GOOD integral.

Alternatives
4b. Calculate a*b and sum(a*b), separately so that a user can have access to the GOOD curve to represent if they so desire.

@RaulFD-creator RaulFD-creator added the enhancement New feature or request label Sep 23, 2024
@RaulFD-creator RaulFD-creator self-assigned this Sep 23, 2024
RaulFD-creator added a commit that referenced this issue Nov 11, 2024
🛠️ Code-dev: Implemented #33. AU-GOOD for arbitrary target distribution
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant