
[metrics] AUROC Metric can't handle 0 observations of a class with multiclass classifier #348

Closed
BeyondTheProof opened this issue Jul 1, 2021 · 15 comments · Fixed by #376
Labels
bug / fix Something isn't working waiting on author
Comments

@BeyondTheProof
Contributor

I'm attempting to calculate AUROC for a multiclass problem where some classes are very rare and occasionally never seen, and I'm getting the following error: raise ValueError("No positive samples in targets, true positive value should be meaningless")

In the case of 0 observations, I feel average='weighted' should work, since the contribution of an unseen class to the final AUROC should be 0 regardless. One can also think of scenarios with a very high number of classes, some of which will happen to not appear in a given dataset.
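The intuition above can be sketched with plain numbers (hypothetical per-class scores and supports, not torchmetrics output):

```python
# Hypothetical per-class AUROC scores and class supports (label counts).
# Class 2 is never observed, so its AUROC is undefined and its weight is 0.
class_auroc = [0.90, 0.75, None]  # None: undefined for the unseen class
supports = [40, 10, 0]

total = sum(supports)
# "weighted" averaging weights each class score by its support, so an
# unseen class (support 0) can simply be skipped without changing the result.
weighted = sum((s / total) * a for s, a in zip(supports, class_auroc) if s > 0)
print(weighted)  # ≈ 0.87
```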

Originally posted by @BeyondTheProof in Lightning-AI/pytorch-lightning#2210 (comment)

@BeyondTheProof BeyondTheProof changed the title [metrics] AUROC Metric cant handle 0 observations of a class with multiclass classifier [metrics] AUROC Metric can't handle 0 observations of a class with multiclass classifier Jul 1, 2021
@BeyondTheProof
Contributor Author

BeyondTheProof commented Jul 1, 2021

I've found a hack for this by subclassing AUROC:

class FixedAUC(torchmetrics.AUROC):
    def update(self, preds: torch.Tensor, target: torch.Tensor):
        num_classes = preds.shape[1]
        # True for classes that actually occur at least once in this batch's target
        zero_obs_mask = torch.tensor([(target == c).sum() > 0 for c in range(num_classes)])
        # drop the columns of classes with zero observations
        preds = preds[:, zero_obs_mask]
        target = target[:, zero_obs_mask]
        self.num_classes = zero_obs_mask.sum().int()
        super().update(torch.softmax(preds, dim=-1).data, target.argmax(dim=-1).int().data)

I will be making a cleaner fix in torchmetrics and submitting a PR.

@SkafteNicki SkafteNicki transferred this issue from Lightning-AI/pytorch-lightning Jul 2, 2021
@github-actions

github-actions bot commented Jul 2, 2021

Hi! Thanks for your contribution, great first issue!

@SkafteNicki
Member

Hi @BeyondTheProof, I transferred the issue to the torchmetrics repo.
Can I kindly ask if you are calling the forward method or the update method of the metric when trying to update the metric states?

@BeyondTheProof
Contributor Author

Hi @SkafteNicki, thank you for the response and transferring the issue! I am just calling the update method.

@SkafteNicki
Member

Could you provide an example of how you are using the metric?

@Borda Borda added bug / fix Something isn't working waiting on author labels Jul 7, 2021
@BeyondTheProof
Contributor Author

In my subclassed lightning module, I have metrics that are calculated at the end of each epoch:

self.auroc = torchmetrics.AUROC(num_classes=15, average="weighted", compute_on_step=False)
self.accuracy = pl.metrics.Accuracy(num_classes=15, average="weighted", compute_on_step=False)
self.metrics = [self.auroc, self.accuracy]

When I calculate all my losses, I also calculate some other metrics (if there are any bugs, it is only because I simplified the code as much as possible here):

for metric, name in zip(self.metrics, ["AUROC", "ACC"]):
  self.log(
      f"{stage}_{name}",
      metric,
      on_step=False,
      on_epoch=True,
      prog_bar=False,
      logger=True,
  )

I hope this helps. If there's anything else you need, please let me know!

@BeyondTheProof
Contributor Author

BeyondTheProof commented Jul 9, 2021

I've found a hack for this by subclassing AUROC:

class FixedAUC(torchmetrics.AUROC):
    def update(self, preds: torch.Tensor, target: torch.Tensor):
        num_classes = preds.shape[1]
        # True for classes that actually occur at least once in this batch's target
        zero_obs_mask = torch.tensor([(target == c).sum() > 0 for c in range(num_classes)])
        # drop the columns of classes with zero observations
        preds = preds[:, zero_obs_mask]
        target = target[:, zero_obs_mask]
        self.num_classes = zero_obs_mask.sum().int()
        super().update(torch.softmax(preds, dim=-1).data, target.argmax(dim=-1).int().data)

I will be making a cleaner fix in torchmetrics and submitting a PR.

@SkafteNicki In this subclass, an error arises when some batches contain only two classes and others contain more. I implemented a much cleaner version by subclassing Metric directly:

from typing import Optional

import torch
from torchmetrics import Metric
from torchmetrics.functional.classification.auroc import _auroc_compute
from torchmetrics.utilities.data import dim_zero_cat


class WeightedAUROC(Metric):
    """
    This is used for when the target is not strictly one of K classes, but a probability
    distribution over all K classes
    """

    def __init__(
        self,
        num_classes: Optional[int] = None,
        pos_label: Optional[int] = None,
        compute_on_step: bool = False,
        dist_sync_on_step: bool = False,
    ) -> None:
        super().__init__(
            compute_on_step=compute_on_step,
            dist_sync_on_step=dist_sync_on_step,
            process_group=process_group,
        )
        self.num_classes = num_classes
        if self.num_classes > 2:
            self.mode = "multiclass"
        else:
            self.mode = "binary"
        self.pos_label = pos_label
        self.average = "weighted"

        self.add_state("preds", default=[], dist_reduce_fx="cat")
        self.add_state("target", default=[], dist_reduce_fx="cat")

    def update(self, preds: torch.Tensor, target: torch.Tensor):
        assert target.ndim == 2, f"WeightedAUROC expects a 2D target tensor, got {target.ndim} dims"
        self.preds.append(torch.softmax(preds, dim=-1).data)
        self.target.append(target.data)

    def compute(self):
        preds = dim_zero_cat(self.preds)
        target = dim_zero_cat(self.target)
        zero_obs_mask = torch.tensor([(target == c).sum() > 0 for c in range(self.num_classes)])
        preds = preds[:, zero_obs_mask]
        target = target[:, zero_obs_mask].int()
        num_classes = zero_obs_mask.sum().int()
        return _auroc_compute(preds, target, self.mode, num_classes=num_classes, average=self.average)
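The masking step in compute() can be illustrated without torchmetrics (a pure-Python sketch using lists of lists in place of tensors, mirroring zero_obs_mask above):

```python
# One-hot style target of shape (obs, classes); class 2 never occurs.
target = [
    [1, 0, 0],
    [0, 1, 0],
    [1, 0, 0],
]
num_classes = 3

# Keep column c only if class c appears at least once in the target.
observed = [any(row[c] > 0 for row in target) for c in range(num_classes)]
masked = [[row[c] for c in range(num_classes) if observed[c]] for row in target]

print(observed)        # [True, True, False]
print(len(masked[0]))  # 2 effective classes remain
```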

@KimDaeUng

@BeyondTheProof I think it would be better to add the line process_group: Optional[Any] = None, since it is passed to super().__init__() but never defined:



from typing import Any, Optional

import torch
from torchmetrics import Metric
from torchmetrics.functional.classification.auroc import _auroc_compute
from torchmetrics.utilities.data import dim_zero_cat


class WeightedAUROC(Metric):
    """
    This is used for when the target is not strictly one of K classes, but a probability
    distribution over all K classes
    """

    def __init__(
        self,
        num_classes: Optional[int] = None,
        pos_label: Optional[int] = None,
        compute_on_step: bool = False,
        dist_sync_on_step: bool = False,
        process_group: Optional[Any] = None, # It seems that this line is missing.
    ) -> None:
        super().__init__(
            compute_on_step=compute_on_step,
            dist_sync_on_step=dist_sync_on_step,
            process_group=process_group,
        )
        self.num_classes = num_classes
        if self.num_classes > 2:
            self.mode = "multiclass"
        else:
            self.mode = "binary"
        self.pos_label = pos_label
        self.average = "weighted"

        self.add_state("preds", default=[], dist_reduce_fx="cat")
        self.add_state("target", default=[], dist_reduce_fx="cat")

    def update(self, preds: torch.Tensor, target: torch.Tensor):
        assert target.ndim == 2, f"WeightedAUROC expects a 2D target tensor, got {target.ndim} dims"
        self.preds.append(torch.softmax(preds, dim=-1).data)
        self.target.append(target.data)

    def compute(self):
        preds = dim_zero_cat(self.preds)
        target = dim_zero_cat(self.target)
        zero_obs_mask = torch.tensor([(target == c).sum() > 0 for c in range(self.num_classes)])
        preds = preds[:, zero_obs_mask]
        target = target[:, zero_obs_mask].int()
        num_classes = zero_obs_mask.sum().int()
        return _auroc_compute(preds, target, self.mode, num_classes=num_classes, average=self.average)

@SkafteNicki
Member

Hi @BeyondTheProof, thanks for getting back to me. It does indeed seem to be a problem, even though I would consider it a very rare corner case. Would you be up for sending a fix to our current AUROC implementation?
IMO the only thing missing from your implementation is that the user should be warned when a class is removed.
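A minimal sketch of such a warning (a hypothetical helper, not part of torchmetrics), taking the observed-class mask computed in compute():

```python
import warnings


def warn_on_dropped_classes(observed, num_classes):
    """Warn if any class had no positive samples and was excluded."""
    dropped = [c for c in range(num_classes) if not observed[c]]
    if dropped:
        warnings.warn(
            f"Classes {dropped} had 0 observations in the target and were "
            "excluded from the weighted AUROC computation."
        )

# e.g. warn_on_dropped_classes([True, True, False], 3) warns about class 2
```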

@BeyondTheProof
Contributor Author

Hi @SkafteNicki, good point about the user warning. Also, in the implementation above, the target only allows 2D tensors, i.e. an (obs, classes) binary matrix. I will implement a solution for that as well.

@BeyondTheProof
Contributor Author

@BeyondTheProof I think it would be better to add the line process_group: Optional[Any] = None,.

Thanks for the catch @KimDaeUng, you're totally right!

@BeyondTheProof
Contributor Author

@SkafteNicki Submitted a PR here: #376

Thanks!

@BeyondTheProof
Contributor Author

Hi @SkafteNicki, just following up on this. I have an approval from Borda, but still need one more :)

@maximsch2
Contributor

The tests are failing in the PR though?

@BeyondTheProof
Contributor Author

BeyondTheProof commented Jul 21, 2021

Ah, sorry, I didn't catch that. Will fix. Thank you!

@Borda Borda added this to the v0.5 milestone Aug 18, 2021