add KeyPoint feature for prototype datasets #5326

Open
pmeier wants to merge 9 commits into base: main
Conversation

@pmeier pmeier commented Feb 1, 2022

We currently have two different APIs for keypoints:

  • draw_keypoints requires the keypoints to be of shape (num_instances, num_keypoints_per_instance, 2), where the last channel contains the x and y coordinates.
  • KeypointRCNN models such as keypointrcnn_resnet50_fpn require the keypoints to be of shape (num_instances, num_keypoints_per_instance, 3), where the last channel contains the x and y coordinates and the visibility.

We currently also have two datasets that provide keypoints: COCO and CelebA. Of those, COCO provides a visibility flag while CelebA doesn't. Skimming through other datasets, it seems that visibility is not a regular part of the annotations. Thus, for the KeyPoint feature that this PR adds, I went with only the x and y coordinates.
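
For illustration, here is a minimal sketch (not the implementation in this PR) of what a KeyPoint feature holding only the x and y coordinates could look like. It assumes a torch.Tensor subclass pattern similar to the other prototype features; the constructor signature and the descriptions metadata are assumptions for this sketch, not the actual API:

import torch


class KeyPoint(torch.Tensor):
    # assumed metadata: one human-readable description per keypoint, e.g. ("left_eye", "right_eye", ...)
    descriptions: tuple

    def __new__(cls, data, *, descriptions=()):
        # keypoints are stored as (..., num_keypoints, 2) with x and y in the last channel
        tensor = torch.as_tensor(data, dtype=torch.float)
        if tensor.shape[-1] != 2:
            raise ValueError("expected the last dimension to be 2 (x and y)")
        keypoints = tensor.as_subclass(cls)
        keypoints.descriptions = tuple(descriptions)
        return keypoints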

Below you can find example implementations that call both APIs for COCO and CelebA:

COCO

import torch
from torchvision.models.detection import keypointrcnn_resnet50_fpn
from torchvision.prototype import datasets
from torchvision.utils import draw_keypoints
from torchvision.transforms.functional import to_pil_image

name = "coco"

dataset = datasets.load(name, annotations="person_keypoints")
sample = next(iter(dataset))

annotated_image = draw_keypoints(
    sample["image"],
    sample["keypoints"],
    radius=int(2e-2 * min(sample["image"].shape[-2:])),
    colors="red",
)
to_pil_image(annotated_image).save(f"{name}.jpg")


model = keypointrcnn_resnet50_fpn(
    num_classes=len(datasets.info(name).categories),
    num_keypoints=sample["keypoints"].shape[-2],
)

# this conversion will happen in a transform later
image = sample["image"].to(torch.float).div(255.0)

target = sample

# the model requires the bounding boxes as a 2d tensor in XYXY format at the 'boxes' key
target["boxes"] = target["bounding_boxes"].convert("xyxy")

# keypointrcnn_resnet50_fpn expects keypoints of shape (num_instances, num_keypoints, 3)
# with the visibility flag in the last channel, so append it to the x and y coordinates
keypoints_without_visibility = target["keypoints"]
keypoints_visibility = target["visibility"].unsqueeze(-1)
target["keypoints"] = torch.cat((keypoints_without_visibility, keypoints_visibility), dim=-1).to(torch.float)

# images and targets need to be lists of images and annotation dictionaries
loss = model([image], [target])

(attached image: coco.jpg)

Example for CelebA

import torch
from torchvision.models.detection import keypointrcnn_resnet50_fpn
from torchvision.prototype import datasets
from torchvision.utils import draw_keypoints
from torchvision.transforms.functional import to_pil_image

name = "celeba"

dataset = datasets.load(name)
sample = next(iter(dataset))

annotated_image = draw_keypoints(
    sample["image"],
    sample["keypoints"].unsqueeze(0),
    radius=int(2e-2 * min(sample["image"].shape[-2:])),
    colors="red",
)
to_pil_image(annotated_image).save(f"{name}.jpg")


model = keypointrcnn_resnet50_fpn(
    num_classes=len(datasets.info(name).categories),
    num_keypoints=sample["keypoints"].shape[-2],
)

# this conversion will happen in a transform later
image = sample["image"].to(torch.float).div(255.0)

target = sample

# the model requires the label as 1d tensor at the 'labels' key
target["labels"] = target["identity"].unsqueeze(0)
# the model requires the bounding boxes as a 2d tensor in XYXY format at the 'boxes' key
target["boxes"] = target["bbox"].convert("xyxy").unsqueeze(0)

keypoints_without_visibility = target["keypoints"]
# CelebA has no visibility annotation; visibility == 2 denotes visible in COCO annotations,
# so mark all keypoints as visible here
keypoints_visibility = torch.full((*target["keypoints"].shape[:-1], 1), 2)
target["keypoints"] = torch.cat((keypoints_without_visibility, keypoints_visibility), dim=-1).to(torch.float)

# images and targets need to be lists of images and annotation dictionaries
loss = model([image], [target])

(attached image: celeba.jpg)

@facebook-github-bot

facebook-github-bot commented Feb 1, 2022

💊 CI failures summary and remediations

As of commit 4bab7c3 (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚



@pmeier changed the title from "add keypoints annotations to COCO prototype dataset" to "add KeyPoint feature for prototype datasets" on Feb 1, 2022
@pmeier requested a review from fmassa on February 1, 2022, 12:48
@fmassa fmassa left a comment


Thanks for the PR!

I've made some comments, let me know what you think

torchvision/prototype/transforms/_geometry.py (3 outdated review threads, resolved)
torchvision/prototype/features/_keypoint.py (outdated review thread, resolved)
@@ -129,6 +159,7 @@ def _decode_captions_ann(self, anns: List[Dict[str, Any]], image_meta: Dict[str,
     _ANN_DECODERS = OrderedDict(
         [
             ("instances", _decode_instances_anns),
+            ("person_keypoints", _decode_person_keypoints_anns),
Member commented:

This is unrelated to this PR, but I would revisit whether we want a dataset to have a varying return type depending on the arguments passed to it.
It might be better for different flavors of the dataset to be different dataset classes.

pmeier (Collaborator, Author) replied:

Yeah, that is a design choice that we should address. cc @NicolasHug

@pmeier pmeier commented Feb 3, 2022

In response to #5326 (comment) I've added symmetry metadata to the KeyPoint feature. With this, each dataset can specify which symmetries exist between the keypoints.

Here is an example with HorizontalFlip:

from torchvision.prototype import datasets, transforms
from torchvision.utils import draw_keypoints

from torchvision.transforms.functional import to_pil_image


def draw_non_right_keypoints(sample, file):
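    # keep only the keypoints whose description does not start with "right" and draw them;
    # this makes it easy to see in the before/after images that the flip swapped left and right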
    image = sample["image"]
    keypoints = sample["keypoints"]
    non_right_keypoints = keypoints[[not descr.startswith("right") for descr in keypoints.descriptions], :]

    annotated_image = draw_keypoints(
        image,
        non_right_keypoints.unsqueeze(0),
        radius=int(2e-2 * min(sample["image"].shape[-2:])),
        colors="red",
    )
    to_pil_image(annotated_image).save(file)


dataset = datasets.load("celeba")
sample = next(iter(dataset))

draw_non_right_keypoints(sample, "before.jpg")

transform = transforms.HorizontalFlip()
transformed_sample = transform(sample)

draw_non_right_keypoints(transformed_sample, "after.jpg")

(attached images: before.jpg, after.jpg)
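
For illustration, a minimal sketch of how a horizontal flip kernel could use such symmetry metadata. It assumes the symmetries have already been resolved to pairs of keypoint indices that mirror each other; this is an assumption for the sketch, not the kernel added in this PR:

import torch


def horizontal_flip_keypoints(keypoints: torch.Tensor, image_width: int, symmetries):
    # keypoints: (..., num_keypoints, 2) with x and y in the last channel
    flipped = keypoints.clone()
    # mirror the x coordinate across the image
    flipped[..., 0] = image_width - 1 - flipped[..., 0]
    # swap each left/right pair so the keypoint labels stay consistent after the flip
    for left, right in symmetries:
        flipped[..., [left, right], :] = flipped[..., [right, left], :]
    return flipped

With CelebA-style descriptions, the index pairs could be derived from the left_*/right_* names, which is the information the symmetries built by the dataset are meant to encode.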

@fmassa fmassa left a comment


The proposal for the symmetry LGTM, thanks!

I only have one more comment regarding the resize transform, otherwise good to merge


 class Resize(Transform):
-    NO_OP_FEATURE_TYPES = {Label}
+    NO_OP_FEATURE_TYPES = {Label, Keypoint}
Member commented:

OK if we leave this as a TODO, but we should rescale the keypoint coordinates by the same factor as the image.

pmeier (Collaborator, Author) replied:

Yes, in general all geometric transforms need to support KeyPoints. We can probably share a lot of functionality with the bounding box kernels.
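
For reference, a minimal sketch of what a keypoint resize kernel could look like, scaling the coordinates the same way the bounding box kernels scale box coordinates; the function name and signature are assumptions for this sketch, not existing torchvision kernels:

import torch


def resize_keypoints(keypoints: torch.Tensor, old_size, new_size) -> torch.Tensor:
    # keypoints: (..., num_keypoints, 2) with x and y in the last channel
    old_height, old_width = old_size
    new_height, new_width = new_size
    # scale x by the width ratio and y by the height ratio; broadcasts over the last channel
    scale = torch.tensor([new_width / old_width, new_height / old_height])
    return keypoints * scale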

torchvision/prototype/datasets/_builtin/celeba.py (outdated review thread, resolved)
@datumbox datumbox left a comment


Thanks, I have some questions below

("vertical", description, description.replace("left", "right"))
for description in descriptions
if description.startswith("left")
]

Could you clarify the type of the resulting symmetries variable?

Below, the Keypoint expects Sequence[Tuple[KeypointSymmetry, Tuple[int, int]]]. It's not obvious to me how the current code produces that specific type.

torchvision/prototype/datasets/_builtin/celeba.py (outdated review thread, resolved)
torchvision/prototype/features/_keypoint.py (2 outdated review threads, resolved)
@pmeier pmeier commented Feb 7, 2022

As discussed offline with @datumbox, we'll put this PR on hold until we have full support for bounding boxes and segmentation masks.
