Fix BeitFeatureExtractor postprocessing #19119

alaradirik · 2022-09-20T09:01:05Z

What does this PR do?

Fixes a BeitFeatureExtractor.post_process_semantic_segmentation() assertion error when no target_sizes argument is provided
Ensures post_process_semantic_segmentation returns a list of int64 PyTorch tensors
Adds a test to ensure correct post-processing

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
[X ] Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
[X ] Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
[X ] Did you write any new necessary tests?

NielsRogge · 2022-09-20T09:53:51Z

src/transformers/models/beit/feature_extraction_beit.py

+                resized = self.resize(image=semantic_segmentation[idx], size=target_sizes[idx])
+                resized_maps.append(resized)
+
+            semantic_segmentation = [torch.Tensor(np.array(image)).to(torch.int64) for image in resized_maps]


Suggested change

semantic_segmentation = [torch.Tensor(np.array(image)).to(torch.int64) for image in resized_maps]

semantic_segmentation = [torch.Tensor(np.array(image)).long() for image in resized_maps]

This should also work :)

NielsRogge · 2022-09-20T12:25:31Z

src/transformers/models/beit/feature_extraction_beit.py

+                None, predictions will not be resized.
+        Returns:
+            semantic_segmentation: `List[torch.Tensor]` of length `batch_size`, where each item is a semantic
+            segmentation map of shape (w, h) corresponding to the target_sizes entry (if `target_sizes` is specified).


To me it's a bit counterintuitive to output (w, h) if we ask the target_sizes to be (h, w)

=> so I'd use the same format for both

Agreed, just fixed this

NielsRogge · 2022-09-20T13:24:05Z

src/transformers/models/beit/feature_extraction_beit.py

+            semantic_segmentation = semantic_segmentation.numpy()
+
+            for idx in range(len(semantic_segmentation)):
+                resized = self.resize(image=semantic_segmentation[idx], size=target_sizes[idx])


As discussed offline, please use nn.functional.interpolate here.

resized = nn.functional.interpolate(semantic_segmentation[idx], size=target_sizes[idx], mode='bilinear', align_corners=False)

HuggingFaceDocBuilderDev · 2022-09-20T15:19:18Z

The documentation is not available anymore as the PR was closed or merged.

NielsRogge

LGTM, although it will require an update when supporting the TF model.

LysandreJik · 2022-09-21T13:53:17Z

Hey @alaradirik, please also ping a core maintainer for review before merging PRs.

* return post-processed segmentations as list, add test * use torch to resize logits * fix assertion error if no target_size is specified

alaradirik added 6 commits September 17, 2022 17:08

add post_process_semantic_segmentation method

29ebf2b

update docs

68e3d05

fix test errors

6dc836b

fix formatting

c1d79b4

fix formatting

c840b4f

return post-processed segmentations as list, add test

bbf3cee

alaradirik requested a review from NielsRogge September 20, 2022 09:01

alaradirik changed the title ~~Beit postprocessing~~ Fix BeitFeatureExtractor postprocessing Sep 20, 2022

NielsRogge reviewed Sep 20, 2022

View reviewed changes

minor changes

85e4e91

NielsRogge reviewed Sep 20, 2022

View reviewed changes

alaradirik and others added 4 commits September 20, 2022 16:55

use torch to resize logits

9d0059d

fix conflict

71e0f23

push updates

711cc7c

Merge branch 'main' into beit-postprocessing

9fc71f6

NielsRogge approved these changes Sep 20, 2022

View reviewed changes

alaradirik merged commit 36b9a99 into huggingface:main Sep 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix BeitFeatureExtractor postprocessing #19119

Fix BeitFeatureExtractor postprocessing #19119

alaradirik commented Sep 20, 2022

NielsRogge Sep 20, 2022

NielsRogge Sep 20, 2022

alaradirik Sep 20, 2022

NielsRogge Sep 20, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Sep 20, 2022 •

edited

Loading

NielsRogge left a comment

LysandreJik commented Sep 21, 2022

	semantic_segmentation = [torch.Tensor(np.array(image)).to(torch.int64) for image in resized_maps]
	semantic_segmentation = [torch.Tensor(np.array(image)).long() for image in resized_maps]

Fix BeitFeatureExtractor postprocessing #19119

Fix BeitFeatureExtractor postprocessing #19119

Conversation

alaradirik commented Sep 20, 2022

What does this PR do?

Before submitting

NielsRogge Sep 20, 2022

Choose a reason for hiding this comment

NielsRogge Sep 20, 2022

Choose a reason for hiding this comment

alaradirik Sep 20, 2022

Choose a reason for hiding this comment

NielsRogge Sep 20, 2022 • edited Loading

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Sep 20, 2022 • edited Loading

NielsRogge left a comment

Choose a reason for hiding this comment

LysandreJik commented Sep 21, 2022

NielsRogge Sep 20, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Sep 20, 2022 •

edited

Loading