
Correct and clarify the handling of empty/zero-length Docs during training and inference #365

Merged — 7 commits into explosion:master, Jan 30, 2023

Conversation

shadeMe (Collaborator) commented Jan 25, 2023

Description

This PR addresses two issues:

  • When zero-length inputs are passed to the transformer pipe, the outputs yielded by an attached listener have the wrong shape in the representation width dimension. This is more a question of correctness than a practical concern, given the lack of tokens in such Docs, but we explicitly clarify why we use zero-width outputs.
  • Backprop callbacks and gradients were not correctly aligned with their corresponding model outputs (outputs from the transformer pipe) when empty documents occurred in the middle of a training batch. The existing tests for this corner case were inadvertently passing due to the behaviour of zip: the test always added the empty document to the end of the batch, and the subsequent zip operation implicitly skipped the last backprop/gradient because the latter list was shorter than the list of model outputs.
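A minimal sketch of why the width dimension matters even for a Doc with no tokens (this is an illustration with an assumed width of 768, not the actual trfs2arrays.py code): an array of shape (0, width) still composes correctly with outputs for non-empty Docs, whereas a shape of (0, 0) would not.

```python
import numpy as np

# Assumed representation width for illustration only.
width = 768

# An empty Doc yields no rows, but the width dimension remains meaningful:
empty_output = np.zeros((0, width), dtype="float32")
assert empty_output.shape == (0, width)

# Stacking with a non-empty Doc's output only works when widths agree:
other_output = np.ones((5, width), dtype="float32")
combined = np.vstack([empty_output, other_output])
assert combined.shape == (5, width)
```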

It also adds comments about the pre-conditions to clarify the circumstances under which zero-length inputs are passed to the model.
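The zip pitfall mentioned above can be reproduced in isolation (a hypothetical minimal example; these lists merely stand in for the transformer pipe's outputs and backprop callbacks):

```python
# Three model outputs, but only two backprop callbacks — e.g. the callback
# for an empty Doc at the end of the batch is missing.
outputs = ["out_doc1", "out_doc2", "out_doc3"]
backprops = ["bp_doc1", "bp_doc2"]

# zip silently stops at the shorter iterable, so the test passes even though
# the lists are misaligned: "out_doc3" is dropped without any error.
pairs = list(zip(outputs, backprops))
assert len(pairs) == 2

# An explicit length check (or zip(..., strict=True) on Python >= 3.10)
# surfaces the mismatch instead of hiding it:
assert len(outputs) != len(backprops)
```

This is why a test that appends the empty document at the end of the batch cannot catch the misalignment, while one that places it in the middle can.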

Types of change

Bug Fix

Checklist

  • I confirm that I have the right to submit this contribution under the project's MIT license.
  • I ran the tests, and all new and existing tests passed.
  • My changes don't require a change to the documentation, or if they do, I've added all required information.

@shadeMe shadeMe marked this pull request as draft January 25, 2023 17:31
@shadeMe shadeMe added the bug Something isn't working label Jan 26, 2023
@shadeMe shadeMe changed the title Allocate zeroed outputs with the correct width whenever possible Correct the handling of empty/zero-length docs during training and inference Jan 26, 2023
@shadeMe shadeMe changed the title Correct the handling of empty/zero-length docs during training and inference Correct the handling of empty/zero-length Docs during training and inference Jan 26, 2023
@shadeMe shadeMe marked this pull request as ready for review January 26, 2023 10:16
@shadeMe shadeMe added the feat / pipeline Feature: Pipeline components label Jan 27, 2023
Review comments on spacy_transformers/layers/trfs2arrays.py (outdated, resolved)
@shadeMe shadeMe changed the title Correct the handling of empty/zero-length Docs during training and inference Correct and clarify the handling of empty/zero-length Docs during training and inference Jan 30, 2023
@danieldk danieldk self-requested a review January 30, 2023 11:27
danieldk (Contributor) commented:

Assigned to you Sofie, since you added the handling of zero-length docs, so it would be good to get an additional sanity check.

svlandeg (Member) left a comment:

Nice digging, looks great!

Review comment on spacy_transformers/layers/trfs2arrays.py (outdated, resolved)
@svlandeg svlandeg merged commit f8bb75d into explosion:master Jan 30, 2023
@shadeMe shadeMe deleted the fix/zero-length-outputs branch January 30, 2023 17:54