Fix missing pass of is_training to encode_input #108

ArneBinder · 2022-03-06T15:28:34Z

This was missed in #102.

EDIT: This rearranges the tested tokens in test_encode_input and test_collate to increase the readability of these tests.

…nput

ArneBinder · 2022-03-06T15:47:40Z

Do we also want to relax within this PR that the code does not fail if some entity, sentence or relation annotation layer is not available? Maybe just printing some warnings in these cases, e.g. "2 out of 200 documents have no relation annotations" in the case of RE or "5 of 200 documents have no sentence annotations, they will be discarded" when partition_annotation=sentences is used?

EDIT: I think this may need some discussion and should be handled in a separate PR. Related issue: #109

…in all places

ChristophAlt · 2022-03-07T07:07:58Z

tests/taskmodules/test_transformer_re_text_classification.py

@@ -505,8 +641,9 @@ def test_unbatch_output(prepared_taskmodule, model_output):

 @pytest.mark.parametrize("inplace", [False, True])
 def test_decode(prepared_taskmodule, documents, model_output, inplace):
-    encodings = prepared_taskmodule.encode(documents, encode_target=False)
+    encodings = prepared_taskmodule.encode(documents, encode_target=True)


Why did this change?

Because otherwise we would need to change the remaining of this test since encode produces now 10 instead of 2 encodings when enocde_target=False (all combinations of entities). I know, it would be better to decouple the test and do not call encode here at all, but I had not yet the energy to define simple encodings manually (which then could be passed to decode, the actual method to test here).

I think the same holds for test_encode_target. I created an issue for both cases: #110.

ArneBinder added 2 commits March 6, 2022 16:26

add missed parameter is_training

8e63d76

use is_training in TransformerRETextClassificationTaskModule.encode_i…

aaf5e8f

…nput

ArneBinder changed the title ~~Fix missing pass of is_training to encode_input~~ [WIP] Fix missing pass of is_training to encode_input Mar 6, 2022

ArneBinder changed the title ~~[WIP] Fix missing pass of is_training to encode_input~~ Fix missing pass of is_training to encode_input Mar 6, 2022

ArneBinder requested a review from ChristophAlt March 6, 2022 15:48

for consistency, we strictly require the respective annotation layer …

d7e9034

…in all places

ArneBinder added the bug Something isn't working label Mar 6, 2022

ChristophAlt reviewed Mar 7, 2022

View reviewed changes

ArneBinder mentioned this pull request Mar 7, 2022

Add train and eval mode to taskmodules #101

Open

ChristophAlt merged commit 2eb6a9e into main Mar 7, 2022

ChristophAlt deleted the fix/encode_input_is_training branch April 17, 2022 09:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix missing pass of is_training to encode_input #108

Fix missing pass of is_training to encode_input #108

ArneBinder commented Mar 6, 2022 •

edited

Loading

ArneBinder commented Mar 6, 2022 •

edited

Loading

ChristophAlt Mar 7, 2022

ArneBinder Mar 7, 2022 •

edited

Loading

ArneBinder Mar 7, 2022 •

edited

Loading

Fix missing pass of is_training to encode_input #108

Fix missing pass of is_training to encode_input #108

Conversation

ArneBinder commented Mar 6, 2022 • edited Loading

ArneBinder commented Mar 6, 2022 • edited Loading

ChristophAlt Mar 7, 2022

Choose a reason for hiding this comment

ArneBinder Mar 7, 2022 • edited Loading

Choose a reason for hiding this comment

ArneBinder Mar 7, 2022 • edited Loading

Choose a reason for hiding this comment

ArneBinder commented Mar 6, 2022 •

edited

Loading

ArneBinder commented Mar 6, 2022 •

edited

Loading

ArneBinder Mar 7, 2022 •

edited

Loading

ArneBinder Mar 7, 2022 •

edited

Loading