gh-3097: fix multitask model training #3101
Merged
This PR fixes #3097 by introducing an `evaluate_all` parameter. That way, users can call `multitask_model.evaluate(corpus.test, gold_label_type="Task_0", evaluate_all=False)` to evaluate only the first task. If `evaluate_all` is set to `True`, the value of `gold_label_type` is ignored and every task is evaluated on its respective label type.

While working on this implementation, I also noticed that the evaluation currently evaluates a sentence annotated for multiple tasks on only one randomly chosen task. Now the evaluation uses all sentences for every task they are assigned to, as one would expect.
I also added the possibility to compute the loss of a sentence on all of its assigned tasks during training. As a result, training knowledge graph construction (NER + relation extraction + NEL) with a shared transformer embedding leads to only slightly increased training time compared to NER alone, while leveraging information from all three tasks at all times.
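The idea of summing a sentence's loss over every task it is assigned to can be sketched like this. All names here (`TaskHead`, `multitask_forward_loss`, the `task_ids` field) are hypothetical stand-ins, not flair's actual API:

```python
class TaskHead:
    """Stand-in for a task-specific head sharing one embedding pass."""

    def forward_loss(self, sentences):
        # Dummy loss: pretend each sentence contributes 1.0.
        return float(len(sentences)), len(sentences)


def multitask_forward_loss(batch, tasks):
    """Sum each sentence's loss over every task it is assigned to, so a
    single shared embedding pass feeds all task heads at once."""
    total_loss, total_count = 0.0, 0
    for task_id, head in tasks.items():
        # A sentence contributes to every task it is assigned to,
        # not just one randomly chosen task.
        assigned = [s for s in batch if task_id in s["task_ids"]]
        if assigned:
            loss, count = head.forward_loss(assigned)
            total_loss += loss
            total_count += count
    return total_loss, total_count
```

Because the expensive transformer embedding is computed once per batch, adding extra task heads only adds the (cheap) head computations on top.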
I also found a bug where embeddings were not handled correctly: `identify_dynamic_embeddings` only looks at the first sentence of the batch, which may be unembedded (e.g., a sentence used for relation extraction or NEL that has no NER labels). Now the logic searches the whole batch and continues until it finds a sentence that contains ANY embedding.
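The batch-wide search can be sketched as follows; `Token` and `Sentence` here are minimal stand-ins for illustration, not flair's real classes:

```python
class Token:
    def __init__(self, embeddings=None):
        # embeddings: dict mapping embedding name -> vector
        # (a simplified stand-in for flair's embedding storage)
        self.embeddings = embeddings or {}


class Sentence:
    def __init__(self, tokens):
        self.tokens = tokens

    def __iter__(self):
        return iter(self.tokens)


def identify_dynamic_embeddings(batch):
    """Scan the WHOLE batch instead of only batch[0]: return the embedding
    names of the first sentence that carries any embedding at all."""
    for sentence in batch:
        names = sorted({name for token in sentence for name in token.embeddings})
        if names:
            return names
    return None  # no sentence in this batch is embedded yet
```

The old behavior corresponds to inspecting only `batch[0]`, which returns nothing when that particular sentence happens to be unembedded even though later sentences in the batch are.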