
Add training.before_update callback #11739

Merged

Conversation

@shadeMe (Contributor) commented Nov 2, 2022

Description

This callback can be used to implement training paradigms like gradual (un)freezing of components (e.g., the Transformer) after a certain number of training steps, to mitigate catastrophic forgetting during fine-tuning.
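For illustration, a minimal sketch of what registering such a callback could look like (the registered name `gradual_unfreezing.v1` and the `unfreeze_transformer` stub are invented for this example; the callback shape — the `nlp` object plus an info dict containing the current `"step"` — follows what this PR introduces):

```python
from typing import Any, Callable, Dict

import spacy
from spacy.language import Language


def unfreeze_transformer(nlp: Language) -> None:
    # Placeholder, not part of spaCy: how to unfreeze depends on the
    # component; for a transformer one might re-enable gradient updates here.
    ...


@spacy.registry.callbacks("gradual_unfreezing.v1")
def create_before_update(unfreeze_step: int) -> Callable[[Language, Dict[str, Any]], None]:
    def before_update(nlp: Language, info: Dict[str, Any]) -> None:
        # The training loop invokes this before each update; `info` carries
        # the current step, so components can be modified mid-training.
        if info["step"] == unfreeze_step:
            unfreeze_transformer(nlp)

    return before_update
```

The factory would then be referenced from the training config, e.g. via a `[training.before_update]` block with `@callbacks = "gradual_unfreezing.v1"`.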

Types of change

New feature

Checklist

  • I confirm that I have the right to submit this contribution under the project's MIT license.
  • I ran the tests, and all new and existing tests passed.
  • My changes don't require a change to the documentation, or if they do, I've added all required information.

@shadeMe requested a review from danieldk on November 2, 2022
Review thread on spacy/schemas.py (outdated, resolved)
@shadeMe added the feat / pipeline (Feature: Processing pipeline and components) and feat / training (Feature: Training utils, Example, Corpus and converters) labels on Nov 2, 2022
@shadeMe marked this pull request as draft on November 2, 2022
@danieldk (Contributor) left a comment

Did you find out if there is an easy way to test this?

Review thread on spacy/training/loop.py (outdated, resolved)
@shadeMe (Contributor, Author) commented Nov 9, 2022

> Did you find out if there is an easy way to test this?

Nothing short of bundling a small toy corpus file and invoking the train and init_nlp functions, like I do in the gradual transformer unfreezing PR.
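(For reference, such a toy `.spacy` corpus file could be generated with `DocBin`; the texts below are placeholders:)

```python
import spacy
from spacy.tokens import DocBin

nlp = spacy.blank("en")
doc_bin = DocBin()
for text in ["I like cats", "Dogs are loyal"]:
    doc_bin.add(nlp.make_doc(text))
doc_bin.to_disk("toy_corpus.spacy")  # loadable by spaCy's training corpus reader
```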

@adrianeboyd added the v3.5 (Related to v3.5) label on Nov 11, 2022
@adrianeboyd (Contributor) commented

Can you add some minimal tests for this? It doesn't look like there are any training-specific callback tests yet, but there are similar tests for the [nlp] callbacks.
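(For context, the existing [nlp] callback tests follow roughly this pattern — the callback name below is invented for illustration: register a callback, reference it from the config, and assert it ran when the pipeline was constructed:)

```python
from spacy.language import Language
from spacy.util import load_config_from_str, load_model_from_config, registry

calls = []


@registry.callbacks("test_after_creation.v1")
def make_after_creation():
    def after_creation(nlp: Language) -> Language:
        calls.append(nlp.lang)
        return nlp

    return after_creation


CONFIG = """
[nlp]
lang = "en"
pipeline = []

[nlp.after_creation]
@callbacks = "test_after_creation.v1"
"""


def test_after_creation_callback():
    load_model_from_config(load_config_from_str(CONFIG), auto_fill=True)
    assert calls == ["en"]
```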

@adrianeboyd (Contributor) commented

I think you should be able to test this with train_while_improving without a separate .spacy file? (I'm pretty sure the tests are going to fail because it's not automatically included in the package, and we'd want to be careful in general about adding a lot of extra data to the test suite.)
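(For illustration, a test along those lines might look something like the sketch below — the reader and callback names and the minimal config are invented, not the test that was eventually merged; an in-memory corpus reader stands in for a bundled `.spacy` file:)

```python
from typing import Any, Callable, Dict, Iterable

from spacy.language import Language
from spacy.training import Example
from spacy.training.initialize import init_nlp
from spacy.training.loop import train
from spacy.util import load_config_from_str, registry

seen_steps = []


@registry.callbacks("test_before_update.v1")
def make_before_update() -> Callable[[Language, Dict[str, Any]], None]:
    def before_update(nlp: Language, info: Dict[str, Any]) -> None:
        # Record each step the training loop reports to us.
        seen_steps.append(info["step"])

    return before_update


@registry.readers("test_corpus.v1")
def make_corpus() -> Callable[[Language], Iterable[Example]]:
    # In-memory examples instead of a bundled .spacy file.
    def reader(nlp: Language) -> Iterable[Example]:
        for text in ["I like cats", "Dogs bark"]:
            doc = nlp.make_doc(text)
            yield Example.from_dict(doc, {"tags": ["V"] * len(doc)})

    return reader


CONFIG = """
[nlp]
lang = "en"
pipeline = ["tagger"]

[components.tagger]
factory = "tagger"

[corpora.train]
@readers = "test_corpus.v1"

[corpora.dev]
@readers = "test_corpus.v1"

[training]
max_steps = 2
eval_frequency = 2

[training.before_update]
@callbacks = "test_before_update.v1"
"""


def test_before_update_callback():
    nlp = init_nlp(load_config_from_str(CONFIG))
    train(nlp)
    assert seen_steps  # the callback ran with step info
```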

@shadeMe (Contributor, Author) commented Nov 18, 2022

Ah, I didn't realize the extra data wouldn't be included by default. I'll try out your suggestion.

@adrianeboyd (Contributor) commented

It's not that we can't add extra data with some additional settings, and it might make sense to have one toy .spacy file that's shared between different tests.

As I've mentioned a few times, the extra CLI tests in the CI should probably be moved into a proper test suite that's separate from the package tests, so the CI can run both; that would be an easier place for longer/larger tests.
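(If a shared toy `.spacy` file were the route taken, a session-scoped pytest fixture would be one way to do it — a sketch, not the project's actual fixture:)

```python
import pytest
import spacy
from spacy.tokens import DocBin


@pytest.fixture(scope="session")
def toy_corpus_path(tmp_path_factory):
    # Write one small corpus to a temp dir, shared by all tests in the session.
    nlp = spacy.blank("en")
    doc_bin = DocBin()
    for text in ["I like cats", "Dogs are loyal"]:
        doc_bin.add(nlp.make_doc(text))
    path = tmp_path_factory.mktemp("corpus") / "toy.spacy"
    doc_bin.to_disk(path)
    return path
```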

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
@adrianeboyd merged commit 5ea14af into explosion:master on Nov 23, 2022
@shadeMe deleted the feature/new-training-step-callback branch on November 23, 2022