Sequence labeling refactoring #2361

whoisjones · 2021-07-31T21:35:43Z

Closes #2360.

helpmefindaname · 2021-11-16T15:32:08Z

@whoisjones is there any status update on this?
If not, do you mind if I create a PR based on it?

whoisjones · 2021-11-17T10:25:08Z

@helpmefindaname I'm currently moving the sequence labeler under the DefaultClassifier. We still need a parser for previous models, so it will take a few more days, but feel free to contribute on this branch.

alanakbik · 2021-12-14T10:08:34Z

flair/models/sequence_tagger_model.py

-
+from .sequence_tagger_utils.crf import CRF
+from .sequence_tagger_utils.viterbi import ViterbiLoss, ViterbiDecoder
+from ..datasets import DataLoader, SentenceDataset


Why these relative module paths? Why not import from flair.datasets?
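
For reference, the relative and absolute spellings name the same modules. The following is a minimal sketch, not code from this PR. It assumes the file's package is `flair.models` (inferred from the path flair/models/sequence_tagger_model.py) and uses only the standard library's `importlib.util.resolve_name`, so it runs without flair installed:

```python
from importlib.util import resolve_name

# Package containing flair/models/sequence_tagger_model.py
# (an assumption inferred from the file path shown in the diff).
PACKAGE = "flair.models"

# The relative module names used in the diff above.
relative_names = [
    ".sequence_tagger_utils.crf",
    ".sequence_tagger_utils.viterbi",
    "..datasets",
]

# Resolve each relative name to the absolute module name it refers to.
absolute_names = {name: resolve_name(name, PACKAGE) for name in relative_names}

for relative, absolute in absolute_names.items():
    print(f"{relative:32} -> {absolute}")
```

Under that assumption, `..datasets` resolves to `flair.datasets`, so `from flair.datasets import DataLoader, SentenceDataset` would be the absolute equivalent of the last import in the hunk.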

helpmefindaname · 2021-12-15T13:51:18Z

flair/models/sequence_tagger_model.py

-        pad_start_tags = torch.cat([start, tags], 1)
-        pad_stop_tags = torch.cat([tags, stop], 1)
+            # filter empty sentences
+            if isinstance(sentences[0], Sentence):


The if check shouldn't be required, as sentences is always of type List[Sentence] if typing isn't violated.

alanakbik · 2021-12-15

Ah yes, right! I'll push a correction, thanks!
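
A minimal sketch of the point being made, not the code from this PR: once the parameter is annotated as `List[Sentence]`, the empty-sentence filter can be written without inspecting `sentences[0]`. The `Sentence` class and the `filter_empty_sentences` helper below are hypothetical stand-ins for illustration only.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Sentence:
    """Hypothetical stand-in for flair's Sentence: just a list of tokens."""

    tokens: List[str] = field(default_factory=list)

    def __len__(self) -> int:
        return len(self.tokens)


def filter_empty_sentences(sentences: List[Sentence]) -> List[Sentence]:
    # The annotation already promises List[Sentence], so no isinstance
    # check on sentences[0] is needed. This also handles an empty list,
    # where indexing sentences[0] would raise IndexError.
    return [sentence for sentence in sentences if len(sentence) > 0]


batch = [Sentence(["George", "was", "born"]), Sentence(), Sentence(["Berlin"])]
kept = filter_empty_sentences(batch)
print([s.tokens for s in kept])
```

Dropping the check also removes a failure mode: a guard that indexes `sentences[0]` has to be protected separately against an empty input list.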

helpmefindaname · 2021-12-15T13:54:07Z

flair/models/sequence_tagger_model.py

-        for i in range(len(lens_)):
-            pad_stop_tags[i, lens_[i] :] = self.tag_dictionary.get_idx_for_item(STOP_TAG)
+            # order by length
+            reordered_sentences: List[Union[Sentence, str]] = sorted(sentences, key=lambda s: len(s), reverse=True)


The Union doesn't make sense: since sentences is of type List[Sentence], reordered_sentences will always be List[Sentence]. Also, I think mypy can infer the type of reordered_sentences, so the annotation might not be necessary.

alanakbik · 2021-12-15

Also a good point. I think this is a leftover from when str could also be passed to the predict function!
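
The suggestion can be sketched as follows, assuming a hypothetical stand-in `Sentence` class that only supports `len()` and a hypothetical `order_by_length` helper (neither is the code merged in this PR). The local variable carries no annotation, because `sorted()` over a `List[Sentence]` already yields a `List[Sentence]`.

```python
from typing import List


class Sentence:
    """Hypothetical stand-in for flair's Sentence; only len() matters here."""

    def __init__(self, text: str) -> None:
        self.tokens = text.split()

    def __len__(self) -> int:
        return len(self.tokens)


def order_by_length(sentences: List[Sentence]) -> List[Sentence]:
    # No annotation on the local variable: a type checker infers
    # List[Sentence] from the argument type of sorted().
    reordered_sentences = sorted(sentences, key=len, reverse=True)
    return reordered_sentences


batch = [
    Sentence("Berlin"),
    Sentence("George was born in Washington"),
    Sentence("I love Berlin"),
]
lengths = [len(s) for s in order_by_length(batch)]
print(lengths)
```

Here `key=len` behaves the same as the `lambda s: len(s)` in the diff; longest sentences come first because of `reverse=True`.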

alanakbik · 2021-12-16T13:17:06Z

@whoisjones thanks a lot for improving this!

whoisjones added 11 commits July 31, 2021 20:33

added utils folder and CRF class

dd4adde

added Viterbi classes

0419de4

added the sequence labeling utils.py containing required mathematical functions

95b9fa3

initial commit refactored sequence tagger

51d6c32

added start / stop tags

eb3e0a3

initial adjustments to new Classifier abstract class

822c79e

added get sequence tensor method

f292931

added required label property for new Classifier class interface

79ed883

adjust predict method to new Classifier interface

7ad6380

initial running version

2f65eb9

changes to loss function (loss averaging for trainer) and viterbi (removed batch loss average). running version, ready for PR.

3aba2e9

whoisjones mentioned this pull request Aug 3, 2021

Adapt pretrained sequence tagger models for refactoring #2362

Closed

whoisjones and others added 3 commits August 3, 2021 10:52

fix integration test erros.

e396a2c

remove testing file.

6d41a9b

Compatibility changes

9bc7fbb

BrambleXu mentioned this pull request Sep 3, 2021

Refactoring of Sequence Tagger Class #2360

Closed

whoisjones added 3 commits November 12, 2021 23:57

Merge branch 'master' of https://github.com/flairNLP/flair into sequence_labeling_refactoring

669e2f2

Conflicts: flair/models/sequence_tagger_model.py

merge

d8135ba

adjustments for DefaultClassifier

b8004e4

sequence tagger adaption for DefaultClassifier

7106703

whoisjones added 8 commits November 17, 2021 11:40

inference with ViterbiDecoder

ec700eb

Viterbi target formatting

51ab3a6

fix loss logging after each epoch

abd08bb

fix load and save method

93b5241

Merge branch 'master' of https://github.com/flairNLP/flair into sequence_labeling_refactoring

e2bf379

fix init method in order to load models with previous SequenceTagger

013e484

added store_embeddings to predict in order to save memory

17f6299

adjustments for linear layer into tag space

68a9138

whoisjones added 8 commits December 8, 2021 16:02

change transitions to be on CPU if using CUDA

9fae78a

transitions not always on same device if using CUDA

c01dc1d

transitions not always on same device if using CUDA

0e54fa2

fix: initialize transitions

e81f7c6

fix: standard inference via softmax

b6cedd1

SequenceTagger documentation

c3b3f5c

refactorings

89d9f34

refactorings

3acdbc0

alanakbik reviewed Dec 14, 2021

alanakbik and others added 9 commits December 14, 2021 11:31

try different sentence tensor method

7de0492

use different tensor creation method

46f082e

Merge pull request #2550 from flairNLP/sequence_tagger_speedups

335b082

Sequence tagger speedups

Merge branch 'master' into sequence_labeling_refactoring

02e3a61

Merge branch 'master' into sequence_labeling_refactoring

eacf136

Fix merge errors

312fb1c

update formatting to 120 length

cea8a31

Update instructions for formatting

9e7efe4

Fix empty sentence error

3837e72

helpmefindaname reviewed Dec 15, 2021

alanakbik added 6 commits December 15, 2021 14:54

Remove unnecessary if-check

9190c06

Remove typing

3386533

Unified final linear map

e637792

Inherit from Classifier

6cd3e5a

Undo error caused by moving _get_gold_labels out

f13fab3

Black formatting

2a0eee5

alanakbik merged commit 1d65cf4 into master Dec 16, 2021

alanakbik deleted the sequence_labeling_refactoring branch December 16, 2021 13:16

mauryaland mentioned this pull request Mar 16, 2023

[Question]: #3151

Closed
