## Current state

When training a `Tokenizer`, its `Model` gets replaced after training, since the `Trainer` generates a new `Model`. This has several limitations:

- When the `Model` being replaced has been customized (dropout, unk_token, ...), we lose all of this when we replace it (cf. BPE dropout not working as expected #201).
- In Python, if we keep a reference to the model added to the `Tokenizer`, this reference does not point to the actual model used by the `Tokenizer` after training.
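To make the problem concrete, here is a minimal Python sketch against the `tokenizers` bindings. Argument names follow the library's public API, but the exact `train` signature at the time of this issue may differ, and the comments describe the behavior as reported here:

```python
from tokenizers import Tokenizer
from tokenizers.models import BPE
from tokenizers.trainers import BpeTrainer

# A customized model: BPE dropout and the unknown token are set
# explicitly when the tokenizer is built.
model = BPE(dropout=0.1, unk_token="[UNK]")
tokenizer = Tokenizer(model)

trainer = BpeTrainer(vocab_size=1000, special_tokens=["[UNK]"])
tokenizer.train(["data.txt"], trainer)  # "data.txt" is a placeholder corpus

# Because the Trainer builds a brand-new Model and swaps it in
# (the behavior this issue reports):
# - the dropout / unk_token customizations above are silently lost;
# - `model` still refers to the old, untrained instance, not the
#   model the tokenizer actually uses after training.
print(tokenizer.get_vocab_size())  # reflects the trained vocabulary
```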
## Goal

Change the `Trainer` to actually train the `Model` in-place.
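In other words, after training, a previously held reference should still point at the tokenizer's live, trained model. A hypothetical illustration of the intended behavior (not the library's confirmed API at the time of this issue):

```python
from tokenizers import Tokenizer
from tokenizers.models import BPE
from tokenizers.trainers import BpeTrainer

model = BPE(dropout=0.1, unk_token="[UNK]")
tokenizer = Tokenizer(model)
tokenizer.train(["data.txt"], BpeTrainer(vocab_size=1000))

# With in-place training, the Trainer mutates the existing Model:
# dropout and unk_token are preserved, and `model` now reflects
# the learned vocabulary instead of pointing at a stale instance.
```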