[Bug]: TarsModels do redownload embeddings #3207

helpmefindaname · 2023-04-20T15:48:45Z

Describe the bug

related but not the same as: #3167

Locally saving a TarsModel won't save the huggingface config and therefore requires internet connection & a longer loading time when loading it on a new machine without hf-cache.
This happens due to tars pickeling the internal SequenceTagger/TextClassifier instead of just seralizing their embeddings.

To Reproduce

from flair.models import TARSClassifier

model = TARSClassifier.load("tars-base")
model.tars_embeddings.model.config._name_or_path = "bert-base-uncased"
model.tars_embeddings.base_model_name = "bert-base-uncased"
model.tars_embeddings.name = "transformer-bert-base-uncased"
model.save("local-tars-base.pt")

# clear huggingface cache or copy `local-tars-base.pt` to another machine or docker container.

model.load("local-tars-base.pt")

Expected behavior

The model should load without the need of internet and it shouldn't require me to wait until the embedding config and weights are downloaded.

Logs and Stack traces

Downloading (…)okenizer_config.json: 100%|██████████| 28.0/28.0 [00:00<00:00, 17.2kB/s]
Downloading (…)lve/main/config.json: 100%|██████████| 570/570 [00:00<00:00, 517kB/s]
Downloading (…)solve/main/vocab.txt: 100%|██████████| 232k/232k [00:00<00:00, 2.12MB/s]
Downloading (…)/main/tokenizer.json: 100%|██████████| 466k/466k [00:00<00:00, 3.61MB/s]
Downloading pytorch_model.bin: 100%|██████████| 440M/440M [00:35<00:00, 12.5MB/s]

Screenshots

No response

Additional Context

No response

Environment

Versions:

Flair

0.12.2 (master branch)

Pytorch

2.0.0+cu117

Transformers

4.28.1

GPU

True

The text was updated successfully, but these errors were encountered:

helpmefindaname added the bug Something isn't working label Apr 20, 2023

helpmefindaname pushed a commit that referenced this issue Apr 21, 2023

gh-3207: serialize tars model to not redownload models

0a89107

helpmefindaname mentioned this issue Apr 21, 2023

Fix tars loading #3212

Merged

alanakbik closed this as completed in #3212 Apr 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug]: TarsModels do redownload embeddings #3207

[Bug]: TarsModels do redownload embeddings #3207

helpmefindaname commented Apr 20, 2023

[Bug]: TarsModels do redownload embeddings #3207

[Bug]: TarsModels do redownload embeddings #3207

Comments

helpmefindaname commented Apr 20, 2023

Describe the bug

To Reproduce

Expected behavior

Logs and Stack traces

Screenshots

Additional Context

Environment

Versions:

Flair

Pytorch

Transformers

GPU