Skip to content

Breaking change of models since PR #252 #324

Closed as not planned
Closed as not planned
@PriNova

Description

@PriNova

After the PR #252, all base models need to be converted new.

For me, this is a big breaking change. The LoRa and/or Alpaca fine-tuned models are not compatible anymore.
Reconverting is not possible.

I see from the PR, that the tokenizer scores are written into the model.
Would it make sense to write the tokenizer scores into a seperate file to stay compatible with the (old) models?
The question then arrises, if

  1. by loading the model the scoring file will be checked of existense and the sentencepiece tokenizer will be used, or
  2. the user can decide which tokenizer to use.

What you think?

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingmodelModel specific

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions