Fine tune Llama2 with Lora for foreign language #592

konsalex · 2023-10-17T10:30:49Z

konsalex
Oct 17, 2023

Hey folks,

I watched a YouTube video, about how some LLMs tokenise languages other than English.

For example for the Greek language you will see that this is failing totally, as one character is one token always:

My question is, if I would fine-tune Llama2 with Lora based on Greek text, would the tokeniser change and work properly? Or the fine tune would not work as the tokeniser cannot be retrained/tuned?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fine tune Llama2 with Lora for foreign language #592

{{title}}

Replies: 0 comments

Select a reply

Fine tune Llama2 with Lora for foreign language #592

konsalex Oct 17, 2023

Replies: 0 comments

konsalex
Oct 17, 2023