I am trying to fine-tune using ORPOTrainer.
I have a question: if I have 1 chosen answer and 10 rejected answers, and my context length for the chosen answer is 8192, does that increase the prompt length for the rejected answers by 10 times?
How does this work in the backend in terms of context length, since you have to create user/assistant pairs for every rejected answer?
E.g.: https://huggingface.co/datasets/mlabonne/orpo-dpo-mix-40k/viewer/default/train?p=1
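For context, here is how I picture the data being laid out. As I understand it, preference trainers like ORPOTrainer consume *pairwise* rows, so a prompt with 1 chosen and 10 rejected answers would become 10 separate (prompt, chosen, rejected) rows rather than one giant concatenated sequence. This is only a sketch of my assumption (the helper `make_pairwise_rows` is hypothetical, not a TRL API) — someone please correct me if the backend does something else:

```python
# Hypothetical sketch: expand 1 chosen + 10 rejected answers into
# ORPO/DPO-style pairwise rows. Each row carries the full prompt plus
# ONE chosen/rejected pair, so the per-example sequence length stays
# roughly len(prompt) + len(chosen) + len(rejected). It is the number
# of dataset rows that grows 10x, not the context length of any one row.

def make_pairwise_rows(prompt, chosen, rejected_list):
    """Return one {"prompt", "chosen", "rejected"} dict per rejected answer."""
    return [
        {"prompt": prompt, "chosen": chosen, "rejected": rej}
        for rej in rejected_list
    ]

rows = make_pairwise_rows(
    prompt="What is 2 + 2?",
    chosen="4",
    rejected_list=[f"wrong answer {i}" for i in range(10)],
)
print(len(rows))              # 10 rows, one per rejected answer
print(rows[0]["prompt"])      # every row repeats the same prompt
```

If this picture is right, a max context of 8192 would apply to each (prompt, chosen, rejected) row independently, not to all 10 rejected answers at once.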