
Fixed generation args issue affecting OpenAI completion model #1458

Merged

Conversation

@Am1n3e (Contributor) commented Feb 22, 2024

The task's requests use the generation kwargs as request args when the task type is generate_until, as shown below:

arguments = (ctx, self.config.generation_kwargs)

Then, in the OpenAI completion model, the until attribute is popped from the request args:

until = request_args.pop("until", ["<|endoftext|>"])

Since the object (self.config.generation_kwargs) is mutable, once the attribute is popped it will not be found on the next request, and will default to ["<|endoftext|>"] instead of whatever value until held (as provided by the task config file or the --gen_kwargs argument).

This means that only the first request will have the correct until value; any subsequent request will use the default.
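The failure mode can be reproduced in isolation. Below is a minimal sketch (hypothetical prompts and stop sequences, not the harness's actual task plumbing) showing how pop() on a shared dict consumes the key on the first request:

```python
# Minimal sketch of the bug: every request tuple shares the SAME
# generation_kwargs dict, so pop() consumes "until" on the first request.
generation_kwargs = {"until": ["\n\n"]}
requests = [("prompt 1", generation_kwargs), ("prompt 2", generation_kwargs)]

seen = []
for _ctx, request_args in requests:
    # pop() mutates the shared dict in place
    until = request_args.pop("until", ["<|endoftext|>"])
    seen.append(until)

print(seen)  # [['\n\n'], ['<|endoftext|>']] -- only the first request sees the configured value
```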

@haileyschoelkopf (Collaborator)
Thanks for the bug report and fix!

Could we also add a copy prior to the popping of attributes from request_args within the OpenAI models, to be on the safe side?

self._max_gen_toks = request_args.pop("max_gen_toks", self.max_gen_toks)
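A defensive copy at the top of the method would look something like the sketch below (hypothetical function and parameter names, not the harness's actual code): the pops then operate on a private dict and the caller's object is never mutated.

```python
def consume_request_args(request_args, default_max_gen_toks=256):
    # Shallow-copy first so the pops below never mutate the caller's dict,
    # which may be shared across many requests.
    request_args = dict(request_args)
    max_gen_toks = request_args.pop("max_gen_toks", default_max_gen_toks)
    until = request_args.pop("until", ["<|endoftext|>"])
    return max_gen_toks, until, request_args

shared = {"until": ["###"], "temperature": 0.7}
first = consume_request_args(shared)
second = consume_request_args(shared)
assert first == second  # the shared dict survives repeated calls unchanged
```

A shallow copy suffices here because only top-level keys are popped; nested values are never modified.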

@Am1n3e (Contributor, Author) commented Feb 22, 2024

@haileyschoelkopf How about just using get in the model instead? That would avoid making copies in multiple places.

@Am1n3e (Contributor, Author) commented Feb 22, 2024

Something like:

self._max_gen_toks = request_args.get("max_gen_toks", self.max_gen_toks)
for context, _ in chunk:
    context_enc = self.tok_encode(context)
    inp = context_enc[-(self.max_length - self.max_gen_toks) :]
    inps.append(inp)

until = request_args.get("until", ["<|endoftext|>"])
request_args["temperature"] = request_args.get("temperature", 0)

response = oa_completion(
    client=self.client,
    model=self.model,
    prompt=inps,
    max_tokens=self.max_gen_toks,
    stop=until,
    seed=self.seed,
    **{k: v for k, v in request_args.items() if k not in ["do_sample", "max_gen_toks"]},
)

@haileyschoelkopf (Collaborator)
@Am1n3e Sure, that works for me!

@haileyschoelkopf (Collaborator)

Per @baberabb, the test failures are due to the no-copy behavior being relied upon here: https://github.com/Am1n3e/lm-evaluation-harness-ae/blob/5c4e0aa7e4802191c60ec20862ad59d16e702457/tests/models/test_huggingface.py#L25C1-L26C71

Changing the order of these two lines in the test should make it safe to do this copy where you've introduced it!

@Am1n3e (Contributor, Author) commented Feb 22, 2024

@haileyschoelkopf I've addressed the comments.

@haileyschoelkopf (Collaborator)

Thank you again!

@haileyschoelkopf haileyschoelkopf merged commit 75ac1f4 into EleutherAI:main Feb 22, 2024
7 of 8 checks passed
wx-zhang pushed a commit to wx-zhang/lm-evaluation-harness that referenced this pull request Mar 13, 2024
…erAI#1458)

* Fixed generation args issue affection openai completion model

* Fixed hf unit test; removed pop attributes in OpenAi completion.

* fix format

* fix format

---------

Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
nightingal3 pushed a commit to mycoalchen/lm-evaluation-harness that referenced this pull request May 2, 2024
djstrong pushed a commit to speakleash/lm-evaluation-harness that referenced this pull request Aug 2, 2024
2 participants