[ENH] add internlm2-chat-20b-ppo #207

C1rN09 · 2024-01-16T06:15:26Z

Hi, we are releasing internlm2 this week and here is our result on AlpacaEval.

The huggingface link is private now and will be public after its release. Hope to merge ASAP :)

Extra Modifications

The eos token, [UNUSED_TOKEN_145] for our model, seems to appear in the completion results. It might spoil the final score. So I added a remove_ending argument to huggingface_local_completions to strip this annoying word.

If this is not preferred, please let me know and we can work out another solution!

results/internlm2-chat-20b-ppo/reference_outputs.json

YannDubs · 2024-01-16T07:41:22Z

src/alpaca_eval/models_configs/internlm2-chat-20b-ppo/configs.yaml

+      torch_dtype: "bfloat16"
+      trust_remote_code: True
+    is_fast_tokenizer: False
+    max_new_tokens: 2048


note that we typically use 4096 for newer models. 2048 is perfectly fine just saying in case some outputs are truncated

Thanks for your advice!

Currently we haven't observed too many truncated completions. We suppose 2048 tokens is enough for our model. Responses above 2048 tokens are generally nonsense repetition, which might waste GPU time & GPT tokens

YannDubs · 2024-01-16T07:42:13Z

Very impressive results @C1rN09, I'll merge once you remove the reference_outputs!

C1rN09 added 5 commits January 16, 2024 06:01

add model config

d7b97d0

modify huggingface_local_completion to remove EOS

2153360

add results & update leaderboard

7081250

delete extra leaderboard csv

e8b579a

add docstring of remove_ending

0efc893

YannDubs reviewed Jan 16, 2024

View reviewed changes

results/internlm2-chat-20b-ppo/reference_outputs.json Outdated Show resolved Hide resolved

YannDubs reviewed Jan 16, 2024

View reviewed changes

remove reference_outputs.json

4a816e3

YannDubs merged commit a1f070b into tatsu-lab:main Jan 16, 2024
2 checks passed

C1rN09 mentioned this pull request Jan 16, 2024

prettify "pretty_name" of internlm2 #208

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ENH] add internlm2-chat-20b-ppo #207

[ENH] add internlm2-chat-20b-ppo #207

C1rN09 commented Jan 16, 2024

YannDubs Jan 16, 2024

C1rN09 Jan 16, 2024

YannDubs commented Jan 16, 2024

[ENH] add internlm2-chat-20b-ppo #207

[ENH] add internlm2-chat-20b-ppo #207

Conversation

C1rN09 commented Jan 16, 2024

Extra Modifications

YannDubs Jan 16, 2024

Choose a reason for hiding this comment

C1rN09 Jan 16, 2024

Choose a reason for hiding this comment

YannDubs commented Jan 16, 2024