
Llama2-7B-chat-4k on PassageRetrieval-zh gets 10.12 #61

Open
fuqichen1998 opened this issue Mar 18, 2024 · 5 comments

Comments

@fuqichen1998

As the title says, my evaluation of Llama2-7B-chat-4k on PassageRetrieval-zh gets 10.12, which is significantly higher than the 0.5 reported in the README. Could you please share why?

@bys0318
Member

bys0318 commented Mar 20, 2024

Hi! Are you using the prompt template as in config/dataset2prompt.json?
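For reference, a minimal sketch of how the per-dataset template in config/dataset2prompt.json is applied before inference (the sample fields and the "passage_retrieval_zh" key are illustrative; see pred.py in the repo for the exact code):

```python
import json

# Load the per-dataset prompt templates shipped with LongBench.
with open("config/dataset2prompt.json", "r", encoding="utf-8") as f:
    dataset2prompt = json.load(f)

# Illustrative sample; the real fields come from the LongBench data files.
sample = {"context": "...passages...", "input": "...query..."}

# Fill the template for the dataset being evaluated.
prompt_format = dataset2prompt["passage_retrieval_zh"]
prompt = prompt_format.format(**sample)
```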

@bys0318
Member

bys0318 commented Mar 20, 2024

Please refer to our code here for the llama2 prompt: https://github.com/THUDM/LongBench/blob/main/pred.py#L33
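That branch wraps the filled prompt in Llama-2's chat instruction tags, roughly like the paraphrased sketch below (not the exact source; other model branches are omitted):

```python
def build_chat(tokenizer, prompt, model_name):
    # For llama2-chat models, wrap the prompt in the [INST] ... [/INST]
    # instruction tags expected by the chat fine-tune.
    if "llama2" in model_name:
        prompt = f"[INST]{prompt}[/INST]"
    return prompt
```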

@fuqichen1998
Author

Yes, I was using your pred.py to run the inference and evaluation.

@slatter666

> Yes, I was using your pred.py to run the inference and evaluation.

Actually, I also get the same result.

@condy0919

> Please refer to our code here for the llama2 prompt: https://github.com/THUDM/LongBench/blob/main/pred.py#L33

Is the [INST] wrapping necessary for llama2-7b/llama2-13b?
