Evaluating on My Own Datasets #83

lyy1994 · 2023-02-12T15:15:57Z

Hi, I am trying to evaluate GLM-130B in our own datasets. I follow this guide to convert the CHID in FewCLUE to see whether my implementation is correct. However, I found that removing the tokenized inputs and choices in the released dataset does not lead to the same result reported in the paper. I think the inputs_pretokenized and choices_pretokenized are not properly tokenized and want to know how to ensure correct tokenization.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Evaluating on My Own Datasets #83

Evaluating on My Own Datasets #83

lyy1994 commented Feb 12, 2023

Evaluating on My Own Datasets #83

Evaluating on My Own Datasets #83

Comments

lyy1994 commented Feb 12, 2023