
issues when run disinformation #1474

Closed
lumosity4tpj opened this issue Apr 11, 2023 · 2 comments

Comments

@lumosity4tpj
When I run the disinformation scenarios as in run_specs.conf#L538-L540, I have two issues:

  • capability=reiteration raises helm.benchmark.executor.ExecutorError: HuggingFace error: num_return_sequences has to be 1, but is 5 when doing greedy search, so the settings don't match. This problem also occurs when running real_toxicity_prompts.
  • capability=wedging raises ValueError: More than one stop sequence is not supported. I found that this defines stop_sequence as a list of length two, but this only allows a list of length one.

When I looked at the corresponding references (this, this), I still wondered how to solve these problems.
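For context on the first error: Hugging Face `generate` refuses `num_return_sequences > 1` under greedy decoding, because greedy search is deterministic and can only produce one sequence. A minimal sketch of that constraint (a hypothetical re-implementation of the check, not the actual transformers code) might look like this:

```python
def check_generation_args(num_return_sequences=1, num_beams=1, do_sample=False):
    """Sketch of the constraint transformers enforces on generation settings.

    Greedy search (num_beams == 1, do_sample == False) is deterministic,
    so asking for more than one returned sequence is rejected; multiple
    sequences require sampling or beam search.
    """
    if not do_sample and num_beams == 1 and num_return_sequences > 1:
        raise ValueError(
            f"num_return_sequences has to be 1, but is {num_return_sequences} "
            "when doing greedy search."
        )
    if num_beams > 1 and num_return_sequences > num_beams:
        raise ValueError("num_return_sequences cannot exceed num_beams.")
```

So requesting 5 sequences works only together with `do_sample=True` or `num_beams >= 5`.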

@lumosity4tpj
Copy link
Author


  • For the first point, I'm sorry: it was caused by my mistake in choosing the wrong generation mode in Hugging Face, so it is not a problem.
  • For the second point, I think that when the list has length > 1, instead of passing stop_sequence as eos_token, generation should run until the maximum length or the original tokenizer's EOS is reached, and truncation at the stop sequences should be handled outside model.generate. Is the reason we don't do this that we only want the stop sequence to act as EOS and ignore the original tokenizer's EOS?
  • Moreover, I found that some tokenizers tokenize a stop_sequence (even when the list has length == 1) into two ids, which also raises an error; this could also be solved by the method above.
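The post-processing proposed in the second point can be sketched as follows (a hypothetical helper, not HELM's actual code): let `model.generate` run to max length or the tokenizer's EOS, then cut the decoded text at the earliest stop sequence afterwards, which sidesteps both the multi-stop-sequence and the multi-token-stop-sequence limitations:

```python
def truncate_at_stop_sequences(text: str, stop_sequences: list[str]) -> str:
    """Truncate generated text at the earliest occurrence of any stop sequence.

    This works on the decoded string, so it handles any number of stop
    sequences and stop sequences that span multiple token ids.
    """
    cut = len(text)
    for stop in stop_sequences:
        idx = text.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]
```

For example, with stop sequences `["\n", "."]`, the text `"Claim one.\nClaim two"` would be truncated to `"Claim one"`.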

If you could give me some answers, I would really appreciate it.

@yifanmai
Collaborator

yifanmai commented Aug 8, 2024

This looks like a duplicate of #1501, which was fixed in #1892, i.e. HuggingFaceClient failing when the stop sequence is longer than one token.

@yifanmai yifanmai closed this as completed Aug 8, 2024