[Safety] Fix a Static Task bug and Safety README #3612

jxmsML · 2021-04-21T17:20:07Z

Patch description
Patch on safety human safety

Just notice that the UI always enable response field in static task (if responseField !==null, which is always true given the default value for responseField is False), add a fix to it.
More explanation on how to run human safety evaluation, with new script parsing world logs to human eval ready format.
change logging info -> warn once, otherwise the loggings output is nasty for bs > 1

Patch on world logging (moved to #3674), this branch is rebased on the feature branch convo_log_pad in #3674

filter by message.is_padding() when write episodes to self._current_episodes.
modify the test tests/test_eval_model.py on test_save_report.

Testing steps

Other information

emilydinan · 2021-04-21T18:15:48Z

projects/safety_recipes/README.md

+python projects/safety_recipes/human_safety_evaluation/format_safety_ready.py --world-logs-path tmp/world_logs.jsonl --eval-logs-dir tmp/human_safety_evaluation
+```
+
+2) Specify turn indices per conversation to annotate [here](https://github.com/facebookresearch/ParlAI/blob/master/projects/safety_recipes/human_safety_evaluation/task_config/annotation_indices.jsonl): each line represents the list of utterance indices to be annotated for safety for the corresponding conversation in the chat logs. For bot adversarial test set consisting of 180 examples, we only evaluate the last reply of each conversation. 


is this part necessary if we use the command above?

if yes, can we make it automated? if no, can we make it clear that it's not necessary?

ah yes. I just edit this ( running format_safety_ready.py should automatically generate the annotation_indices.jsonl as well as the task_data.jsonl)

emilydinan · 2021-04-21T18:16:24Z

projects/safety_recipes/human_safety_evaluation/format_safety_ready.py

+    with PathManager.open(world_logs_path) as data_file:
+        for l in data_file.readlines():
+            episode = json.loads(l.strip())
+            # TODO: when conversation format is finished please remove this line;


what does this mean?

It seems like there is a bug in conversation format that would generate lines in the world_logs as following:

{"dialog": [[{"batch_padding": true, "episode_done": true, "id": "bot_adversarial_dialogue:HumanSafetyEvaluation.persona_False_flatten_False"}, {"id": "TransformerGenerator", "episode_done": false}]], "context": [], "metadata_path": "tmp/world_logs.metadata"}

I added a hack to skip those when parsing but, there is room for removing that hack after the bug above is fixed.

emilydinan

seems ready to go, but let's wait to merge until the aforementioned bug is fix so we can get rid of that comment/hack and rebase on top

jxmsML · 2021-04-28T22:50:46Z

parlai/utils/world_logging.py

@@ -74,12 +75,17 @@ def _add_msgs(self, acts, idx=0):
        """
        msgs = []
        for act in acts:
+            # padding examples in the episode[0]
+            if isinstance(act, Message) and act.is_padding():


only filter out if act is Message otherwise it'll break the unittests for act is dict

jxmsML · 2021-05-26T20:50:44Z

This pr is rebased on #3674. (separate the changes on safety test and world log saving)

facebook-github-bot added the CLA Signed label Apr 21, 2021

jxmsML force-pushed the safetyfix branch from 89e9083 to 570181c Compare April 21, 2021 18:04

jxmsML requested review from emilydinan and meganung April 21, 2021 18:06

emilydinan reviewed Apr 21, 2021

View reviewed changes

jxmsML force-pushed the safetyfix branch 3 times, most recently from 6410807 to da18ff9 Compare April 21, 2021 18:18

jxmsML changed the title ~~Fix a Static Task bug and Safety README~~ [Safety] Fix a Static Task bug and Safety README Apr 21, 2021

emilydinan approved these changes Apr 21, 2021

View reviewed changes

jxmsML requested a review from stephenroller April 28, 2021 22:36

jxmsML commented Apr 28, 2021

View reviewed changes

jxmsML force-pushed the safetyfix branch from 99aa163 to ce9dd56 Compare May 26, 2021 20:40

Jing Xu added 5 commits May 26, 2021 20:41

patch on world logging

e60852c

readme

73065ae

format

a67dc31

small lint fix

c6fb6b0

revert and apply safety changes only

2533ed8

jxmsML force-pushed the safetyfix branch from ce9dd56 to 2533ed8 Compare May 26, 2021 20:46

jxmsML changed the base branch from master to convo_log_pad May 26, 2021 20:51

Base automatically changed from convo_log_pad to master June 4, 2021 00:29

Merge branch 'master' into safetyfix

59e723e

jxmsML merged commit 3cd8646 into master Jul 7, 2021

jxmsML deleted the safetyfix branch July 7, 2021 14:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Safety] Fix a Static Task bug and Safety README #3612

[Safety] Fix a Static Task bug and Safety README #3612

jxmsML commented Apr 21, 2021 •

edited

Loading

emilydinan Apr 21, 2021

emilydinan Apr 21, 2021

jxmsML Apr 21, 2021

emilydinan Apr 21, 2021

jxmsML Apr 21, 2021 •

edited

Loading

emilydinan left a comment

jxmsML Apr 28, 2021

jxmsML commented May 26, 2021

[Safety] Fix a Static Task bug and Safety README #3612

[Safety] Fix a Static Task bug and Safety README #3612

Conversation

jxmsML commented Apr 21, 2021 • edited Loading

emilydinan Apr 21, 2021

Choose a reason for hiding this comment

emilydinan Apr 21, 2021

Choose a reason for hiding this comment

jxmsML Apr 21, 2021

Choose a reason for hiding this comment

emilydinan Apr 21, 2021

Choose a reason for hiding this comment

jxmsML Apr 21, 2021 • edited Loading

Choose a reason for hiding this comment

emilydinan left a comment

Choose a reason for hiding this comment

jxmsML Apr 28, 2021

Choose a reason for hiding this comment

jxmsML commented May 26, 2021

jxmsML commented Apr 21, 2021 •

edited

Loading

jxmsML Apr 21, 2021 •

edited

Loading