Skip to content
This repository has been archived by the owner on Nov 3, 2023. It is now read-only.

SaferDialogues parallel test and valid set with BAD dataset #4623

Merged
merged 9 commits into from
Jul 26, 2022

Conversation

meganung
Copy link
Contributor

Adding a new teacher SaferDialoguesBAD that has a different test and valid set that is parallel with the BAD dataset (train set is the same amongst both teachers)

Added test for the new teacher, pytest passes.

@meganung meganung requested a review from jxmsML June 21, 2022 14:54
@meganung meganung force-pushed the saferdialogues_bad_testset branch from 70875d8 to c458039 Compare June 27, 2022 19:18
@meganung meganung requested a review from klshuster July 7, 2022 17:52

RESOURCES = [
DownloadableFile(
f'http://parl.ai/downloads/saferdialogues/saferdialogues_{SAFERDIALOGUES_VERSION}.tar.gz',
f'saferdialogues_{SAFERDIALOGUES_VERSION}.tar.gz',
'7ecf062fdf3e0ddfd16c52998341f543331e0dbaa1fcdb27026e8b0b80446fed',
'3d1bc731fb0c63d9f61a52b3d5f1aab0911cda1ca38f49822c79c18a2dc8e834',
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

will the new teacher be overriding the previous version (the one with the old valid and test split), I wonder if it would be nicer to keep both splits (one for reproducibility in your paper, and one here)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

no it's not overriding it, the new file just includes all the data (original for reproducibility) and the new data for the new teacher that this PR introduces. The default task points to the original teacher + data from the paper. To use the new data where the splits are parallel with the BAD dataset, need to explicitly use SaferDialoguesBAD teacher.

Copy link
Contributor

@klshuster klshuster left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks great just one q

@@ -1,9 +1,10 @@
#!/usr/bin/env python3

# Copyright (c) Meta Platforms, Inc. and affiliates.
# Copyright (c) Facebook, Inc. and its affiliates.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we need to update our copyrights @stephenroller ?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes we do

@@ -1,86 +1,66 @@
acts:
- - episode_done: true
eval_labels:
- I agree with that, they're kids after all.
- i am also a fun maker
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why did the train/val/test values change in the original dataset?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

good catch, it was just shuffled but should be updated so it's identical

@stephenroller
Copy link
Contributor

Can you rebase/merge main in?

@meganung meganung force-pushed the saferdialogues_bad_testset branch from 77f513e to b8365cb Compare July 15, 2022 17:58
@@ -10,13 +10,13 @@
import parlai.core.build_data as build_data
import os

SAFERDIALOGUES_VERSION = "v0.1"
SAFERDIALOGUES_VERSION = "v0.4"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LOL what happened to version 0.2 ~ 0.3

Copy link
Contributor

@jxmsML jxmsML left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!!! 🚀

@meganung meganung merged commit bd8e01a into main Jul 26, 2022
@meganung meganung deleted the saferdialogues_bad_testset branch July 26, 2022 18:56
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants