[TOD][Dataset][Easyish] Google Simulation Splits #4182
See documentation block in `tod_agents.py`
As noted in the README, this agent takes data generated from `tod_world_script.py` and dumps it out to a teacher. (Note that I tried setting up a regression test for this teacher, but I could not get it to save to an output directory whose path did not include my local homedir name.)
See documentation block in `tod_agents.py` (I'm not 100% sure if `conftest.py` is the right file to change, though I did notice that `pytest.ini` was necessary to get pytest to run.)
See documentation in `tod_world_script.py` for usage.
…iff in stack, but lost from this one
Refactor Google SGD away from old format into TOD Conversations format.

Datasets added in this substack:
* *Google SGD*
* Google SGD Simulation Splits (In-domain, Out-domain)
* MetalWoz
* MSR_E2E
* Multidogo
* MultiWoz V2.2
* Taskmaster
* Taskmaster2
* Taskmaster3 (TicketTalk)

Test plan: Regression test, `parlai dd` of dataset
Code for processing Google SGD into In-domain and Out-domain data via `build.py`, with usage via agents.

Datasets added in this substack:
* Google SGD
* **Google SGD Simulation Splits (In-domain, Out-domain)**
* MetalWoz
* MSR_E2E
* Multidogo
* MultiWoz V2.2
* Taskmaster
* Taskmaster2
* Taskmaster3 (TicketTalk)

Test plan: Regression test, `parlai dd` of dataset
…mplications for mutators too?)
Worth just doing one extra verification this reproduces the paper folds, just in case.
Stamped.
…ta and new one; realized I was missing a +1 in the episode length count
Ran an eval on the new dataset with the old models; got the same numbers. Also with train: `12:53:39 | loaded 2303 episodes with a total of 60589 examples` vs `parlai dd -t fb:google_sgd_rl_splits:OutDomainSystemTeacher` train: `12:22:41 | loaded 2303 episodes with a total of 60589 examples`
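For anyone repeating this check offline, here is a minimal sketch of the same comparison, assuming each split is already in memory as a list of episodes, where an episode is a list of turn dicts. The helper names and the toy data are hypothetical; this does not use any ParlAI API and is not the code in this PR.

```python
from typing import Dict, Sequence


def count_episodes_and_examples(episodes: Sequence[Sequence[dict]]) -> Dict[str, int]:
    """Count episodes and total examples (turns) in a split.

    Hypothetical helper: assumes an episode is a sequence of turn dicts.
    """
    return {
        "episodes": len(episodes),
        "examples": sum(len(episode) for episode in episodes),
    }


def same_counts(old: Sequence[Sequence[dict]], new: Sequence[Sequence[dict]]) -> bool:
    """True when both splits have identical episode and example counts."""
    return count_episodes_and_examples(old) == count_episodes_and_examples(new)


# Toy stand-ins for the old-format and new-format train splits.
old_split = [[{"text": "hi"}, {"text": "bye"}], [{"text": "hello"}]]
new_split = [[{"text": "hi"}, {"text": "bye"}], [{"text": "hello"}]]

print(count_episodes_and_examples(new_split))
print(same_counts(old_split, new_split))
```

Matching counts are only a sanity check: they would catch an off-by-one in episode length, but they do not show that the contents of the folds are identical.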