Skip to content
This repository has been archived by the owner on Nov 3, 2023. It is now read-only.

[TOD][Dataset][Easy] Google SGD in TOD Conversations format #4181

Merged
merged 51 commits into from
Dec 22, 2021

Conversation

moyapchen
Copy link
Contributor

@moyapchen moyapchen commented Nov 16, 2021

Refactor Google SGD away from old format into TOD Conversations format.

Datasets added in this substack:

  • Google SGD
    • Google SGD Simulation Splits (In-domain, Out-domain)
  • MetalWoz
  • MSR_E2E
  • Multidogo
  • MultiWoz V2.2
  • Taskmaster
  • Taskmaster2
  • Taskmaster3 (TicketTalk)

Test plan:
Regression test, parlai dd of dataset

Moya Chen added 15 commits November 15, 2021 20:15
See documentation block in `tod_agents.py`
As noted in the README, this agent takes data generated from `tod_world_script.py` and dumps it out to a teacher.

(Note that I tried setting up a regression test for this teacher, but I ran into issues getting it to save the output directory to not be something that included my local homedir name in it..)
See documentation block in `tod_agents.py`

(I'm not 100% sure if `conftest.py` is a right file to change, though I did notice that `pytest.ini` was necessary to get pytest to run.)
See documentation in `tod_world_script.py` for usage.
Refactor Google SGD away from old format into TOD Conversations format.

Datasets added in this substack:
* *Google SGD*
   * Google SGD Simulation Splits (In-domain, Out-domain)
* MetalWoz
* MSR_E2E
* Multidogo
* MultiWoz V2.2
* Taskmaster
* Taskmaster2
* Taskmaster3 (TicketTalk)

Test plan:
Regression test, `parlai dd` of dataset
Moya Chen added 22 commits November 29, 2021 10:42
Base automatically changed from simpler_tod_4_tod_json to main December 22, 2021 17:39
@moyapchen moyapchen merged commit 5f94b6f into main Dec 22, 2021
@moyapchen moyapchen deleted the simpler_tod_5a_google_sgd branch December 22, 2021 18:08
EricMichaelSmith added a commit that referenced this pull request Dec 23, 2021
* Add flag for external DB

* [WIP] Fix crowdsourcing CI check

* [TOD] World, world metrics, script, tests (#4178)

* [TOD][straightforward] Tod world output jsonl structure to teacher task (#4180)

* [TOD][Dataset][Easy] Google SGD in TOD Conversations format (#4181)

* Add flag for external DB

* Try to fix unit test

* Minor

Co-authored-by: Moya Chen <72097364+moyapchen@users.noreply.github.com>
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants