[LIGHT] Jericho World dataset #3957

mojtaba-komeili · 2021-08-18T21:53:57Z

Patch description
Added ParlAI teachers for the JerichoWorld dataset. There are two sets of teachers, for predicting (1) the knowledge graph and (2) actions. The focus of this patch is on the knowledge graph teachers, although a simple action teacher is added as well. Here is a list of the main teachers that are expected to be used (not abstract ones, etc.):

StaticKGTeacher: predicts the knowledge graph for a single state

ActionKGTeacher: predicts mutations to the knowledge graph after an action

StateToActionTeacher: predicts player actions given the observation
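
To make "mutations to the knowledge graph" concrete, here is a minimal sketch that diffs two graphs stored as sets of (subject, relation, object) triples. The `graph_mutations` helper and the `ADD`/`DEL` string format are assumptions made for illustration and are not taken from the patch.

```python
from typing import List, Set, Tuple

Triple = Tuple[str, str, str]  # (subject, relation, object)


def graph_mutations(before: Set[Triple], after: Set[Triple]) -> List[str]:
    """Describe the change between two graphs as ADD/DEL strings.

    Hypothetical helper: the label format is an assumption, not the
    format used by ActionKGTeacher.
    """
    # Triples present only after the action were added by it.
    added = [f"ADD {s} {r} {o}" for s, r, o in sorted(after - before)]
    # Triples present only before the action were removed by it.
    deleted = [f"DEL {s} {r} {o}" for s, r, o in sorted(before - after)]
    return added + deleted


# The player picks up the lamp: it leaves the room and enters the inventory.
before = {("you", "in", "kitchen"), ("lamp", "in", "kitchen")}
after = {("you", "in", "kitchen"), ("you", "have", "lamp")}
print(graph_mutations(before, after))
```

Representing the graph as a set of triples keeps the diff down to two set differences, and sorting makes the output order deterministic.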

JackUrb

Overall this implementation is looking good - thanks for taking the time to break down the Jericho dataset and figure out what was not quite aligning. Did you come up with an answer to why some of the target actions weren't aligning with the given content?

I've left some nits, mostly about documentation for things that aren't super clear. You don't need to necessarily always be documenting at this level, but it certainly helps for getting code OSS ready early on, and makes it easier to return to this later when we need it again.

JackUrb · 2021-08-30T19:23:11Z

parlai/tasks/jericho_world/constants.py

+
+# The set of words in the description of graph that we skip while checking
+# for overlap between knowledge graph vertices and the description.
+GRAPH_VERT_SKIP_TOKEN = set(stopwords.words('english')) - {'you'}


Nit - it's useful to include the reasoning for this kind of decision when external context drove it. Here the choice to leave out "you" is deliberate, but the note just describes what these tokens are used for.

Added extra explanation.
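
As a rough illustration of what the constant does, the sketch below filters a description with a skip set that excludes 'you'. It uses a small hand-written stand-in for the NLTK stopword list so that it runs without downloads, and `content_tokens` is a hypothetical helper. One plausible motivation, assumed here rather than confirmed by the thread, is that the player appears in the graph as the vertex 'you', so the token has to survive filtering to match edges such as 'you in X'.

```python
# Stand-in for nltk's stopwords.words('english'); the real list is much longer.
SAMPLE_STOPWORDS = {"the", "a", "is", "in", "you", "of", "on"}

# Mirrors the constant in the diff: every stopword except 'you' is skipped.
SKIP_TOKENS = SAMPLE_STOPWORDS - {"you"}


def content_tokens(text: str) -> set:
    """Lowercase, split on whitespace, and drop the skip tokens (hypothetical helper)."""
    return {tok for tok in text.lower().split() if tok not in SKIP_TOKENS}


tokens = content_tokens("You are in the kitchen")
print(sorted(tokens))
```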

JackUrb · 2021-08-30T19:33:34Z

parlai/tasks/jericho_world/agents.py

+                    return True
+            return False
+
+        def keep_edge(edge, text_tokens):


I'm not sure I follow what your keep_edge heuristic is here - so we keep all 'you in X' and 'you have X', but otherwise we keep edges where the subject and object are in the text? Probably could use a quick one line desc to this effect.

''' Returns true if all the components of the edge are in the given text, or if it is the player location or unpruned inventory '''

Sure this documentation isn't required, but it makes it easier to grok what's happening.

Explained it in detail in the method docstring.
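
The heuristic discussed in this thread can be sketched as follows. This is a guess based only on the reviewer's summary and suggested docstring, not the merged code: the (subject, relation, object) edge layout, the 'in' and 'have' relation names, and the word-level matching are all assumptions.

```python
from typing import Set, Tuple

Edge = Tuple[str, str, str]  # (subject, relation, object), an assumed layout


def keep_edge(edge: Edge, text_tokens: Set[str]) -> bool:
    """Return True for player location or inventory edges, or when every
    word of the subject and the object appears in the text tokens."""
    subj, rel, obj = edge
    # Always keep 'you in X' (location) and 'you have X' (inventory).
    if subj == "you" and rel in ("in", "have"):
        return True
    # Otherwise require both endpoints to be mentioned in the description.
    words = set(subj.split()) | set(obj.split())
    return words <= text_tokens


tokens = {"you", "see", "a", "brass", "lamp", "on", "table"}
print(keep_edge(("you", "in", "cellar"), tokens))
print(keep_edge(("brass lamp", "on", "table"), tokens))
print(keep_edge(("sword", "in", "chest"), tokens))
```

The first edge is kept because it is the player location, the second because both endpoints are mentioned in the text, and the third is pruned.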

mojtaba-komeili · 2021-08-31T13:47:18Z

> Overall this implementation is looking good - thanks for taking the time to break down the Jericho dataset and figure out what was not quite aligning. Did you come up with an answer to why some of the target actions weren't aligning with the given content?
>
> I've left some nits, mostly about documentation for things that aren't super clear. You don't need to necessarily always be documenting at this level, but it certainly helps for getting code OSS ready early on, and makes it easier to return to this later when we need it again.

I couldn't figure out many of them, so I ended up dropping them if they were detected as incomplete. I was able to "patch" some of them to some extent by checking the step before/after. I reached out to them for clarification, but haven't received any response yet.

mojtaba-komeili requested review from JackUrb and jaseweston August 18, 2021 21:53

facebook-github-bot added the CLA Signed label Aug 18, 2021

mojtaba-komeili force-pushed the jericho-world branch 2 times, most recently from 6324b46 to 9d40e90 Compare August 19, 2021 13:46

mojtaba-komeili added 20 commits August 23, 2021 06:22

data build

682ec77

temp agent

0725f80

state to knowledge graph

b3574ca

action teacher base

0eea956

reformat

b9ec9f9

basic teachers modified

2a6d845

room content

c8a54df

surrounding objects

b1b509b

skip example func

1b4c818

refactoring minor

6979618

graph mutations teacher

5bab550

F1 graph mutation metric

1d200e9

refactoring

4d7db64

readme

5f88cc8

tests

b67b4f7

debug: removing incomplete graphs

3fc807e

added task to the task list file

1ed7c05

debug: the graph F1

bbca77e

added subject and subject-relation F1

cfbccf6

removed extra import by IDE :X

155c7b9

mojtaba-komeili force-pushed the jericho-world branch from 77e2c07 to 155c7b9 Compare August 23, 2021 13:23

mojtaba-komeili added 4 commits August 25, 2021 14:20

graph repair + refactored

f18f058

pruning the tree

3ae22f0

pruning the static knowledge graph teacher

7f6c70e

action to kg mutation

cfcb852

mojtaba-komeili added 6 commits August 27, 2021 12:50

the action teachers

0329761

updated teacher tests

4fbed17

updated the readme.

1f8593b

keeping inventory in the kg during the pruning

b9d8a19

readme update

fb49d97

nltk download, for passing circleCI tests

62c9383

JackUrb approved these changes Aug 30, 2021

pr comments

1b3948d

mojtaba-komeili merged commit 6c56a29 into master Aug 31, 2021

mojtaba-komeili deleted the jericho-world branch August 31, 2021 14:24
