Add file for helper metrics for slots #3138

moyapchen · 2020-10-01T01:49:31Z

SlotF1Metric + SlotF1Metric test imported from an internal implementation. Will go through and refactor + verify that I don't break existing tasks using this next.

Testing steps
`pytest tests/SlotF1Metric + SlotF1Metric test imported from an internal implementation. Will go through and refactor + verify that I don't break existing tasks using this next.

Testing steps
pytest tests/test_slot_metrics.py

Note that I did change the SlotF1Metric class (and its associated test) a bit in the 2nd change in this PR in order to get tests to run properly. (I got assertion errors trying to run the pytest test_slot_f1_metrics.py in its original internal directory so figured it would need to change anyhow.)

moyapchen · 2020-10-01T01:57:54Z

Well, that's a bunch of automatically caught bugs. Time to put this back into draft mode...

moyapchen · 2020-10-01T06:17:23Z

Putting this ready for review since logic seems good.

Note that running the autoformat.sh script locally either does nothing if I do ./../autoformat.sh from ParlAI/parlai or crashes my terminal window if I do source autoformat.sh (adding a path prefix to autoformat as necessary) from any directory...

parlai/utils/task_dialogue_helpers/metrics.py

stephenroller · 2020-10-01T14:56:00Z

parlai/utils/task_dialogue_helpers/metrics.py

+        # we drop Python 3.6
+        if other is None:
+            return self
+        slot_p = _average_type_sum_helper(self._slot_p, other._slot_p)


do we get Nones?

We might if both arguments are Nones.

Ya I just don't understand if/when that happens

Ahhh will add a comment about this to clarify.

Basically if you look at SlotMetrics::__init__(), it's clever about how it aggregates F1 metric components, accumulating the precision and recall values separately. However, this means there might be Nones floating around. (I copied this behavior from the original implementation of SlotF1Metrics in the multiwoz agent.)

@stephenroller this is what you had changed because you were occasionally getting Nones for the metric during validation, right?

parlai/utils/task_dialogue_helpers/metrics.py

stephenroller · 2020-10-01T15:01:30Z

I think autoformat assumes you're running from the top directory. That might be something to fix, but for now, just run from the root of the repo.

…ltiple metrics be added to it See #3138 for context and use

…g in python... (alas, that's what I get for mostly coding in C++ for the past few years. :P)

EricMichaelSmith

Seems reasonable, given that tests are passing. Will let Stephen approve since he had had questions before (but otherwise happy to do so if needed)

parlai/core/tod.py

…ctly)

…3145) * Add notion of metrics collections, which can have other Metrics of multiple metrics be added to it See #3138 for context and use * right, having different arguments for the same function aren't a thing in python... (alas, that's what I get for mostly coding in C++ for the past few years. :P) * fixed a bug while integrating into taskmaster2 * address comments (get rid of separate class, add func to Metrics directly) * actually do the things the last comment

* Add test for interactive_web * Spinlock * Hm. * Lint.

* Allow missing init opt opts * Add part of unit test * Work on unit test * Test fixes * Fix second test * Fix test * Check obsolete arg does not exist

…ltiple metrics be added to it See #3138 for context and use

…ctly)

SlotF1Metric + SlotF1Metric test imported from an internal implementation. Will go through and refactor + test existing tasks to use this next.

Not quiet sure if returning "nans" for F1 metrics is the best way of going about things, but probably okay as a start. Also assuming the Metrics base class will handle addition correctly. Added the test case to make sure there weren't any glaring syntax errors with how I implemented SlotMetrics Nominally should probably add more test cases to validate slot-based domain metrics are working correctly, but that seems lower risk/lower pri and will cover it anyway when I update existing tasks.

`docformatter -i --pre-summary-newline --wrap-descriptions 88 --wrap-summaries 88 --make-summary-multi-line` on the relevant file

…shing so all my mispellings don't go into the commit log...

Noticed while testing on taskmaster that jga was higher than both slot_r and slot_p... put in some prints and turns out it was counting the {} == {} case. This doesn't really make sense to do in general, though there are some scnearios where not having any slots is the correct response... so put in a flag to compensate.

…addressing comments = adding comments + fixing grammar, mostly).

…3145) * Add notion of metrics collections, which can have other Metrics of multiple metrics be added to it See #3138 for context and use * right, having different arguments for the same function aren't a thing in python... (alas, that's what I get for mostly coding in C++ for the past few years. :P) * fixed a bug while integrating into taskmaster2 * address comments (get rid of separate class, add func to Metrics directly) * actually do the things the last comment

…ltiple metrics be added to it See #3138 for context and use

…3145) * Add notion of metrics collections, which can have other Metrics of multiple metrics be added to it See #3138 for context and use * right, having different arguments for the same function aren't a thing in python... (alas, that's what I get for mostly coding in C++ for the past few years. :P) * fixed a bug while integrating into taskmaster2 * address comments (get rid of separate class, add func to Metrics directly) * actually do the things the last comment

…ltiple metrics be added to it See #3138 for context and use

…3145) * Add notion of metrics collections, which can have other Metrics of multiple metrics be added to it See #3138 for context and use * right, having different arguments for the same function aren't a thing in python... (alas, that's what I get for mostly coding in C++ for the past few years. :P) * fixed a bug while integrating into taskmaster2 * address comments (get rid of separate class, add func to Metrics directly) * actually do the things the last comment

moyapchen · 2020-10-02T19:49:05Z

Converting this to draft mode, cause rebasing + git is doing weird stuff.

…ltiple metrics be added to it See #3138 for context and use

moyapchen · 2020-10-02T20:33:49Z

Abandoning this; will do internally and sync once it's all nice.

moyapchen requested review from stephenroller and EricMichaelSmith October 1, 2020 01:49

facebook-github-bot added the CLA Signed label Oct 1, 2020

moyapchen marked this pull request as draft October 1, 2020 01:57

moyapchen marked this pull request as ready for review October 1, 2020 06:15

stephenroller reviewed Oct 1, 2020

View reviewed changes

Add notion of metrics collections, which can have other Metrics of mu…

84e80f2

…ltiple metrics be added to it See #3138 for context and use

moyapchen mentioned this pull request Oct 1, 2020

Add ability for Metrics to add metrics from another Metrics object #3145

Merged

moyapchen force-pushed the slot_teachers branch from 76a5f54 to d3b5a19 Compare October 1, 2020 21:12

moyapchen changed the base branch from master to metric_collections October 1, 2020 21:13

right, having different arguments for the same function aren't a thin…

7793b4f

…g in python... (alas, that's what I get for mostly coding in C++ for the past few years. :P)

moyapchen force-pushed the slot_teachers branch from d3b5a19 to b4479f5 Compare October 1, 2020 21:19

fixed a bug while integrating into taskmaster2

cd40239

moyapchen force-pushed the slot_teachers branch from c79a3a6 to e3e35b7 Compare October 2, 2020 00:25

moyapchen requested a review from stephenroller October 2, 2020 07:21

EricMichaelSmith reviewed Oct 2, 2020

View reviewed changes

parlai/core/tod.py Outdated Show resolved Hide resolved

parlai/core/tod.py Show resolved Hide resolved

parlai/core/tod.py Show resolved Hide resolved

Moya Chen added 2 commits October 2, 2020 09:12

address comments (get rid of separate class, add func to Metrics dire…

95d7163

…ctly)

actually do the things the last comment

e29e591

Base automatically changed from metric_collections to master October 2, 2020 18:24

stephenroller and others added 7 commits October 2, 2020 11:33

Add test for interactive_web (#3114)

dd258a1

* Add test for interactive_web * Spinlock * Hm. * Lint.

Allow missing args when using --init-opt (#3112)

32aac03

* Allow missing init opt opts * Add part of unit test * Work on unit test * Test fixes * Fix second test * Fix test * Check obsolete arg does not exist

Add notion of metrics collections, which can have other Metrics of mu…

b24b87e

…ltiple metrics be added to it See #3138 for context and use

address comments (get rid of separate class, add func to Metrics dire…

46ed81c

…ctly)

Add file for helper metrics for slots

a26d96b

SlotF1Metric + SlotF1Metric test imported from an internal implementation. Will go through and refactor + test existing tasks to use this next.

Move files + update file comment, as suggested

69f83b1

Moya Chen added 8 commits October 2, 2020 11:33

Manually run

d2e5156

`docformatter -i --pre-summary-newline --wrap-descriptions 88 --wrap-summaries 88 --make-summary-multi-line` on the relevant file

Make sure tests are fine with moves

f71815e

Add domain-specific jga

63fe040

Fix spelling. Need to be better about running tests locally before pu…

a832b18

…shing so all my mispellings don't go into the commit log...

fixes found while integrating into taskmaster2

3f2ad58

tesssssts

c7ff814

address comments, Tuple -> List cause the errors were bothering me. (…

6b9a0a4

…addressing comments = adding comments + fixing grammar, mostly).

moyapchen force-pushed the slot_teachers branch from fea599b to 6b9a0a4 Compare October 2, 2020 19:31

moyapchen and others added 2 commits October 2, 2020 12:38

Add notion of metrics collections, which can have other Metrics of mu…

7825fcc

…ltiple metrics be added to it See #3138 for context and use

moyapchen pushed a commit that referenced this pull request Oct 2, 2020

Add notion of metrics collections, which can have other Metrics of mu…

fb39525

…ltiple metrics be added to it See #3138 for context and use

moyapchen pushed a commit that referenced this pull request Oct 2, 2020

Add notion of metrics collections, which can have other Metrics of mu…

bef4e64

…ltiple metrics be added to it See #3138 for context and use

moyapchen marked this pull request as draft October 2, 2020 19:48

figure out what's going on with rebase errors...

1f05032

moyapchen pushed a commit that referenced this pull request Oct 2, 2020

Add notion of metrics collections, which can have other Metrics of mu…

0e30f38

…ltiple metrics be added to it See #3138 for context and use

moyapchen closed this Oct 2, 2020

moyapchen deleted the slot_teachers branch October 2, 2020 20:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add file for helper metrics for slots #3138

Add file for helper metrics for slots #3138

moyapchen commented Oct 1, 2020 •

edited

Loading

moyapchen commented Oct 1, 2020

moyapchen commented Oct 1, 2020

stephenroller Oct 1, 2020

moyapchen Oct 1, 2020

stephenroller Oct 2, 2020

moyapchen Oct 2, 2020

EricMichaelSmith Oct 2, 2020

stephenroller commented Oct 1, 2020

EricMichaelSmith left a comment •

edited

Loading

moyapchen commented Oct 2, 2020

moyapchen commented Oct 2, 2020

Add file for helper metrics for slots #3138

Add file for helper metrics for slots #3138

Conversation

moyapchen commented Oct 1, 2020 • edited Loading

moyapchen commented Oct 1, 2020

moyapchen commented Oct 1, 2020

stephenroller Oct 1, 2020

Choose a reason for hiding this comment

moyapchen Oct 1, 2020

Choose a reason for hiding this comment

stephenroller Oct 2, 2020

Choose a reason for hiding this comment

moyapchen Oct 2, 2020

Choose a reason for hiding this comment

EricMichaelSmith Oct 2, 2020

Choose a reason for hiding this comment

stephenroller commented Oct 1, 2020

EricMichaelSmith left a comment • edited Loading

Choose a reason for hiding this comment

moyapchen commented Oct 2, 2020

moyapchen commented Oct 2, 2020

moyapchen commented Oct 1, 2020 •

edited

Loading

EricMichaelSmith left a comment •

edited

Loading