Full Tensorboard metric titles #3534

spencerp · 2021-03-17T20:58:51Z

Patch description
The abbreviated metric titles are hard to get used to, but using longer metric identifiers makes the stdout print hard to parse.
This PR adds a separation between the metric identifier and the display name of the metric. It also adds a description. The title and description are included in tensorboard, but the abbreviated identifiers are used in the logs.
It moves the source of truth for metric titles and descriptions from the docs to the code, and generates the metrics table from this.

There are a few things I don't really have time to tackle right now, but would be nice for a later PR:

Representing families of metrics more automatically in the source of truth dict (collapse rouge-* metrics and use appropriate name and description if any metrics matching that format are used).
Collapsing multiple metrics of the same family in the metrics table
Preserving the monospace formatting of the metrics in the docs table

Testing steps
Ran a basic test run locally:

parlai train_model --task babi:task10k:1 --model-file ~/tmp/babi_memnn --batchsize 1 --num-epochs 5 --model memnn --no-cuda -tblog True

Verified the metrics were short and fit on the screen:

Verified that they were long and had descriptions on Tensorboard:

Built website:

cd docs; make html

Verified the metrics list was rendered:

stephenroller

v cool

docs/source/generate_metric_list.py

stephenroller · 2021-03-17T22:28:43Z

parlai/core/metrics.py

@@ -34,6 +34,100 @@
 }
 ALL_METRICS = DEFAULT_METRICS | ROUGE_METRICS | BLEU_METRICS | DISTINCT_METRICS

+MetricDisplayData = namedtuple('MetricDisplayData', ('title', 'description'))


how about a data class instead

Hm, what's the advantage you see? I was using a namedtuple because this isn't really data that should be mutable.

If it's the pretty syntax you're looking for, though, I just found out there's a nicer syntax for namedtuples:

class MetricDisplayData(NamedTuple): title: str description: str

We used NamedTuples before and it gave me headaches and now i stay away with them. That syntax is nicer and fine with me.

I'm curious what headaches you encountered! I haven't used them a ton (just for small things like this here and there) so maybe there're headaches incoming I'm ignorant of.

stephenroller · 2021-03-17T22:29:20Z

parlai/core/metrics.py

+}
+
+
+def get_metric_display_data(metric: str) -> MetricDisplayData:


maybe as a utility of MetricsDisplayData

It's kind of nice to keep this functional, though, since there isn't any state we should be keeping around. Also, it needs access to METRICS_DISPLAY_DATA which I think makes more sense scoped to the namespace than a class. What's the advantage you see from putting it in MetricDisplayData?

I was thinking of a classmethod (and maybe the global too), just to keep everything in a tight namespace.

lol prolly the global can't be in there so long as it's self-typed...

anyway saul goodman

Yeah I think if we went that path, the metrics/titles/descriptions would live in a separate json/yaml file. And we'd have a separate function that loads them up as MetricDisplayDatas into a global. But then there'd be a disconnect between the source of truth and the global which is a little weird. I guess we could make MetricDisplayData a singleton and load them up on instantiation, but then we have to instantiate an object just to get this static list of strings which feels heavy.

Another way to keep them in a tight namespace would be to just create a metrics_list.py module.

Idk let me know if any of those options sound better, I see plenty of advantages and disadvantages to each so not super opinionated lol

parlai/core/metrics.py

spencerp and others added 3 commits March 17, 2021 12:24

shorten truncation len metric

8dff4c2

metric titles and descriptions

793f014

add backticks

fdab2e6

spencerp requested review from stephenroller and jaseweston March 17, 2021 20:58

facebook-github-bot added the CLA Signed label Mar 17, 2021

spencerp added 2 commits March 17, 2021 14:02

update unit test

791b35b

move metrics into to metrics.py and remove id from title

c42f295

stephenroller reviewed Mar 17, 2021

View reviewed changes

spencerp added 2 commits March 17, 2021 18:40

stephen's comments

6edbb6b

missed one

d31a9af

stephenroller approved these changes Mar 18, 2021

View reviewed changes

spencerp merged commit d016e58 into master Mar 18, 2021

spencerp deleted the trun-len-2 branch March 18, 2021 21:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Full Tensorboard metric titles #3534

Full Tensorboard metric titles #3534

spencerp commented Mar 17, 2021

stephenroller left a comment

stephenroller Mar 17, 2021

spencerp Mar 18, 2021

stephenroller Mar 18, 2021

spencerp Mar 18, 2021

stephenroller Mar 17, 2021

spencerp Mar 18, 2021

stephenroller Mar 18, 2021

stephenroller Mar 18, 2021 •

edited

Loading

spencerp Mar 18, 2021

		}


		def get_metric_display_data(metric: str) -> MetricDisplayData:

Full Tensorboard metric titles #3534

Full Tensorboard metric titles #3534

Conversation

spencerp commented Mar 17, 2021

stephenroller left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

stephenroller Mar 18, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

stephenroller Mar 18, 2021 •

edited

Loading