Add examples telemetry #17552
Conversation
The documentation is not available anymore as the PR was closed or merged.
```python
# Sending telemetry. Tracking the example usage helps us better allocate resources to maintain them. The
# information sent is the one passed as arguments along with your Python/PyTorch versions.
model_name = None if os.path.isdir(model_args.model_name_or_path) else model_args.model_name_or_path
if data_args.task_name is not None:
    dataset_name = f"glue-{data_args.task_name}"
elif data_args.dataset_name is not None:
    dataset_name = data_args.dataset_name
else:
    dataset_name = None
send_example_telemetry("run_glue", model_name=model_name, dataset_name=dataset_name)
```
I'd rather have as few lines of code as possible for the telemetry, so that it doesn't get in the way of understanding the example. Thanks to the diff I can see that this whole block exists only for the `send_example_telemetry` call, but if I were reading the example on its own, I would assume that `model_name` and `dataset_name` are re-used afterwards.

How about passing `model_args`/`data_args` directly to the method? Or, if you think there is too much information in there, how about defining a local method so that the scope of these variables is clear?
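A rough sketch of that alternative, assuming the helper inspects the argument dataclasses itself so each call site stays a single line; the body below is illustrative, not the actual implementation:

```python
import os
from dataclasses import fields

def send_example_telemetry(example_name, *example_args):
    # Sketch: derive the reportable fields from the argument dataclasses
    # instead of computing them at the call site.
    data = {"example": example_name}
    for args in example_args:
        args_as_dict = {f.name: getattr(args, f.name) for f in fields(args)}
        # Skip local paths: only report Hub model identifiers.
        model_name = args_as_dict.get("model_name_or_path")
        if model_name is not None and not os.path.isdir(model_name):
            data["model_name"] = model_name
        if args_as_dict.get("task_name") is not None:
            data["dataset_name"] = f"glue-{args_as_dict['task_name']}"
        elif args_as_dict.get("dataset_name") is not None:
            data["dataset_name"] = args_as_dict["dataset_name"]
    # ... attach Python/PyTorch versions and send `data` ...
    return data
```

With this shape, the readability concern above goes away: the example scripts contain only the one-line `send_example_telemetry(...)` call.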
That's perfect! Thanks for iterating, @sgugger
```diff
@@ -399,6 +399,10 @@ def main():
     else:
         model_args, data_args, training_args = parser.parse_args_into_dataclasses()

+    # Sending telemetry. Tracking the example usage helps us better allocate resources to maintain them. The
+    # information sent is the one passed as arguments along with your Python/PyTorch versions.
+    send_example_telemetry("flax_run_summarization", model_args, data_args)
```
Reading this, I was wondering if we shouldn't add the framework in front of all examples: `flax_run_summarization`, `pytorch_run_summarization`, etc.

But I wonder if we couldn't do better with `pytorch/run_summarization` -> would that lead to better analysis with Kibana? In any case I think having the framework name is super useful, so I'd personally add it to all examples.
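For illustration, passing the framework separately (as the merge commit list below phrases it) could look like the call sites in this sketch; the `framework=` keyword argument is an assumption here, not a confirmed signature:

```python
# Hypothetical call sites with the framework as its own field rather than a
# prefix baked into the example name (the `framework=` name is an assumption):
send_example_telemetry("run_summarization", model_args, data_args, framework="flax")
send_example_telemetry("run_summarization", model_args, data_args, framework="pytorch")
```

Keeping the framework as a separate field rather than a name prefix would let an analysis tool such as Kibana group and filter on it directly.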
Excellent!
Thanks a lot for working on this @sgugger - that's super useful!
* Add examples telemetry
* Alternative approach
* Add to all other examples
* Add to templates as well
* Put framework separately
* Same for TensorFlow
What does this PR do?
This PR adds a function to send telemetry to help us track example usage, and uses it in the current examples. For now I've only added it in the PyTorch `run_glue.py`, but I will paste it in all other examples if you agree with the format/data tracked.
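As a sketch of the data described above (the arguments passed to the script plus the Python/PyTorch versions), the payload could be assembled roughly as follows; `build_telemetry_payload` is a hypothetical name for this illustration, not the function added by the PR:

```python
import importlib.util
import platform

def build_telemetry_payload(example_name, *example_args):
    # Collect the example name plus the runtime versions mentioned in the PR description.
    payload = {
        "example": example_name,
        "python_version": platform.python_version(),
    }
    # Record the PyTorch version only if it is installed.
    if importlib.util.find_spec("torch") is not None:
        import torch
        payload["torch_version"] = torch.__version__
    # Fold in every explicitly set argument from the dataclasses
    # (assuming plain dataclasses, where vars() returns the field dict).
    for args in example_args:
        payload.update({k: v for k, v in vars(args).items() if v is not None})
    return payload
```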