Add an examples folder for code downstream tasks #18679
Conversation
The documentation is not available anymore as the PR was closed or merged.
Looks good, I mostly left comments for simplifying the training code a bit.
optimizer = AdamW(get_grouped_params(model, args), lr=args.learning_rate)
lr_scheduler = get_scheduler(
    name=args.lr_scheduler_type,
    optimizer=optimizer,
    num_training_steps=args.num_epochs,
    num_warmup_steps=args.num_warmup_steps,
)
I think you can define all that in the training arguments, no? No need to pass an optimizer/lr_scheduler explicitly?
I think by default the linear scheduler is used and I needed the cosine scheduler here, but you're right, we don't need to specify the optimizer.
I think you can also specify cosine: https://github.com/huggingface/transformers/blob/v4.21.1/src/transformers/trainer_utils.py#L356
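For reference, a minimal sketch of driving both choices through `TrainingArguments` alone (all values below are placeholders, not the ones used in the CodeParrot scripts; `model`, `train_ds`, and `eval_ds` are assumed to come from the training script):

```python
from transformers import Trainer, TrainingArguments

# Minimal sketch: no optimizer or scheduler object is passed; the Trainer
# builds AdamW and a cosine schedule from the arguments below.
training_args = TrainingArguments(
    output_dir="./checkpoints",      # placeholder
    learning_rate=5e-4,              # placeholder
    lr_scheduler_type="cosine",      # selects the cosine schedule
    warmup_steps=100,                # placeholder
    num_train_epochs=3,              # placeholder
    weight_decay=0.1,                # placeholder
)
trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_ds,
    eval_dataset=eval_ds,
)
```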
def get_grouped_params(model, args, no_decay=["bias", "ln_1.weight", "ln_2.weight", "ln_f.weight"]):
    params_with_wd, params_without_wd = [], []
    for n, p in model.named_parameters():
        if any(nd in n for nd in no_decay):
            params_without_wd.append(p)
        else:
            params_with_wd.append(p)
    return [
        {"params": params_with_wd, "weight_decay": args.weight_decay},
        {"params": params_without_wd, "weight_decay": 0.0},
    ]
I think if you don't pass the optimizer explicitly, the Trainer will take care of that for you, no?
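For context, a sketch of relying on the defaults instead, assuming the Trainer's standard optimizer creation (which, as far as I know, already skips weight decay for bias and layer-norm parameters; values are placeholders):

```python
from transformers import Trainer, TrainingArguments

# Sketch with placeholder values: since no optimizer is passed, the Trainer
# creates AdamW itself and applies `weight_decay` only to parameters that
# are not biases or layer-norm weights, making the manual grouping optional.
training_args = TrainingArguments(output_dir="./checkpoints", weight_decay=0.1)
trainer = Trainer(model=model, args=training_args, train_dataset=train_ds)
```

If custom grouping were still required, the `optimizers=(optimizer, lr_scheduler)` argument of `Trainer` would be the usual escape hatch.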
class CustomCallback(TrainerCallback):
    def __init__(self, trainer) -> None:
        super().__init__()
        self._trainer = trainer

    def on_epoch_end(self, args, state, control, **kwargs):
        if control.should_evaluate:
            control_copy = deepcopy(control)
            self._trainer.evaluate(eval_dataset=self._trainer.train_dataset, metric_key_prefix="train")
            return control_copy
Why is this needed? Isn't this the same as `evaluation_strategy="epoch"` in the training arguments? Also, why do you evaluate on the train set?
I added this because I wanted to monitor the gap in accuracy between the training set and the evaluation set.
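For reference, a minimal sketch of how such a callback is usually attached; because it needs a reference to the trainer itself, it is added after construction rather than via the `callbacks=` constructor argument (the model, arguments, and dataset objects are assumed to come from the script above):

```python
from transformers import Trainer

# Sketch: CustomCallback is the class from the diff above.
trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_ds,
    eval_dataset=eval_ds,
)
trainer.add_callback(CustomCallback(trainer))
# With evaluation_strategy="epoch", each epoch should now log both
# `train_*` and `eval_*` metrics.
trainer.train()
```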
@@ -12,7 +12,7 @@ This is an open-source effort to train and evaluate code generation models. Code
- continuously push checkpoints to the hub with `huggingface_hub`
- stream the dataset with `datasets` during training to avoid disk bottlenecks
- apply the `code_eval` metric in `datasets` to evaluate on [OpenAI's _HumanEval_ benchmark](https://huggingface.co/datasets/openai_humaneval)
- showcase examples for downstream tasks with code models in [examples](https://github.com/huggingface/transformers/tree/main/examples/research_projects/codeparrot/examples) folder
Maybe we can say what examples we show there. Should we also add the code for the text2py and py2text here?
Ok, but the text2py and py2text examples use a similar script to the one for pretraining codeparrot, just with a different dataset and model checkpoint. Maybe I can just mention that in the README?
Sounds good. I think documenting the settings would also be useful.
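As a rough illustration of that point (all model and dataset identifiers below are placeholders, not necessarily the ones used in the project), the downstream runs would only change the inputs of the pretraining setup:

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder identifiers; substitute the real text<->code dataset and
# the CodeParrot checkpoint used for fine-tuning.
model_ckpt = "codeparrot/codeparrot-small"       # assumed starting checkpoint
dataset_name = "my-org/text-to-python-pairs"     # hypothetical dataset id

tokenizer = AutoTokenizer.from_pretrained(model_ckpt)
model = AutoModelForCausalLM.from_pretrained(model_ckpt)
train_ds = load_dataset(dataset_name, split="train", streaming=True)
# From here on, the same Trainer setup as the pretraining script applies unchanged.
```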
Commits 12762f5 to 17e508d:
* add examples subfolder
* mention examples in codeparrot readme
* use Trainer optimizer and scheduler type and add output_dir as argument
* add example of text-to-python and python-to-text models
* mention the downstream examples in the readme
* fix typo
What does this PR do?
This PR adds a folder in the CodeParrot directory to store examples for downstream tasks on code models.