Add data collator for causal completion training #292
base: main
Conversation
Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
@@ -300,6 +300,8 @@ def train(
        torch_dtype: Optional[str] = None,  # TODO: Optional[Union[torch.dtype, str]]
        silence_progress_bars: Optional[bool] = True,
        seed: int = RANDOM_SEED,
+       train_on_completion: bool = False,
+       response_template: str = None,
The decision to expose the response template flag may change, just FYI. I won't be merging this code right away, but I want to ensure the rest of it looks OK, assuming it is a user-provided argument.
I think this looks good, Sukriti! Some thoughts below, thanks!
collator_kwargs["mlm"] = False | ||
|
||
return DataCollatorForCompletionOnlyLM( | ||
tokenizer=self._tokenizer, return_tensors="pt", **collator_kwargs |
Have you tested this with multi-GPU by any chance? In the past we've seen some strange behavior from some of the collators around dynamic padding; curious if that's been observed here.
Raghu's team has tested it with multi-GPU, but the plan is to use this codebase only for single-GPU going forward. Multi-GPU will go through the non-caikit path.
if "mlm" not in collator_kwargs: | ||
collator_kwargs["mlm"] = False | ||
|
||
return DataCollatorForCompletionOnlyLM( |
The SFT Trainer hasn't been integrated quite yet, correct? Could you add a TODO comment here to validate that this can't be used if the trainer is initialized with packing=True, so that we don't miss that edge case in the future? https://huggingface.co/docs/trl/v0.7.4/en/sft_trainer#train-on-completions-only
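A minimal sketch of what such a guard could look like, assuming a hypothetical helper name and a hypothetical "packing" kwarg (this codebase does not currently pass one); the real integration point would be _get_data_collator in this PR:

def _validate_collator_kwargs(**kwargs) -> None:
    """Reject completion-only collation when example packing is requested."""
    # TODO: DataCollatorForCompletionOnlyLM requires unpacked examples, so this
    # collator must never be combined with a trainer initialized with packing=True.
    if kwargs.get("train_on_completion") and kwargs.get("packing"):
        raise ValueError(
            "train_on_completion=True is incompatible with packing=True; "
            "completion-only collation needs one example per sequence."
        )

# Example: the combination below would raise a ValueError.
# _validate_collator_kwargs(train_on_completion=True, packing=True)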
SFTTrainer won't be integrated in this codebase :) This is probably one of the last PRs to go in for tuning; only LoRA after this.
Background being: we want to enable completion training for the current PT and LoRA, which will go through this codebase.
I will add the comment though, thanks.
@@ -168,6 +169,18 @@ def _get_data_collator(self, **kwargs) -> "transformers.DataCollator":
            Collator to be used for causal language modeling.
        """

+       if "train_on_completion" in kwargs and kwargs["train_on_completion"]:
+           applicable_args = ["mlm", "response_template", "instruction_template"]
I think it's okay for now, but we might want to link the place where we get the applicable args from for different collator types; otherwise this might get confusing eventually.
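For reference, a rough sketch of the filtering pattern this hunk implements, using the kwarg names from the PR (mlm, response_template, instruction_template); linking the TRL collator docs next to the list would address the note above. The helper name is illustrative only:

from typing import Any, Dict

# Kwargs accepted by DataCollatorForCompletionOnlyLM (see the TRL docs); any
# other training kwargs are dropped before the collator is constructed.
APPLICABLE_COLLATOR_ARGS = ["mlm", "response_template", "instruction_template"]

def filter_collator_kwargs(kwargs: Dict[str, Any]) -> Dict[str, Any]:
    """Keep only the kwargs the completion-only collator understands."""
    collator_kwargs = {k: v for k, v in kwargs.items() if k in APPLICABLE_COLLATOR_ARGS}
    # Completion-only training is causal LM training, never masked LM.
    collator_kwargs.setdefault("mlm", False)
    return collator_kwargs

# filter_collator_kwargs({"response_template": "### Answer:", "seed": 42})
# -> {"response_template": "### Answer:", "mlm": False}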
@@ -28,6 +28,7 @@ dependencies = [
    "torch>=2.0.1",
    "tqdm>=4.65.0",
    "transformers>=4.32.0",
+   "trl>=0.7.2",
Should we pin an upper bound, given that trl is in 0.x?
Oh yes, thanks, will do.
help="Train on completion True or False", | ||
default=False, | ||
type=bool, | ||
choices=[True, False], |
Rather than choices, I think the encouraged pattern for bool flags in argparse is to use action=store_true or action=store_false. In general, using bool as a type converter can do some weird stuff in argparse because it converts any nonempty string to True, so I think this might not work quite as expected:
import argparse
parser = argparse.ArgumentParser()
parser.add_argument(
"--train_on_completion",
help="Train on completion True or False",
default=False,
type=bool,
choices=[True, False],
)
args = parser.parse_args()
print(args)
Running python3 testit.py --train_on_completion=False produces Namespace(train_on_completion=True).
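For comparison, a short sketch of the store_true pattern suggested above (the flag name is kept from this PR; passing the flag enables completion-only training, omitting it leaves the default of False):

import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    "--train_on_completion",
    help="Train on completions only",
    action="store_true",  # default is False; passing the flag sets it to True
)
args = parser.parse_args()
print(args)

# python3 testit.py                        -> Namespace(train_on_completion=False)
# python3 testit.py --train_on_completion  -> Namespace(train_on_completion=True)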
Thanks, I didn't actually mean to commit this example. I will make this change and test that the example actually works.
This effort adds the ability for completion-only LM training as an optional flag. How it should be exposed to users in the product is still being discussed, but we want to add the ability to support it in the library for causal LMs.
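For readers unfamiliar with completion-only collation, here is a minimal sketch (not part of this PR) of what TRL's DataCollatorForCompletionOnlyLM does: every label before the response template is set to -100, so the loss is computed only on the completion. The model name and template string below are illustrative assumptions:

from transformers import AutoTokenizer
from trl import DataCollatorForCompletionOnlyLM

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # any causal LM tokenizer
tokenizer.pad_token = tokenizer.eos_token  # gpt2 has no pad token by default

collator = DataCollatorForCompletionOnlyLM(
    response_template="### Answer:",  # hypothetical template, for illustration only
    tokenizer=tokenizer,
    mlm=False,
)

example = tokenizer("### Question: What is 2 + 2?\n### Answer: 4")
batch = collator([example])
# Prompt tokens get label -100 (ignored by the loss); only the tokens after
# "### Answer:" contribute to the training loss.
print(batch["labels"])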