Adding a ReFT notebook to the tutorials section #741

Conversation
Looks great overall, thanks for working on this! Left some mostly minor comments to be addressed before we merge :)
Additionally, could you add this new notebook to the list in the README of the notebooks folder, thanks!
"\n", | ||
"In this tutorial, we will be demonstrating how to fine-tune a language model using [Representation Finetuning for Language Models](https://arxiv.org/abs/2404.03592)\n", | ||
"\n", | ||
"We will use a traditional large language model and focus on fine tuning via ReFT adapters rather than the traditional full model fine tuning.\n", |
I'm not sure "traditional large language model" is a good description here. Maybe something like "lightweight encoder model" is a better fit for RoBERTa nowadays.
"source": [ | ||
"### Model and Adapter initialization\n", | ||
"\n", | ||
"We load the `roberta-base` model along with the `LoReftConfig`. We can initalize a `reft` config with only one line of code, and can add it to our base model using the `add_adapter` function. On top of that, we can add a classification head to our adapter specifying 3 labels.\n", |
Could be nice to link our docs on ReFT here: https://docs.adapterhub.ml/methods.html#reft and mention that they explain the supported config parameters.
" learning_rate=6e-4,\n", | ||
" per_device_train_batch_size=32,\n", | ||
" per_device_eval_batch_size=32,\n", | ||
" num_train_epochs=2,\n", |
please make a note that usually you'd train longer and this is only for demo purposes.
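As a hedged sketch of how that note could be reflected in the cell (the hyperparameters are the ones quoted above; `output_dir` is illustrative):

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="./loreft_mnli",      # illustrative output directory
    learning_rate=6e-4,
    per_device_train_batch_size=32,
    per_device_eval_batch_size=32,
    num_train_epochs=2,              # kept low for demo purposes; train longer for real runs
)
```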
" input_ids = tokenizer(text, truncation=True, padding='max_length')\n", | ||
" input_ids[\"input_ids\"] = torch.tensor(input_ids[\"input_ids\"])\n", | ||
" input_ids[\"attention_mask\"] = torch.tensor(input_ids[\"attention_mask\"])\n", |
You can add `return_tensors="pt"` (same as in `preprocess_function`) to the tokenizer call so you don't need to convert to tensors afterwards.
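Concretely, the suggested simplification would look roughly like this (names follow the snippet above):

```python
# return_tensors="pt" makes the tokenizer return PyTorch tensors directly,
# so the manual torch.tensor(...) conversions above are no longer needed.
input_ids = tokenizer(text, truncation=True, padding="max_length", return_tensors="pt")
```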
ah, good catch!
Sounds good, should the Whisper notebook be added as well? I don't see it in the README.
Yes please, we forgot that 👍
thanks for including the feedback!
This PR aims to add a tutorial notebook to utilize the `Loreft` adapter to fine-tune `roberta-base` on the `mnli` dataset. Reviews appreciated!