[RFC][DOCS] Recipe [DOCS] ([DOC]umentation) #1230

SalmanMohammadi · 2024-07-26T13:25:16Z

What is the purpose of this PR? Is it to

add a new feature
fix a bug
update tests and/or documentation
other (please add here)

Let's map out a user journey here:

I'm a l33t gamer with a 2xRTX 4090 battlestation. I've just beat Cyberpunk (on max settings, mind you), and now I'd like to get into fine-tuning LLMs. I'm excited about the latest release of my favourite animal-based LLM, Llama3.1. What do I want to do? Not 100% sure - fine-tuning an LLM for my Discord? The LLama 3.1 documentation has helped me find ✨torchtune✨! What can I do with it?? How can I quickly discover this?
Oh, cool, there's a quick tutorial for fine-tuning Llama3? And it uses LoRA? Great! And there's 🔥🔥🔥 documentation on customizing my own datasets? Amazing!!
Now, what else can I do with this LoRA "recipe"? Isn't there a page where I can understand what all these parameters mean, at once?
...
Well, what other recipes are there? What else can I do with ✨torchtune✨?
...

Right now, don't have a clear way to communicate to our users which recipes we support, and how to quickly configure them. Documentation for recipes is:

Hidden inside recipe files. Recipe-specific features are listed amongst features common to all recipes. This makes the documentation a bit difficult to parse.
Hidden inside our config files, which are also replicated across our configs. Our config files contain some of the more crucial details when using recipes; which commands to run, which levers to pull.

I want to maximise the surface area of the features we expose in torchtune. Our design philosophy keeps things flat and modular, users can swap out components, models, and datasets freely. I wish for the ML PhD and the l33t gamer to be able to discover what we offer, and how to use it, with equal ease.

My contribution addresses this in the following ways:

I propose a glossary of features which are common across all of our recipes, and also common amongst recipes with specialised fine-tuning features such as PEFT or FSDP(2).
I propose a simple recipe documentation template. This template uses the commands we'd usually place inside the config files to allow users to quickly get started with the recipe. This template then includes copy-and-paste text for lists of features which we commonly expose in recipes - these link to relevant sections in the glossary above.
These recipes are then simply indexed in the recipe overview.

I provide two examples in this PR; documentation for LoRA single device, and for QAT distributed. I'd like to put an issue up for documenting additional recipes so other people may help out here - it'll be a good first issue for many contributors..

There's also a couple things missing from the memory glossary; FSDP/FSDP2 (and maybe something else?), I'll also put issues up for this.

Test plan

⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢀⣀⣤⣤⣤⣤⣴⣤⣤⣄⡀⠀⠀⠀⠀⠀⠀⠀⠀⠀
⠀⠀⠀⠀⠀⠀⠀⣀⣴⣾⠿⠛⠋⠉⠁⠀⠀⠀⠈⠙⠻⢷⣦⡀⠀⠀⠀⠀⠀⠀
⠀⠀⠀⠀⠀⣤⣾⡿⠋⠁⠀⣠⣶⣿⡿⢿⣷⣦⡀⠀⠀⠀⠙⠿⣦⣀⠀⠀⠀⠀
⠀⠀⢀⣴⣿⡿⠋⠀⠀⢀⣼⣿⣿⣿⣶⣿⣾⣽⣿⡆⠀⠀⠀⠀⢻⣿⣷⣶⣄⠀
⠀⣴⣿⣿⠋⠀⠀⠀⠀⠸⣿⣿⣿⣿⣯⣿⣿⣿⣿⣿⠀⠀⠀⠐⡄⡌⢻⣿⣿⡷
⢸⣿⣿⠃⢂⡋⠄⠀⠀⠀⢿⣿⣿⣿⣿⣿⣯⣿⣿⠏⠀⠀⠀⠀⢦⣷⣿⠿⠛⠁
⠀⠙⠿⢾⣤⡈⠙⠂⢤⢀⠀⠙⠿⢿⣿⣿⡿⠟⠁⠀⣀⣀⣤⣶⠟⠋⠁⠀⠀⠀
⠀⠀⠀⠀⠈⠙⠿⣾⣠⣆⣅⣀⣠⣄⣤⣴⣶⣾⣽⢿⠿⠟⠋⠀⠀⠀⠀⠀⠀⠀
⠀⠀⠀⠀⠀⠀⠀⠀⠀⠉⠙⠛⠛⠙⠋⠉⠉⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀

pytorch-bot · 2024-07-26T13:25:19Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/1230

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 3d06179 with merge base 3653c4a ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

pbontrager

This first draft is looking great, and right in line with what I think we would want to expose the recipes.

docs/source/recipes/recipes_index.rst

docs/source/recipes/lora_finetune_single_device.rst

recipes/__init__.py

docs/source/recipes/lora_finetune_single_device.rst

docs/source/tutorials/memory_optimisations.rst

RdoubleA · 2024-08-05T14:33:27Z

Seems like #1252 would be well placed to address this?

Yes, if this is something @felipemello1 is planning to do. But we can leave as a follow-up

…to recipe_docs

SalmanMohammadi · 2024-08-05T17:45:05Z

@RdoubleA think I've addressed everything. Thanks again for such a thorough review

docs/source/index.rst

docs/source/tutorials/memory_optimisations.rst

ebsmothers · 2024-08-05T22:00:13Z

docs/source/tutorials/memory_optimisations.rst

+torchtune comes with a host of plug-and-play memory optimization components which give you lots of flexibility
+to ``tune`` our recipes to your hardware. This page provides a brief glossary of these components and how you might use them.


Commenting at the top after having read this whole tutorial: I think this is a great start and a sorely-needed tutorial. Just a couple things to think about to give it a more unified feel (doesn't have to be part of this PR, this is more aspirational).

Right now this leans more towards an index of techniques with some tradeoff discussions, I'd like to see it become more of a cookbook of what to apply and when. To that end, we could shoot for some kind of table enumerating (a) various memory-savings techniques and their impact, (b) implications for performance and/or model quality, and (c) guidance on which ones to try or combine under different regimes. Anyways this is a longer-term thing so no need to worry about it for this PR, would definitely be interested to hear your thoughts on whether something like that makes sense to you.

As discussed offline, both your's and Rafi's suggestions are on-the-nose.

Can we start collating these ideas centrally? We have this, #954, #1163, #1260, #1252, #1282

docs/source/recipes/lora_finetune_single_device.rst

docs/source/recipes/qat_distributed.rst

felipemello1

Great changes! Thank you so much for doing them. I am sure that the users will be thrilled :D

Overall comments can probably be summarize as two:

IMO, we can probably make some content more focused (e.g. less links);
There is some room to share trade-offs in the memory related parameters and suggesting reasonable defaults (I really liked this section!);

felipemello1 · 2024-08-13T01:00:15Z

docs/source/recipes/lora_finetune_single_device.rst

@@ -0,0 +1,60 @@
+.. _lora_finetune_recipe_label:
+


My two unfiltered cents: I am not sure how much I like all the links. I understand the intention, but in general, a good rule of thumb for me is "less is more". Also, as an engineer, i feel that most of us like when things go straight to the point, e.g. "show me code" or "a picture is worth a thousand words". However, when I try to think: "ok, how would i rewrite it?", it becomes a bit hard for me articulate something intelligent. So, if others are comfortable with it, its fine with me. But if its a shared feeling, maybe we could revisit it.

The links at the bottom are interesting though, as a type of "keep reading".

TLDR: Maybe increase ratio of LoRA-information / information. If most of the information are links or notes, then it may be too much noise.

I'm not sure I 100% agree here - IMO these docs aren't necessarily aimed towards engineers who are used to quickly reading condensed information, but to maximise discovery of what we offer in torchtune - this was my primary motivation for writing these.

I will comb through the links and make sure they're relevant/necessary though : )

Eh I do get @felipemello1's sentiment on this one. Esp because we immediately lead with 5 links. While many of them may be useful, I think we should instead lead with an example or something. Otherwise as a reader who just wants to understand how this recipe works/what it does I am immediately overwhelmed with pointers to literally half our live docs, making it hard to tease out what the actual relevant information is.

I see now, sorry Felipe I don't think I fully grasped your original point : ) I'll address.

I've updated to try remove the amount of noise and provide concrete examples, and leave additional information at the bottom.

felipemello1 · 2024-08-13T01:04:22Z

docs/source/recipes/lora_finetune_single_device.rst

+* :ref:`glossary_lora`.
+* :ref:`glossary_qlora`.
+
+As with all of our recipes, you can also:


maybe this is redundant, and pointing the user to "<config_tutorial_label>" is enough?

Sorry, maybe this file has been changed - what was this referring to?

on line 43, we say "Check out our :ref:configs tutorial <config_tutorial_label> to learn how to customize recipes to suit your needs.""

But then from line 51 to 57 we talk about config options not related to lora. I was thinking that maybe we dont need to add 4 links and 6 lines if this is already in the config_tutorial_label

I don't think the config tutorial includes these things. This section is more to make it abundantly clear that you can still other memory optimization features alongside LoRA. It's a generic section you can include in most of the recipe docs, so someone doesn't need to figure out which components they can use with which recipes

felipemello1 · 2024-08-13T01:10:42Z

docs/source/recipes/lora_finetune_single_device.rst

+LoRA Single Device Finetuning
+=============================
+
+This recipe supports finetuning on next-token prediction tasks using `LoRA <https://arxiv.org/abs/2106.09685>`_,


idea: Maybe for every recipe, we could add at the very start:

Use this recipe if:

A

B

C

For example:

Use this recipe if you:

Only have access to one GPU;

Want a small checkpoint;

Don't have much GPU memory available;

Is ok to possibly compromise a bit of accuracy in exchange of the above;

I like this idea!

I wonder if it might be better to include in the recipe overviews? Almost like a table of different compute desiderata and which recipes fulfill them.

felipemello1 · 2024-08-13T01:24:36Z

docs/source/recipes/recipes_overview.rst

+.. _recipes_overview_label:
+
+================
+Recipes Overview


Maybe we could delete it and this could be the first part of the recipe_deepdive? Or if its just supposed to be an index, maybe it can just be a list and leave the information for the recipe pages.

I personally like having a little context so this can be a kind-of standalone document for a reader. Second opinions? @joecummings @RdoubleA @ebsmothers @pbontrager

Personally I like it. Without it there we just jump right into individual recipe pages and it's not really clear what their purpose is; I feel like this page provides useful framing for the entire section.

docs/source/tutorials/first_finetune_tutorial.rst

docs/source/tutorials/memory_optimizations.rst

SalmanMohammadi · 2024-08-13T22:50:44Z

Okay, okay, I'll add a table. I saw this https://pytorch.org/docs/stable/torch.compiler_fine_grain_apis.html and I liked it.

SalmanMohammadi · 2024-08-19T13:09:43Z

Think I've addressed all the comments. Added a table at the top of the tutorial. Have DPO/PPO recipes docs in the oven too.

docs/source/tutorials/memory_optimizations.rst

felipemello1 · 2024-08-19T15:54:44Z

docs/source/tutorials/memory_optimizations.rst

@@ -8,14 +8,29 @@ Memory Optimization Overview

 torchtune comes with a host of plug-and-play memory optimization components which give you lots of flexibility
 to ``tune`` our recipes to your hardware. This page provides a brief glossary of these components and how you might use them.
+To make things easy, we've summarized these components in the following table:
+
+.. csv-table:: Memory optimization components


I like the table, but i think it misses the trade-offs/numbers and for the most part repeats: "Use when memory constrained". Maybe an index would be fine?

When we have the table with all the % value of VRAM/tps, i think we could put it back here. WDYT?

I've tried to callout a bit more the compromises each component requires on training speed, does that help? I'd like to make this useful and help users who don't want to read the whole tutorial, so very open to suggestions.

felipemello1 · 2024-08-19T15:56:40Z

thanks for making the changes. I think that they addressed most if not all of my concerns.

SalmanMohammadi · 2024-08-19T15:57:49Z

thanks for making the changes. I think that they addressed most if not all of my concerns.

tysm for reviewing : )

joecummings

Let's get this in and iterate as needed.

It's a 100% improvement from having zero information on our recipes. Thanks for this Herculean effort @SalmanMohammadi!

SalmanMohammadi added 3 commits July 22, 2024 16:47

testing

09443a0

init

487ebae

adding recipe docs draft

1b914ad

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 26, 2024

label bug....

416e6a1

SalmanMohammadi marked this pull request as draft July 26, 2024 13:32

updating

2422b11

pbontrager reviewed Jul 26, 2024

View reviewed changes

SalmanMohammadi added 5 commits August 1, 2024 14:08

init commit

a279cc4

Merge branch 'memory-optimisations-overview' into recipe_docs

6a38504

updating doc preview

04f4407

updating doc preview

78d0984

adding qat recipe, updating template

e970e89

SalmanMohammadi changed the title ~~[WIP][Docs] Recipe docs~~ [RFC][Docs] Recipe docs Aug 2, 2024

SalmanMohammadi marked this pull request as ready for review August 2, 2024 16:05

SalmanMohammadi requested review from joecummings, ebsmothers and pbontrager August 2, 2024 16:06

SalmanMohammadi commented Aug 2, 2024

View reviewed changes

docs/source/tutorials/memory_optimisations.rst Outdated Show resolved Hide resolved

fixing titling

8c0204c

SalmanMohammadi changed the title ~~[RFC][Docs] Recipe docs~~ [RFC][DOCS] Recipe [DOCS] ([DOC]umentation) Aug 2, 2024

SalmanMohammadi mentioned this pull request Aug 2, 2024

RLHF with PPO #1005

Merged

19 tasks

SalmanMohammadi requested review from karthikprasad, kartikayk and andrewor14 and removed request for karthikprasad, pbontrager, ebsmothers and andrewor14 August 2, 2024 17:10

SalmanMohammadi added 2 commits August 5, 2024 15:40

addressing comments

eb93be1

Merge branch 'recipe_docs' of github.com:SalmanMohammadi/torchtune in…

4292014

…to recipe_docs

ebsmothers reviewed Aug 5, 2024

View reviewed changes

SalmanMohammadi added 4 commits August 6, 2024 09:28

adding guidance for using different mdoels before I forget

c7bd45a

addressing comments

683a139

removing config

89da954

sp

dc3bcbf

felipemello1 reviewed Aug 13, 2024

View reviewed changes

adding mem opt

5d09ca1

SalmanMohammadi requested review from RdoubleA, ebsmothers and felipemello1 August 19, 2024 13:09

SalmanMohammadi added 2 commits August 19, 2024 15:21

updating template

1f35783

adding n step note

dbfc121

felipemello1 reviewed Aug 19, 2024

View reviewed changes

docs/source/tutorials/memory_optimizations.rst Outdated Show resolved Hide resolved

further callouts in table

f212380

felipemello1 reviewed Aug 19, 2024

View reviewed changes

updated table

546224a

a little more callout

cdbe0d9

SalmanMohammadi mentioned this pull request Aug 22, 2024

[RFC] RLHF follow-ups #1395

Closed

8 tasks

updating tutorials

3d06179

joecummings approved these changes Aug 24, 2024

View reviewed changes

SalmanMohammadi merged commit 158a400 into pytorch:main Aug 24, 2024
20 checks passed

SalmanMohammadi deleted the recipe_docs branch September 17, 2024 09:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RFC][DOCS] Recipe [DOCS] ([DOC]umentation) #1230

[RFC][DOCS] Recipe [DOCS] ([DOC]umentation) #1230

SalmanMohammadi commented Jul 26, 2024 •

edited

Loading

pytorch-bot bot commented Jul 26, 2024 •

edited

Loading

pbontrager left a comment

RdoubleA commented Aug 5, 2024

SalmanMohammadi commented Aug 5, 2024

ebsmothers Aug 5, 2024

SalmanMohammadi Aug 8, 2024

felipemello1 left a comment •

edited

Loading

felipemello1 Aug 13, 2024 •

edited

Loading

SalmanMohammadi Aug 19, 2024

ebsmothers Aug 24, 2024

SalmanMohammadi Aug 24, 2024

SalmanMohammadi Aug 24, 2024

felipemello1 Aug 13, 2024

SalmanMohammadi Aug 19, 2024

felipemello1 Aug 19, 2024

SalmanMohammadi Aug 19, 2024

felipemello1 Aug 13, 2024

SalmanMohammadi Aug 19, 2024

felipemello1 Aug 13, 2024 •

edited

Loading

SalmanMohammadi Aug 19, 2024

ebsmothers Aug 24, 2024

SalmanMohammadi commented Aug 13, 2024

SalmanMohammadi commented Aug 19, 2024 •

edited

Loading

felipemello1 Aug 19, 2024

SalmanMohammadi Aug 19, 2024

felipemello1 commented Aug 19, 2024

SalmanMohammadi commented Aug 19, 2024

joecummings left a comment

		torchtune comes with a host of plug-and-play memory optimization components which give you lots of flexibility
		to ``tune`` our recipes to your hardware. This page provides a brief glossary of these components and how you might use them.

[RFC][DOCS] Recipe [DOCS] ([DOC]umentation) #1230

[RFC][DOCS] Recipe [DOCS] ([DOC]umentation) #1230

Conversation

SalmanMohammadi commented Jul 26, 2024 • edited Loading

Test plan

pytorch-bot bot commented Jul 26, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/1230

✅ No Failures

pbontrager left a comment

Choose a reason for hiding this comment

RdoubleA commented Aug 5, 2024

SalmanMohammadi commented Aug 5, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

felipemello1 left a comment • edited Loading

Choose a reason for hiding this comment

felipemello1 Aug 13, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

felipemello1 Aug 13, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

SalmanMohammadi commented Aug 13, 2024

SalmanMohammadi commented Aug 19, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

felipemello1 commented Aug 19, 2024

SalmanMohammadi commented Aug 19, 2024

joecummings left a comment

Choose a reason for hiding this comment

SalmanMohammadi commented Jul 26, 2024 •

edited

Loading

pytorch-bot bot commented Jul 26, 2024 •

edited

Loading

felipemello1 left a comment •

edited

Loading

felipemello1 Aug 13, 2024 •

edited

Loading

felipemello1 Aug 13, 2024 •

edited

Loading

SalmanMohammadi commented Aug 19, 2024 •

edited

Loading