Update Qwen2.5 configs #1999

joecummings · 2024-11-13T18:36:33Z

I turned activation checkpointing off for all 0.5B models and for 1.5B LoRA models. No point.
I turned on memory logging.

Everything else is cosmetic.
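
For readers unfamiliar with the configs, the two non-cosmetic changes described above might look roughly like the following fragment. This is a hedged sketch rather than an excerpt from the PR diff: it assumes the usual torchtune config keys `enable_activation_checkpointing` and `log_peak_memory_stats`, and the exact placement within each file may differ.

```yaml
# Illustrative fragment only, not copied from the PR diff.

# Activation checkpointing trades extra compute for lower memory use;
# this PR turns it off for the 0.5B configs and the 1.5B LoRA configs.
enable_activation_checkpointing: False

# Logging
log_every_n_steps: 1
# Assumed key for the memory logging that this PR turns on.
log_peak_memory_stats: True
```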

pytorch-bot · 2024-11-13T18:36:36Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/1999

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 8596e5a with merge base 18d97f0:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

recipes/configs/qwen2_5/0.5B_full.yaml

ebsmothers · 2024-11-13T18:45:31Z

recipes/configs/qwen2_5/0.5B_full_single_device.yaml

+
+# Model arguments
+model:
+  _component_: torchtune.models.qwen2_5.qwen2_5_0_5b


I understand that you're making the filename change 0_5B -> 0.5B for consistency with other configs, but honestly would prefer to just move everything to 0_5B so it matches the builders (doesn't have to be in this PR though)

+1, I would prefer if we avoided using periods in names and only used them when they are part of a path or file type

for llama3.2 we added 3.2 to the path like you did, but not the components

This is the same thing we have here, no?

calvinpelletier

I thought we use underscores instead of periods?: #1863 (comment)

recipes/configs/qwen2_5/1.5B_full.yaml

ebsmothers · 2024-11-13T18:53:18Z

recipes/configs/qwen2_5/1.5B_full_single_device.yaml

 log_every_n_steps: 1
-log_peak_memory_stats: False
-
-# Profiler (disabled)


controversial take but if we want consistency we should leave these in. idc too much for this PR but I thought that was the whole point of a bunch of @felipemello1's changes. Either way would like to compress this config substantially in a separate PR

I am ok with making the profiler simpler in a separate PR

ebsmothers

I hate decimal points

joecummings · 2024-11-13T18:55:36Z

> I thought we use underscores instead of periods?: #1863 (comment)

Yeah this is a misleading comment. We do use underscores for model builders, but the model should just get downloaded to a directory with the exact same name as the model on the Hub.
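
A minimal sketch of the convention described in this reply, assuming a typical torchtune config layout: the builder path is the one shown in the diff above, while the `/tmp` prefix and the choice of the `Instruct` variant in `checkpoint_dir` are assumptions for illustration only.

```yaml
# Model arguments: builders are Python identifiers, so they use underscores.
model:
  _component_: torchtune.models.qwen2_5.qwen2_5_0_5b

# Checkpoint location: the directory mirrors the model name on the Hub,
# periods included. The /tmp prefix and Instruct variant are assumed here.
checkpointer:
  checkpoint_dir: /tmp/Qwen2.5-0.5B-Instruct
```

Under this reading, periods appear only in paths that mirror Hub names, and underscores appear wherever the name must be importable Python.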

Update Qwen2.5 configs

72e40fb

facebook-github-bot added the CLA Signed label Nov 13, 2024

Typo

88f7b1c

ebsmothers reviewed Nov 13, 2024

recipes/configs/qwen2_5/0.5B_full.yaml (outdated, resolved)

ebsmothers reviewed Nov 13, 2024

calvinpelletier reviewed Nov 13, 2024

ebsmothers reviewed Nov 13, 2024

recipes/configs/qwen2_5/1.5B_full.yaml (outdated, resolved)

ebsmothers reviewed Nov 13, 2024

ebsmothers approved these changes Nov 13, 2024

joecummings added 3 commits November 13, 2024 11:00

Address nit

227ab6c

Update comment

32ae3cd

I hate this

8596e5a

joecummings merged commit 1eb7785 into pytorch:main Nov 13, 2024
16 checks passed

joecummings deleted the update-qwen2.5-stuff branch November 13, 2024 19:32

joecummings added a commit that referenced this pull request Nov 13, 2024

Update Qwen2.5 configs (#1999)

03a4b1e
