Add New lr scheduler #1393

sdbds · 2024-06-28T02:09:11Z

Change Lr schedulers library from diffusers to transformers, they use same library but transformers had more lr schedulers.
Considering the community has always wanted custom learning rate, I think these newly added regulators will meet the demand.

Add inverse sqrt learning rate scheduler

new argument lr_scheduler_timescale,default to warms_up_steps

Add cosine with min lr scheduler
When set the num_steps=100, num_warmup_steps=10, lr=0.2, min_lr=0.01. The learning rate looks like:

new argument lr_scheduler_min_lr_ratio,default to 0 as cosine lr scheduler.

Add WSD scheduler
The ladder scheduler that so many people have been wanting.

new argument lr_decay_steps,
new argument lr_scheduler_min_lr_ratio,default to 0.

need update requirement transformers==4.41.2

sdbds · 2024-08-30T15:28:09Z

Fix bugs now, i think it works well.
@kohya-ss

kohya-ss · 2024-09-01T09:39:33Z

Thank you! However, removing PIECEWISE_CONSTANT will likely impact users. I would like to update the code after merging so that it can be used, but it will take some time. I'd like to prioritize some other PRs. If you could fix this, that would be great.

sdbds · 2024-09-01T16:55:31Z

Thank you! However, removing PIECEWISE_CONSTANT will likely impact users. I would like to update the code after merging so that it can be used, but it will take some time. I'd like to prioritize some other PRs. If you could fix this, that would be great.

OK，just keep diffusers import for backup using, use

name = SchedulerType(name) or DiffusersSchedulerType(name)
schedule_func = TYPE_TO_SCHEDULER_FUNCTION[name] or DIFFUSERS_TYPE_TO_SCHEDULER_FUNCTION[name]

to judge if using PIECEWISE_CONSTANT.
And i add parser type for input float with warmup and decay ratio, don't need to calculate training total steps.

kohya-ss · 2024-09-09T11:40:27Z

Sorry for bothering you again. Is there a reason to upgrade the version of the library other than transformers? It requires some more comprehensive testing, which takes time.

sdbds · 2024-09-09T11:56:49Z

Sorry for bothering you again. Is there a reason to upgrade the version of the library other than transformers? It requires some more comprehensive testing, which takes time.

The main thing is to upgrade the transformers version, and after upgrading the transformers version the other two dependencies will also ask for an update, so it's three version updates.
Since the PR is older, I reviewed the latest three dependency update histories, and there don't seem to be any major bug fixes as well as disruptive changes.

kohya-ss · 2024-09-09T12:38:25Z

Thanks, I understand. Then it seems like there is no big problem. I've updated accelerate and transformers in the sd3 branch, so maybe I can match that. I'll do some checks and merge :)

library/train_util.py

kohya-ss · 2024-09-11T12:45:12Z

Sorry for the delay, I have merged this.

sdbds added 5 commits June 28, 2024 09:30

add new lr scheduler

9c782f3

add new lr scheduler

a31252b

fix bugs and use num_cycles / 2

dc6767a

Update requirements.txt

5488b51

add num_cycles for min lr

005a232

sdbds added 2 commits September 2, 2024 00:23

keep PIECEWISE_CONSTANT

545fef2

allow use float with warmup or decay ratio.

717c379

Update train_util.py

416e521

kohya-ss reviewed Sep 10, 2024

View reviewed changes

library/train_util.py Show resolved Hide resolved

kohya-ss merged commit fd68703 into kohya-ss:dev Sep 11, 2024
1 check passed

kohya-ss added a commit that referenced this pull request Sep 11, 2024

Fix to work PIECEWISE_CONSTANT, update requirement.txt and README #1393

6dbfd47

This was referenced Sep 15, 2024

got an unexpected keyword argument 'num_decay_steps' bmaltais/kohya_ss#2812

Open

unexpected keyword argument 'num_decay_steps' when using cosine or linear scheduler (sd3 branch) #1602

Closed

kohya-ss added a commit that referenced this pull request Sep 16, 2024

fix to work lienar/cosine lr scheduler closes #1602 ref #1393

96c677b

kohya-ss added a commit that referenced this pull request Sep 29, 2024

fix to work linear/cosine scheduler closes #1651 ref #1393

012e7e6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add New lr scheduler #1393

Add New lr scheduler #1393

sdbds commented Jun 28, 2024

sdbds commented Aug 30, 2024

kohya-ss commented Sep 1, 2024

sdbds commented Sep 1, 2024

kohya-ss commented Sep 9, 2024

sdbds commented Sep 9, 2024 •

edited

Loading

kohya-ss commented Sep 9, 2024

kohya-ss commented Sep 11, 2024

Add New lr scheduler #1393

Add New lr scheduler #1393

Conversation

sdbds commented Jun 28, 2024

sdbds commented Aug 30, 2024

kohya-ss commented Sep 1, 2024

sdbds commented Sep 1, 2024

kohya-ss commented Sep 9, 2024

sdbds commented Sep 9, 2024 • edited Loading

kohya-ss commented Sep 9, 2024

kohya-ss commented Sep 11, 2024

sdbds commented Sep 9, 2024 •

edited

Loading