Fix documentation to explain how schedules work in optimizers. #399
copybara-service bot pushed a commit that referenced this issue on Feb 5, 2024: "The doc was slightly misleading, see #399." PiperOrigin-RevId: 604372850
copybara-service bot pushed a commit that referenced this issue on Feb 6, 2024: "The doc was slightly misleading, see #399." PiperOrigin-RevId: 604548137
Done in #778
Discussed in #390
Originally posted by nalzok August 13, 2022
For example, the documentation for optax.adam describes learning_rate as "a fixed global scaling factor". I noticed that learning_rate can also be a function. How can a function be "a fixed global scaling factor"? What parameters should such a function take, and how does its return value affect the optimization process, given that Adam adapts its step sizes on its own?