add lamb optimizer #7389

L1aoXingyu · 2022-01-27T03:09:25Z

Add a LAMB optimizer interface for both eager and graph modes.

The original formula is as follows:

The first step, normalizing the gradients, can be implemented in clip grad.
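
For readers without the formula at hand, here is a minimal sketch of a single LAMB update in plain Python. It assumes the commonly published LAMB formulation (Adam-style moments, optional bias correction, decoupled weight decay, and a layer-wise trust ratio) and is not the OneFlow kernel from this PR. The hyperparameter names (`beta1`, `beta2`, `epsilon`, `weight_decay`, `do_bias_correction`) mirror the kernel arguments visible in the diff; the function `lamb_step` and helper `l2_norm` are hypothetical names chosen for illustration. The gradient normalization step is left out, since it is expected to be handled by clip grad.

```python
import math


def l2_norm(xs):
    """Euclidean norm of a flat list of floats (hypothetical helper)."""
    return math.sqrt(sum(x * x for x in xs))


def lamb_step(param, grad, m, v, step, lr=1e-3, beta1=0.9, beta2=0.999,
              epsilon=1e-6, weight_decay=0.0, do_bias_correction=True):
    """One LAMB update for a single parameter tensor, given as flat lists.

    Illustrative sketch only; `step` is 1-based.
    Returns the new (param, m, v).
    """
    # Adam-style first and second moment estimates.
    m = [beta1 * mi + (1 - beta1) * g for mi, g in zip(m, grad)]
    v = [beta2 * vi + (1 - beta2) * g * g for vi, g in zip(v, grad)]

    # Optional bias correction of the moments.
    bc1 = 1 - beta1 ** step if do_bias_correction else 1.0
    bc2 = 1 - beta2 ** step if do_bias_correction else 1.0

    # Adam direction plus decoupled weight decay.
    update = [(mi / bc1) / (math.sqrt(vi / bc2) + epsilon) + weight_decay * p
              for mi, vi, p in zip(m, v, param)]

    # Layer-wise trust ratio; fall back to 1.0 if either norm is zero.
    w_norm = l2_norm(param)
    u_norm = l2_norm(update)
    trust_ratio = w_norm / u_norm if w_norm > 0 and u_norm > 0 else 1.0

    new_param = [p - lr * trust_ratio * u for p, u in zip(param, update)]
    return new_param, m, v
```

Calling `lamb_step([1.0, -2.0], [0.1, -0.2], [0.0, 0.0], [0.0, 0.0], step=1)` moves each weight against the sign of its gradient, with the step size scaled by the ratio of the weight norm to the update norm.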

L1aoXingyu · 2022-01-27T03:09:35Z

MARD1NO · 2022-01-27T03:21:13Z

oneflow/api/python/functional/dispatch_stateful_ops.cpp

@@ -14,6 +14,7 @@ See the License for the specific language governing permissions and
 limitations under the License.
 */

+#include <memory>


python/oneflow/nn/optimizer/lamb.py

wyg1997 · 2022-01-27T08:33:51Z

oneflow/user/kernels/model_update_kernel_util.cu

-      stream, n, scale, l1, l2, beta1, beta2, epsilon, weight_decay, learning_rate, scale_by_ptr,
-      skip_if, reinterpret_cast<const half*>(model_diff), adam_diff, model, m, v, norm_buffer,
-      beta1_t, beta2_t);
+      stream, n, scale, l1, l2, beta1, beta2, epsilon, weight_decay, learning_rate_val, do_bias_correction, bias_correction1_val, bias_correction2_val, learning_rate_ptr, bias_correction1_ptr, bias_correction2_ptr, scale_by_ptr,


Doesn't of_format wrap this line?

github-actions · 2022-01-28T01:12:09Z

Code got formatted by CI. Please request CI again if you still want to have this PR merged. If the PR is from a forked repo, please download the patch files from the GitHub Actions web page and apply them locally.

github-actions · 2022-01-28T04:59:25Z

CI failed when running job: cuda-misc. PR label automerge has been removed

github-actions · 2022-01-28T10:55:12Z

Speed stats:

GPU Name: GeForce GTX 1080 

✔️ OneFlow resnet50 time: 136.7ms (= 13672.1ms / 100, input_shape=[16, 3, 224, 224])
PyTorch resnet50 time: 138.5ms (= 13847.1ms / 100, input_shape=[16, 3, 224, 224])
✔️ Relative speed: 1.01 (= 138.5ms / 136.7ms)

✔️ OneFlow resnet50 time: 78.4ms (= 7840.1ms / 100, input_shape=[8, 3, 224, 224])
PyTorch resnet50 time: 83.6ms (= 8355.2ms / 100, input_shape=[8, 3, 224, 224])
✔️ Relative speed: 1.07 (= 83.6ms / 78.4ms)

OneFlow resnet50 time: 52.4ms (= 10479.3ms / 200, input_shape=[4, 3, 224, 224])
PyTorch resnet50 time: 59.1ms (= 11821.5ms / 200, input_shape=[4, 3, 224, 224])
✔️ Relative speed: 1.13 (= 59.1ms / 52.4ms)

OneFlow resnet50 time: 39.4ms (= 7883.8ms / 200, input_shape=[2, 3, 224, 224])
PyTorch resnet50 time: 43.6ms (= 8728.0ms / 200, input_shape=[2, 3, 224, 224])
✔️ Relative speed: 1.11 (= 43.6ms / 39.4ms)

OneFlow resnet50 time: 37.6ms (= 7521.6ms / 200, input_shape=[1, 3, 224, 224])
PyTorch resnet50 time: 38.6ms (= 7725.2ms / 200, input_shape=[1, 3, 224, 224])
✔️ Relative speed: 1.03 (= 38.6ms / 37.6ms)

✔️ OneFlow resnet50 time: 148.0ms (= 14799.4ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 158.5ms (= 15849.5ms / 100, input_shape=[16, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.07 (= 158.5ms / 148.0ms)

OneFlow resnet50 time: 88.9ms (= 8885.5ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 104.8ms (= 10482.0ms / 100, input_shape=[8, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.18 (= 104.8ms / 88.9ms)

OneFlow resnet50 time: 67.0ms (= 13390.7ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 76.6ms (= 15320.3ms / 200, input_shape=[4, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.14 (= 76.6ms / 67.0ms)

OneFlow resnet50 time: 57.3ms (= 11459.8ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 62.8ms (= 12554.7ms / 200, input_shape=[2, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.10 (= 62.8ms / 57.3ms)

OneFlow resnet50 time: 57.3ms (= 11456.9ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
PyTorch resnet50 time: 59.6ms (= 11919.6ms / 200, input_shape=[1, 3, 224, 224], ddp, world size=2)
✔️ Relative speed: 1.04 (= 59.6ms / 57.3ms)

L1aoXingyu added 4 commits January 26, 2022 10:56

finish lamb interface

4ced11a

finish lamb optim

c7cd109

refine docs

ab47f24

add math equation

4f62308

L1aoXingyu requested review from daquexian, doombeaker, guo-ran, jackalcooper, liujuncheng and MARD1NO as code owners January 27, 2022 03:09

L1aoXingyu added feature op labels Jan 27, 2022

MARD1NO approved these changes Jan 27, 2022

refine

231d042

L1aoXingyu requested a review from wyg1997 January 27, 2022 06:58

Merge branch 'master' of github.com:Oneflow-Inc/oneflow into dev_lxy_…

3796454

…lambOptim

wyg1997 approved these changes Jan 27, 2022

L1aoXingyu added 2 commits January 28, 2022 09:05

format

fb8019a

Merge branch 'master' of github.com:Oneflow-Inc/oneflow into dev_lxy_…

f55be4e

…lambOptim

L1aoXingyu requested a review from oneflow-ci-bot January 28, 2022 01:10

L1aoXingyu added the automerge label Jan 28, 2022

auto format by CI

7773d7b

L1aoXingyu requested review from oneflow-ci-bot and removed request for oneflow-ci-bot January 28, 2022 01:24

Merge branch 'master' into dev_lxy_lambOptim

6db707e

oneflow-ci-bot requested review from oneflow-ci-bot and removed request for oneflow-ci-bot January 28, 2022 02:15

github-actions bot removed the automerge label Jan 28, 2022

oneflow-ci-bot removed their request for review January 28, 2022 05:00

L1aoXingyu added 3 commits January 28, 2022 14:33

Merge branch 'master' of github.com:Oneflow-Inc/oneflow into dev_lxy_…

bc8c1d4

…lambOptim

add utils

74f1dfa

Merge branches 'dev_lxy_lambOptim' and 'dev_lxy_lambOptim' of github.…

eb3d103

…com:Oneflow-Inc/oneflow into dev_lxy_lambOptim

L1aoXingyu requested a review from oneflow-ci-bot January 28, 2022 07:49

L1aoXingyu added the automerge label Jan 28, 2022

Merge branch 'master' into dev_lxy_lambOptim

5ab83a4

oneflow-ci-bot requested review from oneflow-ci-bot and removed request for oneflow-ci-bot January 28, 2022 08:49

jackalcooper merged commit 9f658f0 into master Jan 28, 2022

jackalcooper deleted the dev_lxy_lambOptim branch January 28, 2022 10:56
