Add lr scheduler, weight decay and max_grad_norm #572
