Separate preconditioned update from gradient update #2003
samdporter started this conversation in Ideas
Replies: 1 comment 3 replies
Hi @samdporter - this looks like swapping the order of the step-size and preconditioner calculations, so that the step size is calculated from the un-preconditioned gradient (wherever the step-size rule uses the gradient). The Armijo rule uses the gradient, doesn't it? Do we want the step size there to be calculated from the preconditioned or the un-preconditioned gradient?
Hey all,
Would it make more sense to separate the gradient update from the preconditioned update during the `update` method of an algorithm?
For example in FISTA:
Could instead be something like
This would be helpful for debugging - the gradient update and preconditioned update could both be accessed by a callback - and also for other use cases such as a line search, where both the preconditioned update and gradient update may be required.
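To make the line-search use case concrete, here is a minimal, self-contained sketch. It is not the library's code: the class `PreconditionedGD`, its attributes `gradient_update` and `preconditioned_update`, the diagonal preconditioner, and the toy quadratic are all hypothetical names chosen for illustration, and plain gradient descent stands in for FISTA to keep it short. It shows one way an Armijo-style backtracking test could use both quantities: the trial point moves along the preconditioned update, while the predicted decrease uses the inner product of the raw gradient with that update.

```python
# Hypothetical sketch: none of these names come from the library under discussion.

class PreconditionedGD:
    """Gradient descent that stores the raw and preconditioned gradients separately."""

    def __init__(self, x0, objective, gradient, precond_diag, step_size=1.0):
        self.x = list(x0)
        self.objective = objective          # callable: x -> float
        self.gradient = gradient            # callable: x -> list of floats
        self.precond_diag = precond_diag    # entries of a diagonal preconditioner
        self.step_size = step_size          # initial trial step for backtracking
        self.gradient_update = None         # raw gradient, kept for callbacks
        self.preconditioned_update = None   # preconditioned gradient, kept separately

    def armijo_step(self, c=1e-4, shrink=0.5, max_halvings=50):
        """Backtrack until the sufficient-decrease test holds, then return the step."""
        f0 = self.objective(self.x)
        # Predicted decrease needs the raw gradient AND the preconditioned direction.
        slope = sum(g * p for g, p in
                    zip(self.gradient_update, self.preconditioned_update))
        alpha = self.step_size
        for _ in range(max_halvings):
            trial = [xi - alpha * pi
                     for xi, pi in zip(self.x, self.preconditioned_update)]
            if self.objective(trial) <= f0 - c * alpha * slope:
                break
            alpha *= shrink
        return alpha

    def update(self):
        # 1. Raw gradient, stored on its own and never overwritten.
        self.gradient_update = self.gradient(self.x)
        # 2. Preconditioned update, written to a separate attribute.
        self.preconditioned_update = [
            d * g for d, g in zip(self.precond_diag, self.gradient_update)
        ]
        # 3. Step size chosen with access to both quantities.
        alpha = self.armijo_step()
        self.x = [xi - alpha * pi
                  for xi, pi in zip(self.x, self.preconditioned_update)]


# Toy problem: f(x) = 0.5 * (x0^2 + 100 * x1^2), preconditioned by the inverse
# of the Hessian diagonal.
def f(x):
    return 0.5 * (x[0] ** 2 + 100.0 * x[1] ** 2)

def grad_f(x):
    return [x[0], 100.0 * x[1]]

solver = PreconditionedGD([1.0, 1.0], f, grad_f, precond_diag=[1.0, 0.01])
solver.update()
print(solver.gradient_update, solver.preconditioned_update, solver.x)
```

Because the two updates live in separate attributes, a callback could log either one after each iteration, and a step-size rule is free to choose which of them it reads.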