[C++] implement polyak-ruppert averaging for gradient descent #390

suntzu86 · 2014-08-20T03:21:37Z

C++'s gradient descent code currently does not support any kind of averaging.

We should implement polyak-ruppert averaging. This is already done in moe.optimal_learning.python.python_version.optimization.GradientDescentDescentOptimizer.optimize so porting it should be straightfoward.

This hasn't proven to be much of a hindrance insofar as the results obtained in Python with/without averaging have been comparable (i.e., the final gradient hasn't bee much better either way). Still we should be consistent and this averaging is generally a good idea.

The text was updated successfully, but these errors were encountered:

suntzu86 added enhancement labels Aug 20, 2014

suntzu86 mentioned this issue Aug 20, 2014

[C++] have GradientDescentParameters track a num_steps_averaged field #391

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[C++] implement polyak-ruppert averaging for gradient descent #390

[C++] implement polyak-ruppert averaging for gradient descent #390

suntzu86 commented Aug 20, 2014

[C++] implement polyak-ruppert averaging for gradient descent #390

[C++] implement polyak-ruppert averaging for gradient descent #390

Comments

suntzu86 commented Aug 20, 2014