How to replace AllReduce with Reduce in parameter server mode? #467
Comments
Note the reason parameter server mode takes a mean of the gradients instead of a sum is that …
@reedwm Thanks for getting back to me about …
On a single device, … However, with parameter server mode, …
@reedwm
I know …
benchmarks/scripts/tf_cnn_benchmarks/variable_mgr_util.py
Line 575 in 5d03cf8
@reedwm
In parameter server mode, I managed to replace
`grad = tf.add_n(grads)`
with NCCL all-reduce.
I tried to figure out a way to compute the sum without using all-reduce, since I only need one copy of the sum, not number-of-GPUs copies of it. Is there a reduce API in TF that I can use? Thanks.
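To make the distinction in the question concrete, here is a minimal, framework-free sketch of the two collectives. It only illustrates the semantics: `reduce_sum`, `all_reduce_sum`, and `to_mean` are hypothetical helper names, not TensorFlow or NCCL APIs, and plain Python lists stand in for per-GPU gradient tensors.

```python
from typing import List, Optional

Vector = List[float]


def reduce_sum(per_device_grads: List[Vector], root: int = 0) -> List[Optional[Vector]]:
    """Hypothetical reduce: only the `root` device ends up holding the sum."""
    total = [sum(vals) for vals in zip(*per_device_grads)]
    # Every non-root device receives nothing (modelled here as None).
    return [total if i == root else None for i in range(len(per_device_grads))]


def all_reduce_sum(per_device_grads: List[Vector]) -> List[Vector]:
    """Hypothetical all-reduce: every device ends up with its own copy of the sum."""
    total = [sum(vals) for vals in zip(*per_device_grads)]
    return [list(total) for _ in per_device_grads]


def to_mean(total: Vector, num_devices: int) -> Vector:
    """Turn a summed gradient into a mean by dividing by the device count."""
    return [v / num_devices for v in total]


# Four simulated devices, each holding a two-element gradient.
grads = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]

reduced = reduce_sum(grads, root=0)   # one copy of the sum, on device 0 only
all_reduced = all_reduce_sum(grads)   # one copy of the sum per device
```

With reduce, the sum exists once (here on device 0); with all-reduce, it is materialized once per device, which is the extra work the question wants to avoid. Dividing the single reduced sum by the number of devices with `to_mean` would give the mean that parameter server mode is described as using.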