Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

image classification benchmark and finetune #5892

Closed
jacquesqiao opened this issue Nov 24, 2017 · 6 comments
Closed

image classification benchmark and finetune #5892

jacquesqiao opened this issue Nov 24, 2017 · 6 comments
Assignees

Comments

@jacquesqiao
Copy link
Member

jacquesqiao commented Nov 24, 2017

project: #6024

@jacquesqiao jacquesqiao self-assigned this Nov 24, 2017
@jacquesqiao jacquesqiao changed the title benchmark image classification with tf image classification benchmark and finetune Dec 4, 2017
@jacquesqiao
Copy link
Member Author

jacquesqiao commented Dec 4, 2017

Info

1 pass

Paddle fluid:

Total examples: 1120, total time: 162.26692
6.90221 examples/sec, 5.07084 sec/batch
pass_acc: [ 0.07985038]

Tensorflow

Total examples: 1120, total time: 69.05409
16.21917 examples/sec, 2.15794 sec/batch

@jacquesqiao
Copy link
Member Author

jacquesqiao commented Dec 4, 2017

Info

4 pass

PaddlePaddle fluid

pass_acc: [ 0.27012524]
Total examples: 1120, total time: 608.13095
1.84171 examples/sec, 19.00409 sec/batch

Tensorflow

pass=3, batch=192, loss=1.980749, acc=0.400000

Total examples: 1120, total time: 236.70867
4.73155 examples/sec, 7.39715 sec/batch

@qingqing01
Copy link
Contributor

qingqing01 commented Dec 4, 2017

@jacquesqiao I tested PaddlePaddle fluid in TITAN X (Pascal) by using cuDNN and removing the stream synchronization between each operator. The total time of one pass of PaddlePaddle fluid is 70.47459 sec. From your testing, the time of TensorFlow is 69.05409 sec.

Config:

  • 1 pass
  • batch_size: 32
  • Singe GPU
  • cuDNN 5.1

Time:

Total examples: 6149, total time: 70.47459
87.25131 examples/sec, 0.36515 sec/batch

@jacquesqiao
Copy link
Member Author

jacquesqiao commented Dec 5, 2017

@qingqing01 cool! I will also have a try~

@dzhwinter
Copy link
Contributor

If we use the Finish implement the synchronized training. That's really hurt performance a lot when doing the training. How about put Finish barrier at the executor finish line?

@jacquesqiao
Copy link
Member Author

jacquesqiao commented Dec 11, 2017

condition:

  • use a fixed numpy as data.
  • drop the first 10 iteration

result:

  • Fluid:
    Total examples: 2240, total time: 16.79173
    133.39901 examples/sec, 0.23988 sec/batch

  • Tensorflow:
    Total examples: 2240, total time: 15.99317
    140.05980 examples/sec, 0.22847 sec/batch

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants