Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

training results are flat #7

Closed
gburachas opened this issue Oct 27, 2015 · 1 comment
Closed

training results are flat #7

gburachas opened this issue Oct 27, 2015 · 1 comment

Comments

@gburachas
Copy link

HI,
I ran ./test_lstm_long.sh and other scripts on an Ubuntu box, and the result in the log file is flat:
...
-0.138143 0.087998
-0.118875 0.087998
-0.0895481 0.087998
-0.0522323 0.087998
-0.00979323 0.087998
...

Any suggestions?
Also, by default the mode is CPU, and the scripts crash when the flag in *solver.prototxt is changed to GPU. I use C2070, which works fine with other caffe projects.

thanks-
GTB

@junhyukoh
Copy link
Owner

Hi,

  • There was a bug in LSTM when "clip" is set to 1 for the first time-step. I fixed it now.
  • I changed my code and prototxts to support GPU mode in examples.
    However, it will be slower than CPU mode because of the data copy overhead from CPU to GPU.
    In general, GPU mode is much faster than CPU mode with large-scale data.

Thank you for reporting this!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants