-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Blas SGEMM launch failed #362
Comments
The same problem as you, can you tell me what happend and how to solve it? |
I have met it many times...how to solve it? |
I noticed some bugs in the original cuda runfile. I fixed it by installing the four patches of CUDA in https://developer.nvidia.com/cuda-90-download-archive?target_os=Linux&target_arch=x86_64&target_distro=Ubuntu&target_version=1604&target_type=runfilelocal. It does work and hope it works for you. |
Thank you so much. A few days ago, I run the some code with two 1080Ti GPUs, it works. So I guess there might be some problems in CUDA support in 2080Ti GPU as you said :) |
The comments above may help you~ |
you are right! Thanks |
Thank u so much! |
With those patches installed, my RTX2070 on win10 is still having the same issue. |
I have the same GPU RTX2070 and having the same problem in Ubunto 16.04! Tensorflow 1.8 |
直接sudo sh cuda_9.0.176_384.81_linux.run会报错?run不需要的 |
安装cuda的时候应该补丁一起装上了,为啥还要再装这个。。。 |
The base installer is available for download below. |
怎么知道这四个补丁是不是安装成功呢,有没有测试命令 |
安装过程显示的和cuda安装时一样的,一般也不会报啥错的,再跑一下原来的程序看会不会继续报错嘛 |
During handling of the above exception, another exception occurred: Traceback (most recent call last): Caused by op 'conv2d_3/convolution', defined at: InternalError (see above for traceback): Blas SGEMM launch failed : m=43264, n=32, k=64 |
有解决吗 ?跟你一样的问题 docker环境TensorFlow1.8.0 cuda9.0 keras2.2.4 |
I used cude9.0 with tf1.8 at first, and then I fixed this problem by changing it to cuda9.2 . |
It does work for me, thanks a lot |
Input image filename:images/img00030.jpg help me |
Try reinstalling TensorFlow to
|
thank you so much |
I've tried tensorflow==1.12.0 (cpu version). It's ok. |
when I run "python yolo_video.py --image", it loads model successfully. But then I input the name of test picture, error occurs as follows:
Input image filename:1.jpg
(416, 416, 3)
2019-03-07 14:23:30.749652: E tensorflow/stream_executor/cuda/cuda_blas.cc:647] failed to run cuBLAS routine cublasSgemm_v2: CUBLAS_STATUS_EXECUTION_FAILED
Traceback (most recent call last):
File "/home/jmh/.conda/envs/tfgpu/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1322, in _do_call
return fn(*args)
File "/home/jmh/.conda/envs/tfgpu/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1307, in _run_fn
options, feed_dict, fetch_list, target_list, run_metadata)
File "/home/jmh/.conda/envs/tfgpu/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1409, in _call_tf_sessionrun
run_metadata)
tensorflow.python.framework.errors_impl.InternalError: Blas SGEMM launch failed : m=43264, n=32, k=64
[[Node: conv2d_3/convolution = Conv2D[T=DT_FLOAT, _class=["loc:@batch_normalization_3/cond/FusedBatchNorm/Switch"], data_format="NHWC", dilations=[1, 1, 1, 1], padding="SAME", strides=[1, 1, 1, 1], use_cudnn_on_gpu=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](leaky_re_lu_2/LeakyRelu, conv2d_3/kernel/read)]]
[[Node: boolean_mask_29/GatherV2/_3363 = _Recvclient_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_6054_boolean_mask_29/GatherV2", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]]
My GPU is RTX2080Ti 8G*2, tensorflow-gpu 1.9.0, keras 2.2.4.
I feel so annoyed by this problem, who can tell me what happend and how to solve it?
The text was updated successfully, but these errors were encountered: