tensorflow-gpu Failed to get convolution algorithm.

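After successfully installing the GPU build of TensorFlow, I tried running two neural networks.
The first is a fully connected DNN.
The key code is as follows: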

import tensorflow as tf  # TensorFlow 1.x API

# weight_variable and bias_variable are the helpers shown in the CNN listing further down
xs=tf.placeholder(tf.float32,[None,10])
ys=tf.placeholder(tf.float32,[None,7])


# layer 1: fully connected
W_fc1=weight_variable([10,5000],name_data=None)
b_fc1=bias_variable([5000],name_data=None)

h_fc1=tf.nn.relu(tf.matmul(xs,W_fc1)+b_fc1)

# layer 2: fully connected
W_fc2=weight_variable([5000,5000],name_data=None)
b_fc2=bias_variable([5000],name_data=None)

h_fc2=tf.nn.relu(tf.matmul(h_fc1,W_fc2)+b_fc2)

# layer 3: fully connected
W_fc3=weight_variable([5000,5000],name_data=None)
b_fc3=bias_variable([5000],name_data=None)

h_fc3=tf.nn.relu(tf.matmul(h_fc2,W_fc3)+b_fc3)

# output layer: fully connected, sigmoid activation
W_fc4=weight_variable([5000,7],name_data=None)
b_fc4=bias_variable([7],name_data=None)


output=tf.nn.sigmoid(tf.matmul(h_fc3,W_fc4)+b_fc4)
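
For a sense of scale, the size of this fully connected network can be worked out from the layer shapes in the listing. The following is a minimal sketch in plain Python (no TensorFlow required). The helper names `dense_params` and `total_params` are hypothetical, and the 4 bytes per value assume float32, as in the placeholders. It counts weights and biases only, not activations or optimizer state.

```python
# Layer widths taken from the listing above: 10 -> 5000 -> 5000 -> 5000 -> 7.
LAYER_SIZES = [10, 5000, 5000, 5000, 7]


def dense_params(n_in, n_out):
    """Weights plus biases of one fully connected layer (hypothetical helper)."""
    return n_in * n_out + n_out


def total_params(sizes):
    """Sum the parameters of every consecutive pair of layer widths."""
    return sum(dense_params(a, b) for a, b in zip(sizes, sizes[1:]))


if __name__ == "__main__":
    n = total_params(LAYER_SIZES)
    print("trainable parameters:", n)
    # float32 uses 4 bytes per value
    print("float32 size in MiB: %.1f" % (n * 4 / 2**20))
```

Almost all of the parameters sit in the two 5000x5000 layers, which is the kind of large matrix multiplication that benefits from a GPU.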

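This network runs on the GPU without problems and is indeed considerably faster than on the CPU.
However, when I run a CNN (part of the code is shown below):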

# weights
def weight_variable(shape,name_data):
    initial=tf.truncated_normal(shape,stddev=0.1)
    return tf.Variable(initial,dtype=tf.float32,name=name_data)

# biases
def bias_variable(shape,name_data):
    initial=tf.constant(0.1,shape=shape)
    return tf.Variable(initial,dtype=tf.float32,name=name_data)

# conv2d layer: stride 1, SAME padding
def conv2d(x,W):
    return tf.nn.conv2d(x,W,strides=[1,1,1,1],padding='SAME')

# max pooling: 2x2 window, stride 2, VALID padding
def max_pool_2x2_v(x):
    return tf.nn.max_pool(x,ksize=[1,2,2,1],strides=[1,2,2,1],padding='VALID')

# max pooling: 2x2 window, stride 1, SAME padding
def max_pool_2x2_s(x):
    return tf.nn.max_pool(x,ksize=[1,2,2,1],strides=[1,1,1,1],padding='SAME')


#input layer
# placeholders xs & ys
xs=tf.placeholder(tf.float32,[None,64])
ys=tf.placeholder(tf.float32,[None,1])
# reshape xs into x_image, whose shape is 8*8 with one channel
x_image=tf.reshape(xs,[-1,8,8,1])
print('red input::',x_image)


#layer2: conv layer, 2 patches
# patch1
W_conv_r_1_1=weight_variable([3,3,1,20],name_data='W_conv_r_1_1')
b_conv_r_1_1=bias_variable([20],name_data='b_conv_r_1_1')
h_conv_r_1_1=tf.nn.relu6(conv2d(x_image,W_conv_r_1_1)+b_conv_r_1_1)
# patch2
W_conv_r_1_2=weight_variable([3,3,1,10],name_data='W_conv_r_1_2')
b_conv_r_1_2=bias_variable([10],name_data='b_conv_r_1_2')
h_conv_r_1_2=tf.nn.relu6(conv2d(x_image,W_conv_r_1_2)+b_conv_r_1_2)
# concat to layer2
h_conv_r_1=tf.concat([h_conv_r_1_1,h_conv_r_1_2],3)
print("red layer2::",h_conv_r_1)

#layer3: conv layer, 1 patch concatenated with h_conv_r_1_2
# patch1
W_conv_r_2_1=weight_variable([5,5,30,30],name_data='W_conv_r_2_1')
b_conv_r_2_1=bias_variable([30],name_data='b_conv_r_2_1')
h_conv_r_2_1=tf.nn.elu(conv2d(h_conv_r_1,W_conv_r_2_1)+b_conv_r_2_1)
# patch for next layer
W_conv_r_2_2=weight_variable([5,5,30,15],name_data='W_conv_r_2_2')
b_conv_r_2_2=bias_variable([15],name_data='b_conv_r_2_2')
h_conv_r_2_2=tf.nn.elu(conv2d(h_conv_r_1,W_conv_r_2_2)+b_conv_r_2_2)
# concat for layer3
h_conv_r_2=tf.concat([h_conv_r_2_1,h_conv_r_1_2],3)
print('red layer3::',h_conv_r_2)
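
The tensor shapes in this listing can be checked by hand, which helps rule out a shape mismatch as the cause of the error. The sketch below traces them in plain Python without TensorFlow. The helper names (`same_out`, `conv_same`, `concat_channels`) are hypothetical, the batch dimension is left out, and it relies on the rule that a convolution with padding='SAME' produces ceil(input / stride) outputs per spatial dimension regardless of kernel size.

```python
import math


def same_out(size, stride=1):
    """Spatial output size of a convolution with padding='SAME'."""
    return math.ceil(size / stride)


def conv_same(shape, out_channels, stride=1):
    """Shape (H, W, C) after a SAME convolution with out_channels filters."""
    h, w, _ = shape
    return (same_out(h, stride), same_out(w, stride), out_channels)


def concat_channels(a, b):
    """Shape after concatenating on the channel axis, as tf.concat([a, b], 3) does."""
    assert a[:2] == b[:2], "spatial dimensions must match"
    return (a[0], a[1], a[2] + b[2])


x_image = (8, 8, 1)                       # reshaped board
h_1_1 = conv_same(x_image, 20)            # W_conv_r_1_1: [3,3,1,20]
h_1_2 = conv_same(x_image, 10)            # W_conv_r_1_2: [3,3,1,10]
h_1 = concat_channels(h_1_1, h_1_2)       # input to layer3
h_2_1 = conv_same(h_1, 30)                # W_conv_r_2_1: [5,5,30,30]
h_2 = concat_channels(h_2_1, h_1_2)       # layer3 output

if __name__ == "__main__":
    print("layer2:", h_1)
    print("layer3:", h_2)
```

The 30 channels of layer2 match the input depth of the [5,5,30,30] and [5,5,30,15] filters, so the graph itself is consistent, which agrees with the observation that the same code runs on the CPU.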

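The code above is part of a program that trains a CNN to play Othello. It runs fine in a CPU environment, but in the GPU environment it fails at run time with the error: Failed to get convolution algorithm.
The full error output is as follows: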

Traceback (most recent call last):
  File "C:\Users\fengg\AppData\Local\Programs\Python\Python35\lib\site-packages\tensorflow\python\client\session.py", line 1334, in _do_call
    return fn(*args)
  File "C:\Users\fengg\AppData\Local\Programs\Python\Python35\lib\site-packages\tensorflow\python\client\session.py", line 1319, in _run_fn
    options, feed_dict, fetch_list, target_list, run_metadata)
  File "C:\Users\fengg\AppData\Local\Programs\Python\Python35\lib\site-packages\tensorflow\python\client\session.py", line 1407, in _call_tf_sessionrun
    run_metadata)
tensorflow.python.framework.errors_impl.UnknownError: Failed to get convolution algorithm. This is probably because cuDNN failed to initialize, so try looking to see if a warning log message was printed above.
     [[{{node Conv2D}} = Conv2D[T=DT_FLOAT, data_format="NHWC", dilations=[1, 1, 1, 1], padding="SAME", strides=[1, 1, 1, 1], use_cudnn_on_gpu=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](Reshape, W_conv_r_1_1/read)]]
     [[{{node Sigmoid/_75}} = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_105_Sigmoid", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]()]]

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "C:\Users\fengg\Desktop\Othello with ResNet  3\Othello with ResNet-large\Othello with ResNet-large\train_ResNet.py", line 326, in <module>
    try_point=sess.run(prediction_r, feed_dict={xs:board_try,ys:[[0.0001]]})
  File "C:\Users\fengg\AppData\Local\Programs\Python\Python35\lib\site-packages\tensorflow\python\client\session.py", line 929, in run
    run_metadata_ptr)
  File "C:\Users\fengg\AppData\Local\Programs\Python\Python35\lib\site-packages\tensorflow\python\client\session.py", line 1152, in _run
    feed_dict_tensor, options, run_metadata)
  File "C:\Users\fengg\AppData\Local\Programs\Python\Python35\lib\site-packages\tensorflow\python\client\session.py", line 1328, in _do_run
    run_metadata)
  File "C:\Users\fengg\AppData\Local\Programs\Python\Python35\lib\site-packages\tensorflow\python\client\session.py", line 1348, in _do_call
    raise type(e)(node_def, op, message)
tensorflow.python.framework.errors_impl.UnknownError: Failed to get convolution algorithm. This is probably because cuDNN failed to initialize, so try looking to see if a warning log message was printed above.
     [[node Conv2D (defined at C:\Users\fengg\Desktop\Othello with ResNet  3\Othello with ResNet-large\Othello with ResNet-large\train_ResNet.py:31)  = Conv2D[T=DT_FLOAT, data_format="NHWC", dilations=[1, 1, 1, 1], padding="SAME", strides=[1, 1, 1, 1], use_cudnn_on_gpu=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](Reshape, W_conv_r_1_1/read)]]
     [[{{node Sigmoid/_75}} = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_105_Sigmoid", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]()]]

Caused by op 'Conv2D', defined at:
  File "<string>", line 1, in <module>
  File "C:\Users\fengg\AppData\Local\Programs\Python\Python35\lib\idlelib\run.py", line 130, in main
    ret = method(*args, **kwargs)
  File "C:\Users\fengg\AppData\Local\Programs\Python\Python35\lib\idlelib\run.py", line 357, in runcode
    exec(code, self.locals)
  File "C:\Users\fengg\Desktop\Othello with ResNet  3\Othello with ResNet-large\Othello with ResNet-large\train_ResNet.py", line 57, in <module>
    h_conv_r_1_1=tf.nn.relu6(conv2d(x_image,W_conv_r_1_1)+b_conv_r_1_1)
  File "C:\Users\fengg\Desktop\Othello with ResNet  3\Othello with ResNet-large\Othello with ResNet-large\train_ResNet.py", line 31, in conv2d
    return tf.nn.conv2d(x,W,strides=[1,1,1,1],padding='SAME')
  File "C:\Users\fengg\AppData\Local\Programs\Python\Python35\lib\site-packages\tensorflow\python\ops\gen_nn_ops.py", line 1044, in conv2d
    data_format=data_format, dilations=dilations, name=name)
  File "C:\Users\fengg\AppData\Local\Programs\Python\Python35\lib\site-packages\tensorflow\python\framework\op_def_library.py", line 787, in _apply_op_helper
    op_def=op_def)
  File "C:\Users\fengg\AppData\Local\Programs\Python\Python35\lib\site-packages\tensorflow\python\util\deprecation.py", line 488, in new_func
    return func(*args, **kwargs)
  File "C:\Users\fengg\AppData\Local\Programs\Python\Python35\lib\site-packages\tensorflow\python\framework\ops.py", line 3274, in create_op
    op_def=op_def)
  File "C:\Users\fengg\AppData\Local\Programs\Python\Python35\lib\site-packages\tensorflow\python\framework\ops.py", line 1770, in __init__
    self._traceback = tf_stack.extract_stack()

UnknownError (see above for traceback): Failed to get convolution algorithm. This is probably because cuDNN failed to initialize, so try looking to see if a warning log message was printed above.
     [[node Conv2D (defined at C:\Users\fengg\Desktop\Othello with ResNet  3\Othello with ResNet-large\Othello with ResNet-large\train_ResNet.py:31)  = Conv2D[T=DT_FLOAT, data_format="NHWC", dilations=[1, 1, 1, 1], padding="SAME", strides=[1, 1, 1, 1], use_cudnn_on_gpu=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](Reshape, W_conv_r_1_1/read)]]
     [[{{node Sigmoid/_75}} = _Recv[client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_105_Sigmoid", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]()]]

How can I solve this problem? Thanks!

threenewbee 2018-11-16 15:19
I'm not sure whether this counts as having run it, but I got no errors.

My hardware and software:
Windows 10 1803 x64 10.0.17134.407
CUDA Version 9.2.148
CUDNN 7.1.4
Tensorflow 1.9.0
Python 3.6.4 (v3.6.4:d48eceb, Dec 19 2017, 06:54:40)
CPU Intel Core 2 Duo E4600 2.4GHz
GPU NVIDIA Geforce GTX650 2GB GDDR5
RAM 2.7GB

Try to match my versions of Windows, CUDA, cuDNN, TensorFlow, and Python as closely as you can.
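
To compare an installation against the list above, it helps to print the versions the interpreter actually sees. A minimal sketch follows. The function name `report_environment` is hypothetical; `tf.__version__` and `tf.test.is_built_with_cuda()` are the only TensorFlow calls used. It does not report the CUDA or cuDNN versions themselves, which have to be checked separately (for example, `nvcc --version` prints the CUDA toolkit version).

```python
import platform
import sys


def report_environment():
    """Collect version information relevant to CUDA/cuDNN matching."""
    info = {
        "os": platform.platform(),
        "python": sys.version.split()[0],
        "tensorflow": None,
        "built_with_cuda": None,
    }
    try:
        import tensorflow as tf
    except ImportError:
        # TensorFlow is missing, or one of its DLLs failed to load
        return info
    info["tensorflow"] = tf.__version__
    info["built_with_cuda"] = tf.test.is_built_with_cuda()
    return info


if __name__ == "__main__":
    for key, value in report_environment().items():
        print("%-16s %s" % (key, value))
```

If `built_with_cuda` is False, the installed package is the CPU-only build, and no combination of CUDA and cuDNN versions will make it use the GPU.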

Update, 11.17.2018 21:50

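How to download CUDA:
https://developer.nvidia.com/cuda-92-download-archive
Select Windows 10.
How to download cuDNN:
Go to https://developer.nvidia.com/cudnn
Register an account, log in, click the download button, and tick the agree box.
Select Archived cuDNN Releases.
Then pick the release to download (the answer included a screenshot here).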

Choose according to your CUDA version and your Windows version (Windows must be 64-bit).
Since I have Windows 10 and CUDA 9.2, I chose cuDNN v7.1.4 Library for Windows 10.

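Finally, here are the names and sizes of the two files so you can check your downloads; if the byte counts match, the files are fine.
File name: cuda_9.2.148_win10.exe, size: 1.47 GB (1,583,355,224 bytes)
File name: cudnn-9.2-windows10-x64-v7.1.zip, size: 206 MB (216,853,802 bytes)
The files are large, so I won't upload them for now; if you really can't get them, let me know.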
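
The size comparison can be automated. The sketch below uses only the Python standard library; the function name `check_download` is hypothetical, and the byte counts are the ones quoted above. A matching size is only a quick sanity check: it catches truncated downloads but is weaker than a checksum, which the answer does not provide.

```python
import os

# Expected byte counts quoted in the answer above.
EXPECTED_SIZES = {
    "cuda_9.2.148_win10.exe": 1583355224,
    "cudnn-9.2-windows10-x64-v7.1.zip": 216853802,
}


def check_download(path, expected_sizes=EXPECTED_SIZES):
    """Return True if the file at `path` has the expected size in bytes."""
    name = os.path.basename(path)
    if name not in expected_sizes:
        raise KeyError("no expected size recorded for %r" % name)
    return os.path.getsize(path) == expected_sizes[name]


if __name__ == "__main__":
    # Looks for the two installers in the current working directory.
    for name in EXPECTED_SIZES:
        if os.path.exists(name):
            print(name, "OK" if check_download(name) else "SIZE MISMATCH")
        else:
            print(name, "not found in the current directory")
```

Run it from the folder that holds the downloaded files, or pass a full path to `check_download`.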

This answer was chosen by the asker as the best answer.
