问题
用LSTM时,报错:
UnknownError: [_Derived_] Fail to find the dnn implementation.
[[{{node CudnnRNN}}]]
[[sequential/lstm/StatefulPartitionedCall]] [Op:__inference_distributed_function_3171]
Function call stack:
distributed_function -> distributed_function -> distributed_function
用CNN时,报错:
UnknownError: Failed to get convolution algorithm. This is probably because cuDNN failed to initialize, so try looking to see if a warning log message was printed above.
[[node sequential/conv2d/Conv2D (defined at <ipython-input-23-2b9859ad26ab>:4) ]] [Op:__inference_distributed_function_1747]
Function call stack:
distributed_function
解决方法
添加下面代码:
from tensorflow.compat.v1 import ConfigProto
from tensorflow.compat.v1 import InteractiveSession
config = ConfigProto()
config.gpu_options.allow_growth = True
session = InteractiveSession(config=config)
或者如下:(两段意思相同)
config = tf.compat.v1.ConfigProto(allow_soft_placement=True)
config.gpu_options.per_process_gpu_memory_fraction = 0.3
tf.compat.v1.keras.backend.set_session(tf.compat.v1.Session(config=config))
可能是GPU内存不足造成的。
意思是对GPU进行按需分配。
主要原因是我的图像比较大,消耗GPU资源较多。但我的显卡(RTX2060)显存只有6GB,所以会出现这个错误。这个错误提示有很大的误导性,让人一直纠结CUDA和CuDNN的版本问题。本博文全部是转载的,感谢原博主。