运行tensorflow时出现tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed这个错误

运行tensorflow时出现tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed这个错误，查了一下说是gpu被占用了，从下面这里开始出问题的：

2019-10-17 09:28:49.495166: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1304] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 6382 MB memory) -> physical GPU (device: 0, name: GeForce GTX 1070, pci bus id: 0000:01:00.0, compute capability: 6.1)
(60000, 28, 28) (60000, 10)
2019-10-17 09:28:51.275415: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'cublas64_100.dll'; dlerror: cublas64_100.dll not found

图片说明

最后显示的问题：

图片说明
试了一下网上的方法，比如加代码：

gpu_options = tf.GPUOptions(per_process_gpu_memory_fraction=0.333)
sess = tf.Session(config=tf.ConfigProto(gpu_options=gpu_options))

但最后提示：

图片说明

现在不知道要怎么解决了。新手想试下简单的数字识别，步骤也是按教程一步步来的，可能用的版本和教程不一样，我用的是刚下的：2.0tensorflow和以下：

图片说明

不知道会不会有版本问题，现在紧急求助各位大佬，还有没有其它可以尝试的方法。测试程序加法运算可以执行，数字识别图片运行的时候我看了下，GPU最大占有率才0.2%，下面是完整数字图片识别代码：

import os
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers, optimizers, datasets

os.environ['TF_CPP_MIN_LOG_LEVEL'] = '2'

#gpu_options = tf.GPUOptions(per_process_gpu_memory_fraction=0.2)
#sess = tf.Session(config=tf.ConfigProto(gpu_options=gpu_options))
gpu_options = tf.GPUOptions(per_process_gpu_memory_fraction=0.333)
sess = tf.Session(config=tf.ConfigProto(gpu_options=gpu_options))

(x, y), (x_val, y_val) = datasets.mnist.load_data()
x = tf.convert_to_tensor(x, dtype=tf.float32) / 255.
y = tf.convert_to_tensor(y, dtype=tf.int32)
y = tf.one_hot(y, depth=10)
print(x.shape, y.shape)
train_dataset = tf.data.Dataset.from_tensor_slices((x, y))
train_dataset = train_dataset.batch(200)

model = keras.Sequential([
    layers.Dense(512, activation='relu'),
    layers.Dense(256, activation='relu'),
    layers.Dense(10)])

optimizer = optimizers.SGD(learning_rate=0.001)


def train_epoch(epoch):
    # Step4.loop
    for step, (x, y) in enumerate(train_dataset):

        with tf.GradientTape() as tape:
            # [b, 28, 28] => [b, 784]
            x = tf.reshape(x, (-1, 28 * 28))
            # Step1. compute output
            # [b, 784] => [b, 10]
            out = model(x)
            # Step2. compute loss
            loss = tf.reduce_sum(tf.square(out - y)) / x.shape[0]

        # Step3. optimize and update w1, w2, w3, b1, b2, b3
        grads = tape.gradient(loss, model.trainable_variables)
        # w' = w - lr * grad
        optimizer.apply_gradients(zip(grads, model.trainable_variables))

        if step % 100 == 0:
            print(epoch, step, 'loss:', loss.numpy())


def train():
    for epoch in range(30):
        train_epoch(epoch)


if __name__ == '__main__':
    train()

希望能有人给下建议或解决方法，拜谢！

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

3条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
plus_left 2022-04-07 19:46
关注
请问这个问题解决了吗

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

【tensorflow报错】tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed：XXX
2021-12-16 14:09

人工智能程序源的博客 tensorflow报错解决
tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed
2022-04-14 20:21

jxx29wendken的博客此错误主要是GPU的可用内存不足引起的错误，解决方法如下： import tensorflow as tf import os os.environ["CUDA_VISIBLE_DEVICES"] = '0' #或者'1' 调用运行GPU的编号 # 定义TensorFlow配置 config = tf....
【2022年】解决tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed
2022-04-24 23:15

newbie,,,的博客在网上搜了半天，大部分是说GPU被占用，或者是向量维数不正确什么的。但是我的问题并不在这里。解决方式：加入Tensorflow显存设置。 import cv2 import tensorflow.co
【已解决】“tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed“
2021-05-23 17:17

不爱喝牛奶的哈士奇的博客错：“tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed” 提示：查阅相关资料，怀疑是tensorflow的版本问题当前配置如下： tensorflow-gpu:1.15.0 cuda:10.0 cudnn:7.6.5 python...
tensorflow.python.framework.errors_impl.InternalError: Blas xGEMM launch failed
2021-11-04 15:34

小宝学技术的博客 tensorflow.python.framework.errors_impl.InternalError: Blas xGEMM launch failed : a.shape=[1,480000,64], b.shape=[1,480000,64], m=64, n=64, k=480000 [Op:Einsum] 查阅资料找到了以下两种解决方案： 1.在...
tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed问题解决思路之一
2019-12-09 19:39

Fire_dadada~的博客运行tensorflow时出现tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed这个错误运行tensorflow时出现tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch ...
【tensorflow.python.framework.errors_impl.InternalError: Blas SGEMM launch failed】错误解决方案
2020-11-16 21:50

望天边星宿的博客 E tensorflow/stream_executor/cuda/cuda_blas.cc:652] failed to run cuBLAS routine cublasSgemm_v2: CUBLAS_STATUS_EXECUTION_FAILED Traceback (most recent call last): File "E:/Project/keras-yolo3-person&...
tensorflow报错:tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed :
2020-01-09 13:28

尚墨1111的博客 #在tensorflow 2.0 里面，要想一个高阶迭代多次调用tf.GradientTape()时因为tape是一次性的，算完就会释放，所以要想重复调用必须设置persistent=’True‘，但是如果忘记了释放就会导致GPU被占用 w = tf.constant(1....
解决 tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed
2019-03-16 14:43

Jaichg的博客 tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed : a.shape=(1, 10), b.shape=(10, 2), m=10, n=2, k=10 [Op:MatMul] 原因： GPU被占用。tensorflow sess = tf.Sessio...
报错：ensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed
2019-08-21 09:14

LQF_的博客报错：ensorflow.python.framework.errors_impl.Internal...在用tensorflow2.0跑代码的时候，报这个错误：ensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed；解决方法——同时开启...
【解决方案】tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed
2019-11-26 21:30

小风_的博客 tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed： a.shape=(xx, xx), b.shape=(xx,xx), m=10, n=2, k=10 [Op:MatMul] 配置： tensorflow-gpu 2.0 pycharm 最佳...
tensorflow.python.framework.errors_impl.InternalError: 2 root error(s) found. (0) Internal: Blas GEM
2024-07-30 09:17

朋也透william的博客 InternalError: 2 root error(s) found. 均为 Internal: Blas GEMM launch failed_2 root erros found-CSDN博客https://blog.csdn.net/baidu_33597755/article/details/102311000
【超级小白代码路】错误：InternalError: Blas GEMM launch failed : a.shape=
2024-09-04 20:01

hi_ine的博客后来在一番验证后发现虽然任务管理器显示有两个显卡，且gpu0对应集显，gpu1对应独显，但是在代码运行时显示的gpu0对应的仍然是独显，可以在代码中输入显示使用设备信息查证，例如我的代码运行会显示。，我用的是...
InternalError:2 root error(s) found. Internal: Blas GEMM launch failed报错的解决办法Python
2023-03-13 22:18

xiajili的博客改报错！tensorflow.python.framework.errors_impl.InternalError: 2 root error(s) found.Internal: Blas GEMM launch failed
【tensorflow】InternalError: 2 root error(s) found. 均为 Internal: Blas GEMM launch failed
2020-12-08 11:14

浪子私房菜的博客原因：显存不足所造成。解决方案：在代码最前方加入代码 import os os.environ["CUDA_VISIBLE_DEVICES"] = "0" config = tf.ConfigProto(allow_soft_placement = True) gpu_options = tf.GPUOptions(per_process_...
InternalError: Blas GEMM launch failed : a.shape=(100, 784), b.shape=(784, 10), m=100, n=10另外一种问题的可能
2020-05-01 11:43

麓山南麓的博客 InternalError: Blas GEMM launch failed : a.shape=(100, 784), b.shape=(784, 10), m=100, n=10。在jupyter notebook出现这种错误，我通过cmd查看GPU使用情况，却发现没有程序占用GPU内存，也不是虚拟环境和...
【深度学习】训练时现Interal Error：Blas GEMM launch failed.
2021-07-08 09:55

无水先生的博客 Tensorflow程序运行中出现"Interal Error：Blas GEMM launch failed."，此错误主要是由于程序运行时GPU的空间不足而引起的。故一般出现此错误的时候，会发现程序提示的GPU freeMemory 少。
jupyter中显存不足，出现错误：InternalError: Blas GEMM launch failed : a.shape=(128, 784), b.shape=(784, 512),
2020-08-24 23:09

weixin_44426328的博客 jupyter中进行训练，代码正常但出现了错误：InternalError: Blas GEMM launch failed : a.shape=(128, 784), b.shape=(784, 512), m=128, n=512, k=784，经过查询是因为显存不足，这一点可以从两方面进行证实 1、...
Blas GEMM launch failed 错误解决方案
2021-10-12 16:10

UX_LEGEND的博客 tensorflow.python.framework.errors_impl.InternalError: 2 root error(s) found. (0) Internal: Blas GEMM launch failed : a.shape=(10, 10), b.shape=(10, 10), m=10, n=10, k=10 [[{{node sequential/simple_...
没有解决我的问题, 去提问

运行tensorflow时出现tensorflow.python.framework.errors_impl.InternalError: Blas GEMM launch failed这个错误

3条回答 默认 最新

3条回答默认最新