用pytorch做图像的多分类问题报错，可有偿

正在用pytorch做图像的多分类问题，网上的模板是二分类的，然后我想改成多分类的，模型训练没问题，但是到训练过程可视化的时候提示错误，一直过不去，有没有人可以帮忙看看，有偿也ok

现在报错信息是：RuntimeError: CUDA error: device-side assert triggered

CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.

报错这部分的代码如下：


```python
def visualize_model(model, num_images=6):
    was_training = model.training
    model.eval()
    images_handeled = 0
    fig = plt.figure()

    with torch.no_grad():
        for i, (inputs, labels) in enumerate(dataloaders[test_path]):
            inputs = inputs.to(device)
            labels = labels.to(device)

            outputs = model(inputs)
            _, preds = torch.max(outputs, 1)

            for j in range(inputs.size()[0]):
                images_handeled += 1
                ax = plt.subplot(num_images//2, 2, images_handeled)
                ax.axis('off')
                ax.set_title('predicted: {}'.format(class_names[preds[j]]))
                imshow(inputs.cpu().data[j])

                if images_handeled == num_images:
                    model.train(mode=was_training)
                    return
        model.train(mode=was_training)

base_model = train_model(resnet50, criterion, optimizer, exp_lr_scheduler, num_epochs=6)
visualize_model(base_model)
plt.show()



报错的具体信息如下：


RuntimeError                              Traceback (most recent call last)
Input In [56], in <cell line: 27>()
     24                     return
     25         model.train(mode=was_training)
---> 27 base_model = train_model(resnet50, criterion, optimizer, exp_lr_scheduler, num_epochs=6)
     28 visualize_model(base_model)
     29 plt.show()

Input In [54], in train_model(model, criterion, optimizer, scheduler, num_epochs)
      3 def train_model(model, criterion, optimizer, scheduler, num_epochs=20):
      4     since = time.time()
----> 6     best_model_wts = copy.deepcopy(model.state_dict())
      7     best_acc = 0.0
      9     for epoch in range(num_epochs):

File ~\anaconda3\lib\copy.py:172, in deepcopy(x, memo, _nil)
    170                 y = x
    171             else:
--> 172                 y = _reconstruct(x, memo, *rv)
    174 # If is its own copy, don't memoize.
    175 if y is not x:

File ~\anaconda3\lib\copy.py:296, in _reconstruct(x, memo, func, args, state, listiter, dictiter, deepcopy)
    294     for key, value in dictiter:
    295         key = deepcopy(key, memo)
--> 296         value = deepcopy(value, memo)
    297         y[key] = value
    298 else:

File ~\anaconda3\lib\copy.py:153, in deepcopy(x, memo, _nil)
    151 copier = getattr(x, "__deepcopy__", None)
    152 if copier is not None:
--> 153     y = copier(memo)
    154 else:
    155     reductor = dispatch_table.get(cls)

File ~\anaconda3\lib\site-packages\torch\_tensor.py:134, in Tensor.__deepcopy__(self, memo)
    125         raise RuntimeError(
    126             "The default implementation of __deepcopy__() for wrapper subclasses "
    127             "only works for subclass types that implement clone() and for which "
   (...)
    131             "different type."
    132         )
    133 else:
--> 134     new_storage = self.storage().__deepcopy__(memo)
    135     if self.is_quantized:
    136         # quantizer_params can be different type based on torch attribute
    137         quantizer_params: Union[
    138             Tuple[torch.qscheme, float, int],
    139             Tuple[torch.qscheme, Tensor, Tensor, int],
    140         ]

File ~\anaconda3\lib\site-packages\torch\storage.py:597, in TypedStorage.__deepcopy__(self, memo)
    596 def __deepcopy__(self, memo):
--> 597     return self._new_wrapped_storage(copy.deepcopy(self._storage, memo))

File ~\anaconda3\lib\copy.py:153, in deepcopy(x, memo, _nil)
    151 copier = getattr(x, "__deepcopy__", None)
    152 if copier is not None:
--> 153     y = copier(memo)
    154 else:
    155     reductor = dispatch_table.get(cls)

File ~\anaconda3\lib\site-packages\torch\storage.py:97, in _StorageBase.__deepcopy__(self, memo)
     95 if self._cdata in memo:
     96     return memo[self._cdata]
---> 97 new_storage = self.clone()
     98 memo[self._cdata] = new_storage
     99 return new_storage

File ~\anaconda3\lib\site-packages\torch\storage.py:111, in _StorageBase.clone(self)
    109 def clone(self):
    110     """Returns a copy of this storage"""
--> 111     return type(self)(self.nbytes(), device=self.device).copy_(self)

RuntimeError: CUDA error: device-side assert triggered
CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.

属于初学者了，很多错误信息不是很看得懂，我在网上搜了好久，看起来好像是标签还是数据的问题，但是照着改了好多种，都还是不行，不知道是模型的问题还是什么，但是上一步模型训练是没有报错的，这一步模型训练完可视化训练过程才报错，完全一头雾水

如果可以完美解决，私信聊具体价格

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除
收藏举报

5条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
爱晚乏客游 2022-12-28 11:29
关注
你这报错在这里面，跟你贴的代码一点关系没有，都还没执行到这里呢。
看报错信息，应该是你的输出口设置不对，原来两分类是两个，你多分类的话全连接层的输出也要改的。

本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决 2
无用
评论打赏
分享
举报编辑记录

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(4条)

报告相同问题？

关注问题

Pytorch 之torch.nn初探第4关：卷积--Convolution Layers
2022-10-05 09:45

ad_m1n的博客有偿提供CS的人工智能/网络空间安全方向的大学生课程设计、算法设计、项目设计的思路及实现指导；竞赛PPT、项目申报书等撰写润色等。经过前几节的的学习，想必大家对torch.nn有了初步的了解。接下来，向大家介绍...
【图像分类】基于深度学习的杂草类型识别（9种类别，ResNet网络）
2024-03-22 08:00

机器不学习我学习的博客图像分类项目实战往期回顾：【图像分类】基于yolov5的钢板表面缺陷分类(附代码和数据集) 【图像分类】基于PyTorch搭建LSTM实现MNIST手写数字体识别（单向LSTM，附完整代码和数据集）【图像分类】基于PyTorch搭建...
SD教程|从零开始，手把手教你本地部署Stable Diffusion Webui AI绘画
2025-06-03 21:44

网络安全架构师月月的博客 StableDiffusion是一款基于[深度学习的图像生成模型，它能够在没有任何人类指导的情况下生成高质量、逼真的图像。想要在自己的电脑上体验StableDiffusion的强大功能吗？本文将带你一步步了解如何在本地部署Stable ...
从零开始，手把手教你本地部署Stable Diffusion Webui AI绘画
2024-06-19 10:38

网络安全工程师老皮的博客大家好我是AI极客菌Stable Diffusion是一款基于深度学习的图像生成模型，它能够在没有任何人类指导的情况下生成高质量、逼真的图像。想要在自己的电脑上体验Stable Diffusion的强大功能吗？本文将带你一步步了解如何...
小白必看！Stable Diffusion Webui 本地部署全流程，手把手教你搭建 AI 绘画环境
2025-05-06 14:42

程序员一东的博客在人工智能图像生成领域，StableDiffusion作为深度学习技术的集大成者，无需人工干预，便能输出细节丰富、质感逼真的图像作品，成为众多创作者的效率神器。想要摆脱云端限制，在个人电脑上自由驾驭这一前沿技术？...
我用RTX4090显卡赚回了买它的钱
2025-09-25 15:23

Waiyuet Fung的博客本文探讨RTX4090显卡如何通过AI训练、算力租赁、渲染接单等模式实现投资回报，结合硬件配置、自动化运维与收益模型，系统阐述从单卡盈利到技术资产化运营的可行路径。
【双目深度估计】9种立体匹配算法整合包一键运行，支持相机标定、目标检测和tensorrt加速
2025-07-10 11:45

迟钝皮纳德的博客其中立体匹配包含如下算法：模型名称说明 sgbm 经典立体匹配算法，基于opencv实现 psm 2016年，精度较高，由于年份比较早，有很多改版，这里选用了比较稳定的pytorch版本 unimatch 2023年，统一光流、立体匹配和...
没有解决我的问题, 去提问

问题事件

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
系统已结题 1月5日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
已采纳回答 12月28日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
创建了问题 12月28日