qq_58527127 2023-12-13 21:37 采纳率: 0%
浏览 10

RuntimeError: CUDA error: device-side assert triggered

当我运行使用集合预测网络在GPU服务器上联合提取实体关系的代码时,我遇到了这个错误:
/pytorch/aten/src/ATen/native/cuda/Loss.cu:247: nll_loss_forward_reduce_cuda_kernel_2d: block: [0,0,0], thread: [0,0,0] Assertion t >= 0 && t < n_classes failed.
Traceback (most recent call last):
File "/root/miniconda3/lib/python3.8/runpy.py", line 194, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/root/miniconda3/lib/python3.8/runpy.py", line 87, in _run_code
exec(code, run_globals)
File "/root/autodl-tmp/SPN4RE/Nr_Partial_ch_SPN4RE-main/main.py", line 102, in
trainer.train_model()
File "/root/autodl-tmp/SPN4RE/Nr_Partial_ch_SPN4RE-main/trainer/trainer.py", line 77, in train_model
loss, _ = self.model(input_ids, attention_mask, targets)
File "/root/miniconda3/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1102, in _call_impl
return forward_call(*input, **kwargs)
File "/root/autodl-tmp/SPN4RE/Nr_Partial_ch_SPN4RE-main/models/setpred4RE.py", line 31, in forward
loss = self.criterion(outputs, targets)
File "/root/miniconda3/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1102, in _call_impl
return forward_call(*input, **kwargs)
File "/root/autodl-tmp/SPN4RE/Nr_Partial_ch_SPN4RE-main/models/set_criterion.py", line 46, in forward
losses.update(self.get_loss(loss, outputs, targets, indices))
File "/root/autodl-tmp/SPN4RE/Nr_Partial_ch_SPN4RE-main/models/set_criterion.py", line 96, in get_loss
return loss_map[loss](outputs, targets, indices, **kwargs)
File "/root/autodl-tmp/SPN4RE/Nr_Partial_ch_SPN4RE-main/models/set_criterion.py", line 113, in entity_loss
head_start_loss = F.cross_entropy(selected_pred_head_start, target_head_start)
File "/root/miniconda3/lib/python3.8/site-packages/torch/nn/functional.py", line 2846, in cross_entropy
return torch._C._nn.cross_entropy_loss(input, target, weight, _Reduction.get_enum(reduction), ignore_index, label_smoothing)
RuntimeError: CUDA error: device-side assert triggered
请问应该怎么解决?

  • 写回答

2条回答 默认 最新

  • IT工程师_二师兄 2023-12-13 22:02
    关注

    你把运行日志复制到记事本发给我

    评论

报告相同问题?

问题事件

  • 创建了问题 12月13日

悬赏问题

  • ¥15 is not in the mmseg::model registry。报错,模型注册表找不到自定义模块。
  • ¥15 安装quartus II18.1时弹出此error,怎么解决?
  • ¥15 keil官网下载psn序列号在哪
  • ¥15 想用adb命令做一个通话软件,播放录音
  • ¥30 Pytorch深度学习服务器跑不通问题解决?
  • ¥15 部分客户订单定位有误的问题
  • ¥15 如何在maya程序中利用python编写领子和褶裥的模型的方法
  • ¥15 Bug traq 数据包 大概什么价
  • ¥15 在anaconda上pytorch和paddle paddle下载报错
  • ¥25 自动填写QQ腾讯文档收集表