I added a CBAM attention module before layer1 and after layer4 of a ResNet, then started debugging. I set breakpoints in the forward methods of the ResNet class, the CA and SA modules, and Bottleneck: the debugger never steps into the forward of the ResNet class, but it does step into the CA and SA modules, and into Bottleneck as well.
My resnet_cbam code is adapted from https://github.com/luuuyi/CBAM.PyTorch. In the original code, the feature map passes through CA and SA and is then multiplied with the original feature map inside ResNet's forward, but that raised a dimension-mismatch error ("expected a 32-dimensional input, but got a 1-dimensional input"). To work around it, I moved the multiplication into the CA and SA modules themselves.
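For reference, the change amounts to switching between the two call patterns below (a minimal runnable sketch; AttnStub is a hypothetical stand-in for the real ChannelAttention/SpatialAttention modules listed further down):

import torch
import torch.nn as nn

class AttnStub(nn.Module):
    # Hypothetical stand-in for ChannelAttention/SpatialAttention:
    # computes a (N, 1, H, W) attention map from the input feature map.
    def __init__(self, multiply_inside=False):
        super().__init__()
        self.multiply_inside = multiply_inside

    def forward(self, x):
        attn_map = torch.sigmoid(x.mean(dim=1, keepdim=True))
        return attn_map * x if self.multiply_inside else attn_map

x = torch.randn(2, 32, 8, 8)

# Upstream CBAM.PyTorch pattern: the module returns the attention map and
# the caller multiplies it with the original feature map.
out_a = AttnStub(multiply_inside=False)(x) * x

# Modified pattern used in the code below: the multiply happens inside the module.
out_b = AttnStub(multiply_inside=True)(x)

print(out_a.shape, out_b.shape)  # both: torch.Size([2, 32, 8, 8])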
Relevant code:
import torch
import torch.nn as nn
import math
import torch.utils.model_zoo as model_zoo

__all__ = ['ResNet', 'resnet18_cbam', 'resnet34_cbam', 'resnet50_cbam', 'resnet101_cbam',
           'resnet152_cbam']

model_urls = {
    'resnet18': 'https://download.pytorch.org/models/resnet18-5c106cde.pth',
    'resnet34': 'https://download.pytorch.org/models/resnet34-333f7ec4.pth',
    'resnet50': 'https://download.pytorch.org/models/resnet50-19c8e357.pth',
    'resnet101': 'https://download.pytorch.org/models/resnet101-5d3b4d8f.pth',
    'resnet152': 'https://download.pytorch.org/models/resnet152-b121ed2d.pth',
}


def conv3x3(in_planes, out_planes, stride=1):
    "3x3 convolution with padding"
    return nn.Conv2d(in_planes, out_planes, kernel_size=3, stride=stride,
                     padding=1, bias=False)
class ChannelAttention(nn.Module):
    def __init__(self, in_planes, ratio=16):
        super(ChannelAttention, self).__init__()
        self.avg_pool = nn.AdaptiveAvgPool2d(1)
        self.max_pool = nn.AdaptiveMaxPool2d(1)
        # Shared MLP; channel reduction is controlled by `ratio`.
        self.fc = nn.Sequential(nn.Conv2d(in_planes, in_planes // ratio, 1, bias=False),
                                nn.ReLU(),
                                nn.Conv2d(in_planes // ratio, in_planes, 1, bias=False))
        self.sigmoid = nn.Sigmoid()

    def forward(self, x):
        avg_out = self.fc(self.avg_pool(x))
        max_out = self.fc(self.max_pool(x))
        out = self.sigmoid(avg_out + max_out)
        # Multiply with the input here, rather than in the caller as upstream
        # CBAM.PyTorch does (see the explanation above).
        out = out * x
        return out
class SpatialAttention(nn.Module):
    def __init__(self, kernel_size=7):
        super(SpatialAttention, self).__init__()
        self.conv1 = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2, bias=False)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x):
        avg_out = torch.mean(x, dim=1, keepdim=True)
        max_out, _ = torch.max(x, dim=1, keepdim=True)
        out = torch.cat([avg_out, max_out], dim=1)
        out = self.conv1(out)
        out = self.sigmoid(out)
        # Multiply with the input inside the module, same as ChannelAttention.
        out = out * x
        return out
class BasicBlock(nn.Module):
    expansion = 1

    def __init__(self, inplanes, planes, stride=1, downsample=None):
        super(BasicBlock, self).__init__()
        self.conv1 = conv3x3(inplanes, planes, stride)
        self.bn1 = nn.BatchNorm2d(planes)
        self.relu = nn.ReLU(inplace=True)
        self.conv2 = conv3x3(planes, planes)
        self.bn2 = nn.BatchNorm2d(planes)
        # self.ca = ChannelAttention(planes)
        # self.sa = SpatialAttention()
        self.downsample = downsample
        self.stride = stride

    def forward(self, x):
        residual = x

        out = self.conv1(x)
        out = self.bn1(out)
        out = self.relu(out)

        out = self.conv2(out)
        out = self.bn2(out)
        # out = self.ca(out) * out
        # out = self.sa(out) * out

        if self.downsample is not None:
            residual = self.downsample(x)

        out += residual
        out = self.relu(out)
        return out
class Bottleneck(nn.Module):
    expansion = 4

    def __init__(self, inplanes, planes, stride=1, downsample=None):
        super(Bottleneck, self).__init__()
        self.conv1 = nn.Conv2d(inplanes, planes, kernel_size=1, bias=False)
        self.bn1 = nn.BatchNorm2d(planes)
        self.conv2 = nn.Conv2d(planes, planes, kernel_size=3, stride=stride,
                               padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(planes)
        self.conv3 = nn.Conv2d(planes, planes * 4, kernel_size=1, bias=False)
        self.bn3 = nn.BatchNorm2d(planes * 4)
        self.relu = nn.ReLU(inplace=True)
        # self.ca = ChannelAttention(planes * 4)
        # self.sa = SpatialAttention()
        self.downsample = downsample
        self.stride = stride

    def forward(self, x):
        residual = x

        out = self.conv1(x)
        out = self.bn1(out)
        out = self.relu(out)

        out = self.conv2(out)
        out = self.bn2(out)
        out = self.relu(out)

        out = self.conv3(out)
        out = self.bn3(out)
        # out = self.ca(out) * out
        # out = self.sa(out) * out

        if self.downsample is not None:
            residual = self.downsample(x)

        out += residual
        out = self.relu(out)
        return out
class ResNet(nn.Module):
    def __init__(self, block, layers, num_classes=1000):
        self.inplanes = 64
        super(ResNet, self).__init__()
        self.conv1 = nn.Conv2d(3, 64, kernel_size=7, stride=2, padding=3,
                               bias=False)
        self.bn1 = nn.BatchNorm2d(64)
        self.relu = nn.ReLU(inplace=True)
        # Attention modules inserted after the stem, before layer1
        self.ca = ChannelAttention(self.inplanes)
        self.sa = SpatialAttention()
        self.maxpool = nn.MaxPool2d(kernel_size=3, stride=2, padding=1)
        self.layer1 = self._make_layer(block, 64, layers[0])
        self.layer2 = self._make_layer(block, 128, layers[1], stride=2)
        self.layer3 = self._make_layer(block, 256, layers[2], stride=2)
        self.layer4 = self._make_layer(block, 512, layers[3], stride=2)
        # Attention modules inserted after the last convolutional stage
        # (self.inplanes is now 512 * block.expansion)
        self.ca1 = ChannelAttention(self.inplanes)
        self.sa1 = SpatialAttention()
        self.avgpool = nn.AdaptiveAvgPool2d((1, 1))
        self.fc = nn.Linear(512 * block.expansion, num_classes)

        for m in self.modules():
            if isinstance(m, nn.Conv2d):
                n = m.kernel_size[0] * m.kernel_size[1] * m.out_channels
                m.weight.data.normal_(0, math.sqrt(2. / n))
            elif isinstance(m, nn.BatchNorm2d):
                m.weight.data.fill_(1)
                m.bias.data.zero_()

    def _make_layer(self, block, planes, blocks, stride=1):
        downsample = None
        if stride != 1 or self.inplanes != planes * block.expansion:
            downsample = nn.Sequential(
                nn.Conv2d(self.inplanes, planes * block.expansion,
                          kernel_size=1, stride=stride, bias=False),
                nn.BatchNorm2d(planes * block.expansion),
            )

        layers = []
        layers.append(block(self.inplanes, planes, stride, downsample))
        self.inplanes = planes * block.expansion
        for i in range(1, blocks):
            layers.append(block(self.inplanes, planes))

        return nn.Sequential(*layers)
    def forward(self, x):
        x = self.conv1(x)
        x = self.bn1(x)
        x = self.relu(x)
        # x = self.ca(x) * x  # upstream style: multiply in the caller
        x = self.ca(x)  # multiplication now happens inside CA/SA
        x = self.sa(x)
        print("this is the resnet backbone.")  # debug print that never appears
        x = self.maxpool(x)

        x1 = self.layer1(x)
        x2 = self.layer2(x1)
        x3 = self.layer3(x2)
        x4 = self.layer4(x3)

        last = self.ca1(x4)
        last = self.sa1(last)
        last = self.avgpool(last)
        last = torch.flatten(last, 1)
        last = self.fc(last)
        return last
def resnet18_cbam(pretrained=False, **kwargs):
    """Constructs a ResNet-18 model.

    Args:
        pretrained (bool): If True, returns a model pre-trained on ImageNet
    """
    model = ResNet(BasicBlock, [2, 2, 2, 2], **kwargs)
    if pretrained:
        # Merge the checkpoint into the current state dict, so the CBAM
        # modules (absent from the checkpoint) keep their initialization.
        pretrained_state_dict = model_zoo.load_url(model_urls['resnet18'])
        now_state_dict = model.state_dict()
        now_state_dict.update(pretrained_state_dict)
        model.load_state_dict(now_state_dict)
    return model


def resnet34_cbam(pretrained=False, **kwargs):
    """Constructs a ResNet-34 model.

    Args:
        pretrained (bool): If True, returns a model pre-trained on ImageNet
    """
    model = ResNet(BasicBlock, [3, 4, 6, 3], **kwargs)
    if pretrained:
        pretrained_state_dict = model_zoo.load_url(model_urls['resnet34'])
        now_state_dict = model.state_dict()
        now_state_dict.update(pretrained_state_dict)
        model.load_state_dict(now_state_dict)
    return model


def resnet50_cbam(pretrained=False, **kwargs):
    """Constructs a ResNet-50 model.

    Args:
        pretrained (bool): If True, returns a model pre-trained on ImageNet
    """
    model = ResNet(Bottleneck, [3, 4, 6, 3], **kwargs)
    if pretrained:
        pretrained_state_dict = model_zoo.load_url(model_urls['resnet50'])
        now_state_dict = model.state_dict()
        now_state_dict.update(pretrained_state_dict)
        model.load_state_dict(now_state_dict)
    return model


def resnet101_cbam(pretrained=False, **kwargs):
    """Constructs a ResNet-101 model.

    Args:
        pretrained (bool): If True, returns a model pre-trained on ImageNet
    """
    model = ResNet(Bottleneck, [3, 4, 23, 3], **kwargs)
    if pretrained:
        pretrained_state_dict = model_zoo.load_url(model_urls['resnet101'])
        now_state_dict = model.state_dict()
        now_state_dict.update(pretrained_state_dict)
        model.load_state_dict(now_state_dict)
    return model


def resnet152_cbam(pretrained=False, **kwargs):
    """Constructs a ResNet-152 model.

    Args:
        pretrained (bool): If True, returns a model pre-trained on ImageNet
    """
    model = ResNet(Bottleneck, [3, 8, 36, 3], **kwargs)
    if pretrained:
        pretrained_state_dict = model_zoo.load_url(model_urls['resnet152'])
        now_state_dict = model.state_dict()
        now_state_dict.update(pretrained_state_dict)
        model.load_state_dict(now_state_dict)
    return model
if __name__ == "__main__":
    test = resnet50_cbam()
    print(test)
Runtime output and error message
(The error is the dimension mismatch quoted above: expected a 32-dimensional input, but got a 1-dimensional one.)
My approach and what I have tried
My original goal was simply to add CBAM modules at specific layers of ResNet, but after adding them I got the input-dimension-mismatch error, so I started debugging. I set breakpoints and print statements in CA and SA, and found that the forward function of the ResNet class never prints anything and its breakpoints are never hit.
(Could the program be running in parallel on the GPU, so that the output gets swallowed? But in another blogger's post, the same print does show up.)
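One way to check this without a debugger is a forward hook on the top-level module (a generic PyTorch sketch, independent of any training framework): if the hook never fires during training, the framework is simply not calling this module's forward at all, e.g. because it wraps the model or only uses its sub-modules as a backbone.

import torch

model = resnet50_cbam()

def top_level_hook(module, inputs, output):
    # Fires each time model.forward runs to completion.
    print(type(module).__name__, "forward was called; output shape:", output.shape)

handle = model.register_forward_hook(top_level_hook)
model(torch.randn(1, 3, 224, 224))  # the hook should print exactly once
handle.remove()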
What I want to achieve
Add a CBAM module before the first residual stage of ResNet and another after the last residual stage, and find out why the forward function of the ResNet class never runs or prints.
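As a first step, a minimal sanity check (assuming CPU and a dummy batch): run the file standalone. If the dimension-mismatch error comes from the model definition itself, it should reproduce here; if it does not, the problem lies in how the training framework builds or calls the model.

import torch

model = resnet50_cbam()
model.eval()
with torch.no_grad():
    out = model(torch.randn(2, 3, 224, 224))
print(out.shape)  # expected: torch.Size([2, 1000])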
Thanks, everyone.