西窗虫 2024-04-24 20:50

Problem building ResNet with PyTorch

I ran into a problem while reimplementing ResNet-18 in PyTorch:

import matplotlib.pyplot as plt
import torch
import torchvision
import torch.nn as nn
from torch.utils.data import DataLoader

trans = torchvision.transforms.ToTensor()
train_set = torchvision.datasets.CIFAR10(root="./CIFAR10",download=True,transform=trans,train=True)
test_set = torchvision.datasets.CIFAR10(root="./CIFAR10",download=True,transform=trans,train=False)

train_dataloader = DataLoader(train_set,shuffle=True,batch_size=128)
test_dataloader = DataLoader(test_set,shuffle=False,batch_size=128)

class my_BasicBlock(nn.Module):
    def __init__(self,in_channal,out_channal,stride):
        super(my_BasicBlock,self).__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_channal,out_channal,kernel_size=3,stride=stride[0],padding=1),
            nn.BatchNorm2d(out_channal),
            nn.ReLU(inplace=True),
            nn.Conv2d(out_channal,out_channal,kernel_size=3,stride=stride[1],padding=1),
            nn.BatchNorm2d(out_channal)
        )

        self.shortcut = nn.Sequential()
        if stride[0] != 1 or in_channal != out_channal:
            self.shortcut = nn.Sequential(
                nn.Conv2d(in_channal,out_channal,kernel_size=1,stride=stride[0]),
                nn.BatchNorm2d(out_channal)
            )
    def forward(self,x):
        y = self.net(x)
        y += self.shortcut(x)
        y = nn.ReLU(y)
        return y


class my_resnet18(nn.Module):
    def __init__(self,my_BasicBlock,numclasser=10):
        super(my_resnet18,self).__init__()
        self.conv1 = nn.Sequential(
            nn.Conv2d(3,64,7,2,3),
            nn.BatchNorm2d(64),
            nn.MaxPool2d(kernel_size=3,stride=2,padding=1)
        )
        self.conv2 = nn.Sequential(
            my_BasicBlock(64,64,[1,1]),
            my_BasicBlock(64,64,[1,1])
        )
        self.conv3 = nn.Sequential(
            my_BasicBlock(64,128,[2,1]),
            my_BasicBlock(128,128,[1,1])
        )

    def forward(self,x):
        y = self.conv1(x)
        y = self.conv2(y)
        y = self.conv3(y)

        return y

resnet = my_resnet18(my_BasicBlock)
print(resnet)
for data,targer in train_dataloader:
    output =resnet(data)
    print(output.size())
    break

Here I created a BasicBlock and used it in my_resnet18. (The ResNet is incomplete; I only want to check the shape of the convolution output.) But running this raises an error:


TypeError: conv2d() received an invalid combination of arguments - got (ReLU, Parameter, Parameter, tuple, tuple, tuple, int), but expected one of:
 * (Tensor input, Tensor weight, Tensor bias, tuple of ints stride, tuple of ints padding, tuple of ints dilation, int groups)
      didn't match because some of the arguments have invalid types: (!ReLU!, !Parameter!, !Parameter!, !tuple of (int, int)!, !tuple of (int, int)!, !tuple of (int, int)!, int)

The message says the arguments passed into the convolution layer are invalid, but when I print the resnet model it looks the same as a correct model. Where exactly does this code go wrong?


4 answers

  • 专家-赤兔 (programming frameworks) 2024-04-24 20:50


    The problem is in my_BasicBlock.forward(): the line y = nn.ReLU(y) does not apply ReLU to y. nn.ReLU is a module class, so this call constructs a new ReLU module, with y silently bound to its inplace parameter, and assigns that module object to y. The block therefore returns a ReLU module instead of a tensor, and when the next block feeds it into its first convolution you get exactly the error you saw: conv2d() received a ReLU where it expected a Tensor. This is also why print(resnet) looks fine: printing a model only shows the modules registered in __init__, not what forward() actually does.
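
    A minimal demonstration of the mix-up (my own sketch, separate from your code):

    import torch
    import torch.nn as nn

    y = torch.randn(1, 64, 8, 8)

    wrong = nn.ReLU(y)    # builds a ReLU module; y is silently taken as the inplace flag
    print(type(wrong))    # <class 'torch.nn.modules.activation.ReLU'>

    right = nn.ReLU()(y)  # instantiate first, then call the module on the tensor
    print(type(right))    # <class 'torch.Tensor'>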

    To fix it, apply the activation to the tensor itself: either instantiate the module and call it, nn.ReLU()(y), or use the functional form torch.relu(y). Here is the corrected code:

    class my_BasicBlock(nn.Module):
        def __init__(self, in_channal, out_channal, stride):
            super(my_BasicBlock, self).__init__()
            self.net = nn.Sequential(
                nn.Conv2d(in_channal, out_channal, kernel_size=3, stride=stride[0], padding=1),
                nn.BatchNorm2d(out_channal),
                nn.ReLU(inplace=True),
                nn.Conv2d(out_channal, out_channal, kernel_size=3, stride=stride[1], padding=1),
                nn.BatchNorm2d(out_channal)
            )
    
            self.shortcut = nn.Sequential()
            if stride[0] != 1 or in_channal != out_channal:
                self.shortcut = nn.Sequential(
                    nn.Conv2d(in_channal, out_channal, kernel_size=1, stride=stride[0]),
                    nn.BatchNorm2d(out_channal)
                )
    
        def forward(self, x):
            y = self.net(x)
            y += self.shortcut(x)
            
            # correct: instantiate ReLU, then call it on the tensor
            # (nn.ReLU(y) would construct a module instead of applying one)
            y = nn.ReLU()(y)
            return y
    

    Now the code should run without the TypeError. If you hit other problems or errors, feel free to ask.
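
    As an aside (a sketch of my own, not required for the fix): constructing a fresh nn.ReLU module on every forward pass works, but the more idiomatic patterns are to create the module once in __init__ and reuse it, as torchvision's own BasicBlock does, or to use the functional form:

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class TidyBlock(nn.Module):  # hypothetical minimal example, not the full block
        def __init__(self):
            super().__init__()
            self.relu = nn.ReLU(inplace=True)  # option 1: build once in __init__

        def forward(self, x):
            y = self.relu(x)  # option 1: reuse the stored module
            # option 2 (equivalent): y = F.relu(x) -- no module object needed
            return y

    print(TidyBlock()(torch.randn(2, 4)).min() >= 0)  # tensor(True): negatives clipped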

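    For reference (my own calculation, assuming the corrected classes above are defined): a 32x32 CIFAR-10 input is halved twice by conv1 (the stride-2 convolution, then stride-2 max pooling) to 8x8, kept at 8x8 by conv2, and halved to 4x4 with 128 channels by conv3's first stride-2 block, so the test loop should print torch.Size([128, 128, 4, 4]):

    import torch

    resnet = my_resnet18(my_BasicBlock)  # the corrected block from this thread
    x = torch.randn(128, 3, 32, 32)      # one CIFAR-10-sized batch
    print(resnet(x).size())              # expected: torch.Size([128, 128, 4, 4])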
