Road crack defect detection with PyTorch semantic segmentation

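I am using Python to detect road crack defects (semantic segmentation), but training never produces a correct result.
I trained for 8 epochs, and from the second epoch onward the accuracy did not change at all.

Model output at epoch 8:

I don't know whether the problem lies in how I process the model's output or in the model itself.

Here is my code: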

from torch.utils import data
from torch import nn, optim
from torchvision import transforms
import torch
from PIL import Image
from matplotlib import pyplot as plt
import os

# Fall back to the CPU when no GPU is available
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

transformer = transforms.Compose([
    transforms.Resize((256, 256)),
    transforms.ToTensor()
])


def booltf(tensor):
    tensor[tensor >= 0.5] = 1
    tensor[tensor < 0.5] = 0

    return tensor
# Define the dataset
class SegmentationDataset(data.Dataset):
    def __init__(self, img_file, label_file):
        self.imgs = []
        self.labels = []
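        # Sort both listings so images and labels are paired by file name;
        # os.listdir() returns entries in arbitrary order
        for img, label in zip(sorted(os.listdir(img_file)), sorted(os.listdir(label_file))):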
            self.imgs.append(os.path.join(img_file, img))
            self.labels.append(os.path.join(label_file, label))

    def __getitem__(self, index):
        img = self.imgs[index]
        label = self.labels[index]
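        img = Image.open(img).convert("RGB")    # force 3 channels for the first conv layer
        label = Image.open(label).convert("L")  # force a single-channel mask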
        img_tensor = transformer(img)
        label_tensor = transformer(label)
        label_tensor = booltf(label_tensor)
        label_tensor = label_tensor.squeeze().long()
        return img_tensor, label_tensor

    def __len__(self):
        return len(self.imgs)

# Display an image with matplotlib
def toview(tensor, is_gray=True):
    """
    :param tensor: the tensor to display
    :param is_gray: if True, show a grayscale image; otherwise show a color image
    """
    img = transforms.ToPILImage()(tensor.float())
    if is_gray:
        plt.imshow(img, cmap='gray')
    else:
        plt.imshow(img)

# Define the U-Net model
class Downsample(nn.Module):
    def __init__(self, in_channels, out_channels):
        super(Downsample, self).__init__()
        self.conv_relu = nn.Sequential(
            nn.Conv2d(in_channels, out_channels, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.Conv2d(out_channels, out_channels, kernel_size=3, padding=1),
            nn.ReLU()
        )
        self.pool = nn.MaxPool2d(kernel_size=2)

    def forward(self, x, is_pool=True):
        if is_pool:
            x = self.pool(x)
        x = self.conv_relu(x)
        return x

class Upsample(nn.Module):
    def __init__(self, channels):
        super(Upsample, self).__init__()
        self.conv_relu = nn.Sequential(
            nn.Conv2d(2 * channels, channels, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
        )
        self.upconv = nn.Sequential(
            nn.ConvTranspose2d(channels, channels // 2, kernel_size=3, stride=2, padding=1, output_padding=1)
        )

    def forward(self, x):
        x = self.conv_relu(x)
        x = self.upconv(x)
        return x

class Unet_model(nn.Module):
    def __init__(self):
        super(Unet_model, self).__init__()
        self.down1 = Downsample(3, 64)
        self.down2 = Downsample(64, 128)
        self.down3 = Downsample(128, 256)
        self.down4 = Downsample(256, 512)
        self.down5 = Downsample(512, 1024)

        self.up = nn.Sequential(
            nn.ConvTranspose2d(1024, 512, kernel_size=3, stride=2, padding=1, output_padding=1),
            nn.ReLU()
        )

        self.up1 = Upsample(512)
        self.up2 = Upsample(256)
        self.up3 = Upsample(128)

        self.conv_2 = Downsample(128, 64)
        self.last = nn.Conv2d(64, 2, kernel_size=1)

    def forward(self, input):
        x1 = self.down1(input, is_pool=False)
        x2 = self.down2(x1)
        x3 = self.down3(x2)
        x4 = self.down4(x3)
        x5 = self.down5(x4)

        x5 = self.up(x5)

        x5 = torch.cat([x4, x5], dim=1)

        x5 = self.up1(x5)

        x5 = torch.cat([x3, x5], dim=1)
        x5 = self.up2(x5)
        x5 = torch.cat([x2, x5], dim=1)
        x5 = self.up3(x5)

        x5 = torch.cat([x1, x5], dim=1)
        x5 = self.conv_2(x5, is_pool=False)
        x5 = self.last(x5)

        return x5

img_file = r"./data/CrackForest-dataset-master/image"
label_file = r"./data/CrackForest-dataset-master/groundTruthPngImg"
batch_size = 4

dataset = SegmentationDataset(img_file, label_file)
train_data = data.DataLoader(dataset, batch_size=batch_size, shuffle=True)

net = Unet_model().to(device)
criterion = nn.CrossEntropyLoss()
optimizer = optim.Adam(net.parameters())

accuracy = []

for epoch in range(1, 21):
    correct = 0
    total = 0
    for image, label in train_data:
        image, label = image.to(device), label.to(device)

        out = net(image)
        loss = criterion(out, label)

        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

        with torch.no_grad():
            out = torch.argmax(out, dim=1)
            correct += (out == label).sum().item()
            # Count the pixels actually seen; the last batch may hold fewer than batch_size images
            total += label.numel()
        accuracy.append((correct / total) * 100)
    plt.subplot(1, 3, 1)
    toview(image[0], is_gray=False)
    plt.xlabel('input')
    plt.subplot(1, 3, 2)
    toview(label[0])
    plt.xlabel('label')
    plt.subplot(1, 3, 3)
    toview(out[0])
    plt.xlabel('out')
    plt.show()
    print("epoch:{}, accuracy:{:.4f}%".format(epoch, accuracy[-1]))

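Approach: first read the image and label and convert them to tensors. Because this is a binary classification problem, and ToTensor scales the values of label_tensor into [0, 1], I set every value in label_tensor that is at least 0.5 to 1 and every value below 0.5 to 0. In the code I defined a function for this: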

def booltf(tensor):
    tensor[tensor >= 0.5] = 1
    tensor[tensor < 0.5] = 0

    return tensor
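
The thresholding step above can also be written without modifying the input tensor in place. The following is only a sketch: `binarize` is a hypothetical name that does not appear in the original code, and the small `soft` tensor stands in for a label after `ToTensor` and a resize, where interpolation can produce values strictly between 0 and 1.

```python
import torch


def binarize(mask: torch.Tensor, threshold: float = 0.5) -> torch.Tensor:
    """Return a new 0/1 integer mask; the input tensor is left unchanged."""
    return (mask >= threshold).long()


# Stand-in for a resized label: values between 0 and 1 appear along edges
soft = torch.tensor([[0.0, 0.2, 0.5],
                     [0.7, 1.0, 0.49]])
hard = binarize(soft)
print(hard.tolist())  # [[0, 0, 1], [1, 1, 0]]
```

Because the comparison already yields the final classes, the separate `.long()` call in `__getitem__` would no longer be needed if this variant were used.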

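I then compute the loss with nn.CrossEntropyLoss(), run the optimizer, and update the parameters.
After that I use torch.argmax(out, dim=1) to turn the output out into the binary image I want, and finally display the images with ToPILImage and plt.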
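
To help separate "output processing" from "the model" as the cause, here is a small sketch of the tensor shapes involved in the loss and `argmax` steps described above, plus a per-class pixel count on the prediction. The random tensors are stand-ins for `out` and `label`; they are assumptions for illustration, not values from the actual run.

```python
import torch
from torch import nn

torch.manual_seed(0)
logits = torch.randn(2, 2, 4, 4)          # (batch, classes, H, W): raw scores, like `out`
target = torch.randint(0, 2, (2, 4, 4))   # (batch, H, W): int64 class indices, like `label`

loss = nn.CrossEntropyLoss()(logits, target)  # scalar tensor
pred = torch.argmax(logits, dim=1)            # (batch, H, W): one class index per pixel

# Pixels assigned to each class; a result like [N, 0] means "everything is background"
counts = torch.bincount(pred.flatten(), minlength=2)
print(loss.item(), tuple(pred.shape), counts.tolist())
```

If the same count on the real `out` shows that class 1 is never predicted, the post-processing is probably doing what it should and the network itself has settled on predicting background.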

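Answer by youcans_ (2023-01-09 09:30):
You are using only this one image, and the output is a 2-class classification, so after 2 epochs of training it is hard to improve any further. The current accuracy is about as good as it will get.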
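
One possible way to see why pixel accuracy can stop moving early in a two-class crack problem is to compare it with a per-class metric. The sketch below uses a hypothetical 8x8 label with a thin crack and a prediction that is background everywhere; the mask, the `crack_iou` helper, and the numbers are illustrative assumptions, not measurements from the dataset in the question.

```python
import torch

# Hypothetical 8x8 label with a one-pixel-wide "crack" along row 3
target = torch.zeros(8, 8, dtype=torch.long)
target[3, :] = 1

# A model that predicts background for every pixel
pred = torch.zeros_like(target)

accuracy = (pred == target).float().mean().item()


def crack_iou(pred: torch.Tensor, target: torch.Tensor) -> float:
    """Intersection over union for class 1 (taken as 1.0 when both masks are empty)."""
    inter = ((pred == 1) & (target == 1)).sum().item()
    union = ((pred == 1) | (target == 1)).sum().item()
    return inter / union if union else 1.0


print(accuracy, crack_iou(pred, target))  # 0.875 0.0
```

In this toy case the accuracy is already high although no crack pixel is found, so tracking the crack-class IoU next to the accuracy may show more clearly whether training is still making progress.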

Question created on January 8
