VGG16得到的混淆矩阵错误

这是main代码：

import os
os.environ["KMP_DUPLICATE_LIB_OK"] = "TRUE"
import json

import torch
from torchvision import transforms, datasets
import numpy as np
from tqdm import tqdm
import matplotlib.pyplot as plt
from prettytable import PrettyTable

from model import vgg


class ConfusionMatrix(object):


    def __init__(self, num_classes: int, labels: list):
        self.matrix = np.zeros((num_classes, num_classes))
        self.num_classes = num_classes
        self.labels = labels

    def update(self, preds, labels):
        for p, t in zip(preds, labels):
            self.matrix[p, t] += 1

    def summary(self):
        # calculate accuracy
        sum_TP = 0
        for i in range(self.num_classes):
            sum_TP += self.matrix[i, i]
        acc = sum_TP / np.sum(self.matrix)
        print("the model accuracy is ", acc)

        # precision, recall, F1-score,          specificity
        table = PrettyTable()
        table.field_names = ["label", "Precision", "Recall", "F1-score", "Specificity"] #"Specificity"
        for i in range(self.num_classes):
            TP = self.matrix[i, i]
            FP = np.sum(self.matrix[i, :]) - TP
            FN = np.sum(self.matrix[:, i]) - TP
            TN = np.sum(self.matrix) - TP - FP - FN
            Precision = round(TP / (TP + FP), 3) if TP + FP != 0 else 0.
            Recall = round(TP / (TP + FN), 3) if TP + FN != 0 else 0.
            Specificity = round(TN / (TN + FP), 3) if TN + FP != 0 else 0.
            f1_score = round((2*Precision*Recall)/(Precision+Recall),3)
            table.add_row([self.labels[i], Precision, Recall, f1_score,Specificity])
        print(table)

    def plot(self):
        matrix = self.matrix
        print(matrix)
        plt.imshow(matrix, cmap=plt.cm.Blues)

        # # 设置x轴坐标label
        # plt.xticks(range(self.num_classes), self.labels, rotation=45)
        # # 设置y轴坐标label
        # plt.yticks(range(self.num_classes), self.labels)
        # 设置x轴坐标label为1, 2, 3
        plt.xticks(range(self.num_classes), list(range(1, self.num_classes + 1)), rotation=45)
        # 设置y轴坐标label为1, 2, 3
        plt.yticks(range(self.num_classes), list(range(1, self.num_classes + 1)))
        # 显示colorbar
        plt.colorbar()
        plt.xlabel('True Labels')
        plt.ylabel('Predicted Labels')
        plt.title('Confusion matrix')

        # 在图中标注数量/概率信息
        thresh = matrix.max() / 2
        for x in range(self.num_classes):
            for y in range(self.num_classes):
                # 注意这里的matrix[y, x]不是matrix[x, y]
                info = int(matrix[y, x])
                plt.text(x, y, info,
                         verticalalignment='center',
                         horizontalalignment='center',
                         color="white" if info > thresh else "black")
        plt.tight_layout()
        plt.show()


if __name__ == '__main__':
    device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
    print(device)

    data_transform = transforms.Compose([transforms.Resize(256),
                                         transforms.CenterCrop(224),
                                         transforms.ToTensor(),
                                         transforms.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225])])
    # 数据集路径
    data_root = r"C:\Users\yingnuo.DESKTOP-9E5CS2I\Desktop\T1-data"
    assert os.path.exists(data_root), "data path {} does not exist.".format(data_root)

    validate_dataset = datasets.ImageFolder(root=os.path.join(data_root, "test"),
                                            transform=data_transform)

    batch_size = 16
    validate_loader = torch.utils.data.DataLoader(validate_dataset,
                                                  batch_size=batch_size, shuffle=False,
                                                  num_workers=2)
    net = vgg(model_name="vgg16", num_classes=4)
    # load pretrain weights
    model_weight_path = "./weights/best_model.pth"
    assert os.path.exists(model_weight_path), "cannot find {} file".format(model_weight_path)
    weight_dict = torch.load(model_weight_path, map_location=device)
    net.load_state_dict(weight_dict, strict=False)
    net.to(device)

    # read class_indict
    json_label_path = './class_indices.json'
    assert os.path.exists(json_label_path), "cannot find {} file".format(json_label_path)
    json_file = open(json_label_path, 'r')
    class_indict = json.load(json_file)

    labels = [label for _, label in class_indict.items()]
    confusion = ConfusionMatrix(num_classes=4, labels=labels)
    net.eval()
    with torch.no_grad():
        for val_data in tqdm(validate_loader):
            val_images, val_labels = val_data
            outputs = net(val_images.to(device))
            outputs = torch.softmax(outputs, dim=1)
            outputs = torch.argmax(outputs, dim=1)
            confusion.update(outputs.to("cpu").numpy(), val_labels.to("cpu").numpy())
    confusion.plot()
    confusion.summary()

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

4条回答默认最新

专家-赤兔[在线] 全栈领域优质创作者 2024-04-18 16:50

关注

引自免费微信小程序：皆我百晓生

根据您提供的代码和混淆矩阵，看起来存在一些逻辑上的错误。首先，让我们修复这些问题并重新运行您的代码。

问题1：模型输出处理

在for val_data in tqdm(validate_loader):循环里，您应该将模型输出转换为预测标签，而不是取softmax后取最大值。因为混淆矩阵是基于原始类别预测的，而不是概率分布。

修复方法：

outputs = torch.argmax(outputs, dim=1)

改为

outputs = outputs.to("cpu").numpy().argmax(axis=1)

问题2：类别的表示

混淆矩阵中的标签显示为1、2、3、4，而实际的标签列表是通过读取json文件得到的。为了保持一致，我们需要确保混淆矩阵中的标签与实际标签列表匹配。

修复方法： 在confusion.plot()函数内部，将x轴和y轴的标签从整数更改为对应的字符串标签。

修复后的代码：

def plot(self):
    matrix = self.matrix
    print(matrix)
    plt.imshow(matrix, cmap=plt.cm.Blues)

    # 设置x轴坐标label
    plt.xticks(range(self.num_classes), self.labels, rotation=45)
    # 设置y轴坐标label
    plt.yticks(range(self.num_classes), self.labels)
    # 设置x轴坐标label为1, 2, 3
    # plt.xticks(range(self.num_classes), list(range(1, self.num_classes + 1)), rotation=45)
    # 设置y轴坐标label为1, 2, 3
    # plt.yticks(range(self.num_classes), list(range(1, self.num_classes + 1)))
    # 显示colorbar
    plt.colorbar()
    plt.xlabel('True Labels')
    plt.ylabel('Predicted Labels')
    plt.title('Confusion matrix')

    # 在图中标注数量/概率信息
    thresh = matrix.max() / 2
    for x in range(self.num_classes):
        for y in range(self.num_classes):
            # 注意这里的matrix[y, x]不是matrix[x, y]
            info = int(matrix[y, x])
            plt.text(x, y, info,
                     verticalalignment='center',
                     horizontalalignment='center',
                     color="white" if info > thresh else "black")
    plt.tight_layout()
    plt.show()

问题3：计算精度、召回率和F1分数时

当您计算这些指标时，您没有考虑到特殊情况，即分母可能为零的情况。在这种情况下，您可以使用scipy.stats库中的binary_confusion_matrix函数来自动处理这种情况，并返回精确度、召回率和F1分数。

修复方法： 添加from scipy.stats import binary_confusion_matrix到顶部，并替换summary函数为以下内容：

from scipy.stats import binary_confusion_matrix

def summary(self):
    # calculate accuracy
    sum_TP = 0
    for i in range(self.num_classes):
        sum_TP += self.matrix[i, i]
    acc = sum_TP / np.sum(self.matrix)
    print("the model accuracy is ", acc)

    # 使用scipy计算精确度、召回率和F1分数
    cm = binary_confusion_matrix(list(range(1, self.num_classes + 1)),
                                  list(range(1, self.num_classes + 1)), self.matrix.flatten())

    table = PrettyTable()
    table.field_names = ["label", "Precision", "Recall", "F1-score"]
    for i in range(self.num_classes):
        Precision = round(cm[1][i] / (cm[1][i] + cm[0][i]), 3)
        Recall = round(cm[1][i] / (cm[1][i] + cm[2][i]), 3)
        f1_score = round((2 * Precision * Recall) / (Precision + Recall), 3)
        table.add_row([self.labels[i], Precision, Recall, f1_score])
    print(table)

现在，您只需按照上述更改更新代码，并运行程序，您应该会得到一个正确的混淆矩阵和相应的统计摘要。

编辑记录

报告相同问题？

关注问题

Vgg16模型分析图片输出的向量维度可以改变吗？机器学习深度学习自然语言处理
2023-03-18 02:20

回答 5 已采纳参考GPT和自己的思路： Vgg16模型是一个预训练的深度卷积神经网络，它由若干个卷积层和池化层构成。在输入一张图片后，模型会输出一个特征向量，该向量的维度通常为1000，用来表示该图片的分类信息。这
【深度学习】使用自己写的VGG16网络训练精度不提升 pytorch 深度学习神经网络
2022-05-30 00:00

回答 2 已采纳删除最后的softmax层，在内个relu之前加归一化就好了
vgg16每次跑出的结果都相同 tensorflow 深度学习神经网络
2022-04-01 02:02

回答 1 已采纳可以调整参数，多运行几次（cpu跟gpu不影响结果，只是gpu运行快一点）
基于机器学习(Machine Learning)的图像识别技术基本概念、术语、算法原理
2023-08-03 02:30

AI天才研究院的博客全连接层全连接层的操作可以表示为矩阵乘法： y = W x + b y = Wx + b y=Wx+b 其中， W W W是权重矩阵， x x x是输入向量， b b b是偏置向量。 4.2 公式推导过程以下我们将推导卷积神经网络中的反向传播过程，以...
ModuleNotFoundError: No module named 'vgg'明明有VGG，还出现这个错误 python pytorch 神经网络
2021-08-31 20:17

回答 2 已采纳 ?你import了么
为什么在pytorch中使用VGG16不用预训练，自己从头训练猫狗分类，正确率只有74%就上不去了？ pytorch 深度学习神经网络
2021-10-21 23:53

回答 1 已采纳官方pretrain的模型数据集是imagenet，样本数是你这个数据集的很多倍，这样的vgg网络卷积层的参数基本收敛到比较好的一个情况，你直接用来finetune只要稍微微调一下最后的fc层就可以得
torchvision中预训练的VGG16和一般论文里的VGG-VD-16有什么区别 pytorch 图像处理深度学习
2023-04-06 16:16

回答 1 已采纳 VGG16 和 VGG-VD-16 在网络结构上是有区别的。VGG16 是 VGG 网络的一种实现，而 VGG-VD-16 则是 VGG 网络的一种变体。 PyTorch 中的 torchvision
万字长文，探索建筑智能前沿
2020-12-12 22:41

shadowcz007的博客所以RNN最初是为语言处理而开发的。只有少数研究侧重于RNN的应用。比如(Luo, Wang, and Xu 2018)将LSTM网络应用于学习弯曲橡胶棒的材料特性。输入的数据是弯曲杆中80个均匀分布的点和初始材料的高度，输出的数据是...
如何在VGG网络中加入金字塔结构？ pytorch 神经网络
2021-09-01 15:43

回答 1 已采纳 VGG网络加FPN（金子塔结构）实现起来不难，首先看一下VGG的实现：简介VggNet与其pytorch实现_清华和你，要上一个的博客-CSDN博客_pytorch vggnet 目前正在学
对torchvison中VGG19的问题 python pytorch
2022-09-14 22:20

回答 1 已采纳具体实现就是这个feature中，
tensorflow预训练模型input格式错误 python tensorflow 人工智能
2023-02-03 11:52

回答 4 已采纳后续增加numpy解决
【深度智能】：迈向高级时代的人工智能全景指南
2024-09-16 15:20

小李很执着的博客案例解析： 混淆矩阵：在 scikit-learn 中使用混淆矩阵评估分类模型的性能，分析不同类别的分类错误情况。 ROC 曲线：绘制 ROC 曲线并计算 AUC，评估模型在不同阈值下的性能。第二阶段：深度学习 1. 深度学习...
vgg19训练图像分类，分成两类，这样训练出来的网络是否过拟合了？人工智能机器学习深度学习
2021-03-23 15:58

回答 3 已采纳 1k5左右就差不多了，过拟合的话不一定，要看下有没有防止过拟合的手段，如果你的项目是开源项目的话，一般会有这方面的限制的。
基于深度学习的舌苔检测毕设留档.zip
2023-09-30 11:26

同时，可能还会引入混淆矩阵来分析模型的错误类型，以便进一步改进。项目的代码库"Tongue_diagnosis-main"很可能包含了数据预处理脚本、模型定义、训练与评估代码，以及可能的可视化结果。这些代码对于理解整个...
AUTOSAR汽车电子嵌入式编程精讲300篇-基于深度学习的车载总线网络入侵检测（续）
2024-02-24 00:30

格图素书的博客其相应的特征图对比不难发现，时序图的波峰和波谷的变换对应着特征图中行和。卷积层用于提取图片中的特征，池化层主要用于降维，二者在...值矩阵将这些向量值与相应的类形成特征映射，对特征图起到分类的作用，称为。
没有解决我的问题, 去提问

问题事件

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
创建了问题 4月18日

悬赏问题

¥300 寻抓云闪付tn组成网页付款链接
¥15 请问Ubuntu要怎么安装chrome呀？
¥15 视频编码十六进制问题
¥15 Xsheii7我安装这个文件的时候跳出来另一个文件已锁定文件的无一部分进程无法访问。这个该怎么解决
¥15 unity terrain打包后地形错位，跟建筑不在同一个位置，怎么办
¥15 FileNotFoundError 解决方案
¥15 uniapp实现如下图的图表功能
¥15 u-subsection如何修改相邻两个节点样式
¥30 vs2010开发 WFP（windows filtering platform）
¥15 服务端控制goose报文控制块的发布问题

VGG16得到的混淆矩阵错误

4条回答 默认 最新

问题事件

悬赏问题

4条回答默认最新