使用yolo3识别车牌，输出的车牌号码和识别出的车牌号码不一致

问题遇到的现象和发生背景

问题相关代码，请勿粘贴截图

def __init__(self, **kwargs):
    self.__dict__.update(self._defaults)
    for name, value in kwargs.items():
        setattr(self, name, value)
        
    #---------------------------------------------------#
    #   获得种类和先验框的数量
    #---------------------------------------------------#
    self.class_names, self.num_classes  = get_classes(self.classes_path)
    self.anchors, self.num_anchors      = get_anchors(self.anchors_path)
    self.bbox_util                      = DecodeBox(self.anchors, self.num_classes, (self.input_shape[0], self.input_shape[1]), self.anchors_mask)

    #---------------------------------------------------#
    #   画框设置不同的颜色
    #---------------------------------------------------#
    hsv_tuples = [(x / self.num_classes, 1., 1.) for x in range(self.num_classes)]
    self.colors = list(map(lambda x: colorsys.hsv_to_rgb(*x), hsv_tuples))
    self.colors = list(map(lambda x: (int(x[0] * 255), int(x[1] * 255), int(x[2] * 255)), self.colors))
    self.generate()

#---------------------------------------------------#
#   生成模型
#---------------------------------------------------#
def generate(self):
    #---------------------------------------------------#
    #   建立yolov3模型，载入yolov3模型的权重
    #---------------------------------------------------#
    self.net    = YoloBody(self.anchors_mask, self.num_classes)
    device      = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
    self.net.load_state_dict(torch.load(self.model_path, map_location=device))
    self.net    = self.net.eval()
    print('{} model, anchors, and classes loaded.'.format(self.model_path))

    if self.cuda:
        self.net = nn.DataParallel(self.net)
        self.net = self.net.cuda()

#---------------------------------------------------#
#   检测图片
#---------------------------------------------------#
def detect_image(self, image):
    image_shape = np.array(np.shape(image)[0:2])
    #---------------------------------------------------------#
    #   在这里将图像转换成RGB图像，防止灰度图在预测时报错。
    #   代码仅仅支持RGB图像的预测，所有其它类型的图像都会转化成RGB
    #---------------------------------------------------------#
    image       = cvtColor(image)
    #---------------------------------------------------------#
    #   给图像增加灰条，实现不失真的resize
    #   也可以直接resize进行识别
    #---------------------------------------------------------#
    image_data  = resize_image(image, (self.input_shape[1],self.input_shape[0]), self.letterbox_image)
    #---------------------------------------------------------#
    #   添加上batch_size维度
    #---------------------------------------------------------#
    image_data  = np.expand_dims(np.transpose(preprocess_input(np.array(image_data, dtype='float32')), (2, 0, 1)), 0)

    with torch.no_grad():
        images = torch.from_numpy(image_data)
        if self.cuda:
            images = images.cuda()
        #---------------------------------------------------------#
        #   将图像输入网络当中进行预测！
        #---------------------------------------------------------#
        outputs = self.net(images)
        outputs = self.bbox_util.decode_box(outputs)
        #---------------------------------------------------------#
        #   将预测框进行堆叠，然后进行非极大抑制
        #---------------------------------------------------------#
        results = self.bbox_util.non_max_suppression(torch.cat(outputs, 1), self.num_classes, self.input_shape, 
                    image_shape, self.letterbox_image, conf_thres = self.confidence, nms_thres = self.nms_iou)
                                                
        if results[0] is None: 
            return image

        top_label   = np.array(results[0][:, 6], dtype = 'int32')
        top_conf    = results[0][:, 4] * results[0][:, 5]
        top_boxes   = results[0][:, :4]
    #---------------------------------------------------------#
    #   设置字体与边框厚度
    #---------------------------------------------------------#
    font        = ImageFont.truetype(font='model_data/simhei.ttf', size=30)
    thickness   = int(max((image.size[0]) // np.mean(self.input_shape), 1))
    
    #---------------------------------------------------------#
    #   图像绘制
    #---------------------------------------------------------#
    output_str = []  #
    data = []  #
    filename = 'resultof_CHEPAI.txt'  #
    mode = 'a+'  #
    for i, c in list(enumerate(top_label)):
        predicted_class = self.class_names[int(c)]
        box             = top_boxes[i]
        score           = top_conf[i]

        top, left, bottom, right = box

        top     = max(0, np.floor(top).astype('int32'))
        left    = max(0, np.floor(left).astype('int32'))
        bottom  = min(image.size[1], np.floor(bottom).astype('int32'))
        right   = min(image.size[0], np.floor(right).astype('int32'))

        label = '{}'.format(predicted_class)
        output_str.append(predicted_class)  #
        draw = ImageDraw.Draw(image)
        label_size = draw.textsize(label, font)
        label = label.encode('utf-8')
        #new_label=label.decode(encoding='utf-8')
        #print(new_label,end='')

        
       
       # filename = 'write_data.txt'
       # with open(filename,'a') as f: # 如果filename不存在会自动创建， 'w'表示写数据，写之前会清空文件中的原有数据！
       #     f.write(new_label)

        
        if top - label_size[1] >= 0:
            text_origin = np.array([left, top - label_size[1]])
        else:
            text_origin = np.array([left, top + 1])

        for i in range(thickness):
            draw.rectangle([left + i, top + i, right - i, bottom - i], outline=self.colors[c])
        draw.rectangle([tuple(text_origin), tuple(text_origin + label_size)], fill=self.colors[c])
        draw.text(text_origin, str(label,'UTF-8'), fill=(0, 0, 0), font=font)
        del draw
    output_str1 = output_str[1:6]  #
    output_str2 = output_str[6:8]  #
    output_str2.reverse()  #
    data0 = output_str2 + output_str1  #
    data = "".join(data0)  #
    print(data)  #
    with open(filename, mode=mode) as f:  #
        f.write(data + "\n")  #
        f.close()  #


    return image

def get_FPS(self, image, test_interval):
    image_shape = np.array(np.shape(image)[0:2])
    #---------------------------------------------------------#
    #   在这里将图像转换成RGB图像，防止灰度图在预测时报错。
    #   代码仅仅支持RGB图像的预测，所有其它类型的图像都会转化成RGB
    #---------------------------------------------------------#
    image       = cvtColor(image)
    #---------------------------------------------------------#
    #   给图像增加灰条，实现不失真的resize
    #   也可以直接resize进行识别
    #---------------------------------------------------------#
    image_data  = resize_image(image, (self.input_shape[1],self.input_shape[0]), self.letterbox_image)
    #---------------------------------------------------------#
    #   添加上batch_size维度
    #---------------------------------------------------------#
    image_data  = np.expand_dims(np.transpose(preprocess_input(np.array(image_data, dtype='float32')), (2, 0, 1)), 0)

    with torch.no_grad():
        images = torch.from_numpy(image_data)
        if self.cuda:
            images = images.cuda()
        #---------------------------------------------------------#
        #   将图像输入网络当中进行预测！
        #---------------------------------------------------------#
        outputs = self.net(images)
        outputs = self.bbox_util.decode_box(outputs)
        #---------------------------------------------------------#
        #   将预测框进行堆叠，然后进行非极大抑制
        #---------------------------------------------------------#
        results = self.bbox_util.non_max_suppression(torch.cat(outputs, 1), self.num_classes, self.input_shape, 
                    image_shape, self.letterbox_image, conf_thres=self.confidence, nms_thres=self.nms_iou)
                                                
    t1 = time.time()
    for _ in range(test_interval):
        with torch.no_grad():
            #---------------------------------------------------------#
            #   将图像输入网络当中进行预测！
            #---------------------------------------------------------#
            outputs = self.net(images)
            outputs = self.bbox_util.decode_box(outputs)
            #---------------------------------------------------------#
            #   将预测框进行堆叠，然后进行非极大抑制
            #---------------------------------------------------------#
            results = self.bbox_util.non_max_suppression(torch.cat(outputs, 1), self.num_classes, self.input_shape, 
                        image_shape, self.letterbox_image, conf_thres=self.confidence, nms_thres=self.nms_iou)
                        
    t2 = time.time()
    tact_time = (t2 - t1) / test_interval
    return tact_time

def get_map_txt(self, image_id, image, class_names, map_out_path):
    f = open(os.path.join(map_out_path, "detection-results/"+image_id+".txt"),"w") 
    image_shape = np.array(np.shape(image)[0:2])
    #---------------------------------------------------------#
    #   在这里将图像转换成RGB图像，防止灰度图在预测时报错。
    #   代码仅仅支持RGB图像的预测，所有其它类型的图像都会转化成RGB
    #---------------------------------------------------------#
    image       = cvtColor(image)
    #---------------------------------------------------------#
    #   给图像增加灰条，实现不失真的resize
    #   也可以直接resize进行识别
    #---------------------------------------------------------#
    image_data  = resize_image(image, (self.input_shape[1],self.input_shape[0]), self.letterbox_image)
    #---------------------------------------------------------#
    #   添加上batch_size维度
    #---------------------------------------------------------#
    image_data  = np.expand_dims(np.transpose(preprocess_input(np.array(image_data, dtype='float32')), (2, 0, 1)), 0)

    with torch.no_grad():
        images = torch.from_numpy(image_data)
        if self.cuda:
            images = images.cuda()
        #---------------------------------------------------------#
        #   将图像输入网络当中进行预测！
        #---------------------------------------------------------#
        outputs = self.net(images)
        outputs = self.bbox_util.decode_box(outputs)
        #---------------------------------------------------------#
        #   将预测框进行堆叠，然后进行非极大抑制
        #---------------------------------------------------------#
        results = self.bbox_util.non_max_suppression(torch.cat(outputs, 1), self.num_classes, self.input_shape, 
                    image_shape, self.letterbox_image, conf_thres = self.confidence, nms_thres = self.nms_iou)
                                                
        if results[0] is None: 
            return 

        top_label   = np.array(results[0][:, 6], dtype = 'int32')
        top_conf    = results[0][:, 4] * results[0][:, 5]
        top_boxes   = results[0][:, :4]

    for i, c in list(enumerate(top_label)):
        predicted_class = self.class_names[int(c)]
        box             = top_boxes[i]
        score           = str(top_conf[i])

        top, left, bottom, right = box
        if predicted_class not in class_names:
            continue

        f.write("%s %s %s %s %s %s\n" % (predicted_class, score[:6], str(int(left)), str(int(top)), str(int(right)),str(int(bottom))))

    f.close()
    return

运行结果及报错内容

我的解答思路和尝试过的方法

我想要达到的结果

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
於黾 2021-12-10 15:39
关注
top, left, bottom, right = box
f.write("%s %s %s %s %s %s\n" % (predicted_class, score[:6], str(int(left)), str(int(top)), str(int(right)),str(int(bottom))))
你这获取数据的顺序和你打印的顺序完全不一致啊

本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

基于深度学习的车牌识别算法，其中，车辆检测网络直接使用YOLO侦测.zip
2024-02-20 14:16

还取决于你选择的参数大小）），只需要提供你想检测的包含车牌的图片（任何位置、角度都可以），就可以得到图片中车牌位置的标记和车牌号码的输出，如果你有摄像头，你可以通过训练得到的模型去调用摄像头，通过...
基于YOLO的车牌识别.zip
2025-09-22 09:52

车牌识别是智能交通系统的关键组成部分，能够在实时监控场景中准确快速地检测和识别车牌号码，具有重要的实际应用价值。基于YOLO的车牌识别系统通常包括几个关键模块：图像采集、车牌定位、车牌字符分割、车牌识别...
利用Python实现车牌识别技术
2025-06-07 22:46

车牌识别系统的核心在于通过图像处理和模式识别技术自动提取车牌号码。Python结合OpenCV、Pillow等图像处理库，能够高效完成图像预处理工作，如灰度化、二值化、噪声去除等，从而提升识别准确率。在车牌识别流程中，...
车牌号码识别python+opencv
2018-12-11 09:23

车牌号码识别是计算机视觉领域中的一个重要应用，主要利用图像处理技术来自动提取并识别车辆的车牌信息。在Python中，OpenCV是一个强大的图像处理库，它提供了丰富的功能用于图像分析和处理，非常适合进行车牌识别。...
YOLOv5车牌和人脸识别+检测权重+标注好的数据集
2023-04-16 20:42

1、YOLOv5车牌和人脸识别，含训练好的检测权重，以及PR曲线，loss曲线等等，map达90% 多可检测出车牌的位置，司机脸部区域，是否戴口罩，但不能识别具体的车牌号码。附有一万张车牌人脸检测数据集，有下载链接，标签...
YOLOv7车牌和人脸识别+检测模型+数据集
2023-04-16 20:39

YOLOv7车牌和人脸识别，含训练好的检测权重，以及PR曲线，loss曲线等等，map达90% 多可检测出车牌的位置，司机脸部区域，是否戴口罩，但不能识别具体的车牌号码。附有一万张车牌人脸检测数据集，有下载链接，标签...
Python车牌检测识别代码（感觉还可以）
2019-06-04 18:31

Python车牌检测识别技术是计算机视觉领域的一个重要应用，主要用于自动识别车辆的车牌号码。这个压缩包包含了一些关键的文件和组件，使得我们可以构建一个基本的车牌识别系统。下面将详细介绍这些知识点。首先，`...
车辆识别_TensorRT_LPR_车牌检测_yolov3加_1741774541.zip
2025-03-12 20:40

YOLO（You Only Look Once）模型是一种流行的实时目标检测算法，它能够在图像中快速识别和定位多个对象，因而在车牌识别系统中被广泛采用。YOLOv3是该模型的第三个版本，具有更高的准确率和速度。在本项目中，我们...
车牌识别（基于yolov5）
2022-03-12 13:57

车牌识别技术是计算机视觉领域中的一个重要应用，它主要用于自动检测和识别车辆的车牌号码，广泛应用于交通监控、停车场管理、智能交通系统等多个场景。在这个项目中，我们利用了yolov5这一先进的深度学习框架来实现...
小白都能看懂的——车牌检测与识别(最新版YOLO26快速入门)
2026-02-07 22:42

水中加点糖的博客 YOLO26进行车牌定位与识别。目标检测快速入门，YOLO也太强了！
没有解决我的问题, 去提问

问题事件

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
系统已结题 12月18日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
已采纳回答 12月10日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
创建了问题 12月10日

使用yolo3识别车牌，输出的车牌号码和识别出的车牌号码不一致

问题遇到的现象和发生背景

问题相关代码，请勿粘贴截图

运行结果及报错内容

我的解答思路和尝试过的方法

我想要达到的结果

1条回答 默认 最新

问题事件

1条回答默认最新