How do I replace the NMS in YOLOv5 with DIoU-NMS?

This is the original NMS code:

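import time

import torch
import torchvision


# Note: xywh2xyxy and box_iou are helpers defined elsewhere in the YOLOv5 repo.
def non_max_suppression(prediction, conf_thres=0.25, iou_thres=0.45, classes=None, agnostic=False, multi_label=False,
                        labels=()):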
    """Runs Non-Maximum Suppression (NMS) on inference results

    Returns:
         list of detections, one (n,6) tensor per image [xyxy, conf, cls]
    """

    nc = prediction.shape[2] - 5  # number of classes
    xc = prediction[..., 4] > conf_thres  # candidates

    # Settings
    min_wh, max_wh = 2, 4096  # (pixels) minimum and maximum box width and height
    max_det = 300  # maximum number of detections per image
    max_nms = 30000  # maximum number of boxes into torchvision.ops.nms()
    time_limit = 10.0  # seconds to quit after
    redundant = True  # require redundant detections
    multi_label &= nc > 1  # multiple labels per box (adds 0.5ms/img)
    merge = False  # use merge-NMS

    t = time.time()
    output = [torch.zeros((0, 6), device=prediction.device)] * prediction.shape[0]
    for xi, x in enumerate(prediction):  # image index, image inference
        # Apply constraints
        # x[((x[..., 2:4] < min_wh) | (x[..., 2:4] > max_wh)).any(1), 4] = 0  # width-height
        x = x[xc[xi]]  # confidence

        # Cat apriori labels if autolabelling
        if labels and len(labels[xi]):
            l = labels[xi]
            v = torch.zeros((len(l), nc + 5), device=x.device)
            v[:, :4] = l[:, 1:5]  # box
            v[:, 4] = 1.0  # conf
            v[range(len(l)), l[:, 0].long() + 5] = 1.0  # cls
            x = torch.cat((x, v), 0)

        # If none remain process next image
        if not x.shape[0]:
            continue

        # Compute conf
        x[:, 5:] *= x[:, 4:5]  # conf = obj_conf * cls_conf

        # Box (center x, center y, width, height) to (x1, y1, x2, y2)
        box = xywh2xyxy(x[:, :4])

        # Detections matrix nx6 (xyxy, conf, cls)
        if multi_label:
            i, j = (x[:, 5:] > conf_thres).nonzero(as_tuple=False).T
            x = torch.cat((box[i], x[i, j + 5, None], j[:, None].float()), 1)
        else:  # best class only
            conf, j = x[:, 5:].max(1, keepdim=True)
            x = torch.cat((box, conf, j.float()), 1)[conf.view(-1) > conf_thres]

        # Filter by class
        if classes is not None:
            x = x[(x[:, 5:6] == torch.tensor(classes, device=x.device)).any(1)]

        # Apply finite constraint
        # if not torch.isfinite(x).all():
        #     x = x[torch.isfinite(x).all(1)]

        # Check shape
        n = x.shape[0]  # number of boxes
        if not n:  # no boxes
            continue
        elif n > max_nms:  # excess boxes
            x = x[x[:, 4].argsort(descending=True)[:max_nms]]  # sort by confidence

        # Batched NMS
        c = x[:, 5:6] * (0 if agnostic else max_wh)  # classes
        boxes, scores = x[:, :4] + c, x[:, 4]  # boxes (offset by class), scores
        i = torchvision.ops.nms(boxes, scores, iou_thres)  # NMS
        if i.shape[0] > max_det:  # limit detections
            i = i[:max_det]
        if merge and (1 < n < 3E3):  # Merge NMS (boxes merged using weighted mean)
            # update boxes as boxes(i,4) = weights(i,n) * boxes(n,4)
            iou = box_iou(boxes[i], boxes) > iou_thres  # iou matrix
            weights = iou * scores[None]  # box weights
            x[i, :4] = torch.mm(weights, x[:, :4]).float() / weights.sum(1, keepdim=True)  # merged boxes
            if redundant:
                i = i[iou.sum(1) > 1]  # require redundancy

        output[xi] = x[i]
        if (time.time() - t) > time_limit:
            print(f'WARNING: NMS time limit {time_limit}s exceeded')
            break  # time limit exceeded

    return output
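
The merge branch near the end of this function is skipped in the code as posted, because merge = False. As a rough illustration of what it would compute if enabled, the sketch below applies the same weighted-mean step to three hand-made boxes. The tensors and the local `pairwise_iou` helper are made up for this example (the original calls the repo's `box_iou`), and the kept indices are hardcoded rather than produced by NMS.

```python
import torch


def pairwise_iou(a, b):
    """IoU matrix between boxes a (m, 4) and b (n, 4), both in xyxy format."""
    lt = torch.max(a[:, None, :2], b[None, :, :2])  # top-left of intersection
    rb = torch.min(a[:, None, 2:], b[None, :, 2:])  # bottom-right of intersection
    inter = (rb - lt).clamp(0).prod(2)
    area_a = (a[:, 2] - a[:, 0]) * (a[:, 3] - a[:, 1])
    area_b = (b[:, 2] - b[:, 0]) * (b[:, 3] - b[:, 1])
    return inter / (area_a[:, None] + area_b[None] - inter)


# Hypothetical data: two overlapping boxes and one isolated box.
boxes = torch.tensor([[0., 0., 10., 10.],
                      [2., 0., 12., 10.],
                      [50., 50., 60., 60.]])
scores = torch.tensor([0.9, 0.6, 0.8])
iou_thres = 0.45
i = torch.tensor([0, 2])  # indices kept by NMS, hardcoded for this example

# Same three steps as the merge branch in non_max_suppression.
iou = pairwise_iou(boxes[i], boxes) > iou_thres  # (2, 3) boolean overlap mask
weights = iou * scores[None]                     # score of every overlapping box
merged = torch.mm(weights, boxes) / weights.sum(1, keepdim=True)

# With redundant = True, only kept boxes that overlap at least one other box survive.
redundant_keep = i[iou.sum(1) > 1]

print(merged)
print(redundant_keep)
```

Here the first kept box is pulled toward its lower-scoring neighbour, the isolated box is unchanged, and the redundancy filter would then drop the isolated box.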

The DIoU code:

import math

import torch


def bbox_iou(box1, box2, x1y1x2y2=True, GIoU=False, DIoU=False, CIoU=False, eps=1e-7):
    # Returns the IoU of box1 to box2. box1 is 4, box2 is nx4
    box2 = box2.T

    # Get the coordinates of bounding boxes
    if x1y1x2y2:  # x1, y1, x2, y2 = box1
        b1_x1, b1_y1, b1_x2, b1_y2 = box1[0], box1[1], box1[2], box1[3]
        b2_x1, b2_y1, b2_x2, b2_y2 = box2[0], box2[1], box2[2], box2[3]
    else:  # transform from xywh to xyxy
        b1_x1, b1_x2 = box1[0] - box1[2] / 2, box1[0] + box1[2] / 2
        b1_y1, b1_y2 = box1[1] - box1[3] / 2, box1[1] + box1[3] / 2
        b2_x1, b2_x2 = box2[0] - box2[2] / 2, box2[0] + box2[2] / 2
        b2_y1, b2_y2 = box2[1] - box2[3] / 2, box2[1] + box2[3] / 2

    # Intersection area
    inter = (torch.min(b1_x2, b2_x2) - torch.max(b1_x1, b2_x1)).clamp(0) * \
            (torch.min(b1_y2, b2_y2) - torch.max(b1_y1, b2_y1)).clamp(0)

    # Union Area
    w1, h1 = b1_x2 - b1_x1, b1_y2 - b1_y1 + eps
    w2, h2 = b2_x2 - b2_x1, b2_y2 - b2_y1 + eps
    union = w1 * h1 + w2 * h2 - inter + eps

    iou = inter / union
    if GIoU or DIoU or CIoU:
        cw = torch.max(b1_x2, b2_x2) - torch.min(b1_x1, b2_x1)  # convex (smallest enclosing box) width
        ch = torch.max(b1_y2, b2_y2) - torch.min(b1_y1, b2_y1)  # convex height
        if CIoU or DIoU:  # Distance or Complete IoU https://arxiv.org/abs/1911.08287v1
            c2 = cw ** 2 + ch ** 2 + eps  # convex diagonal squared
            rho2 = ((b2_x1 + b2_x2 - b1_x1 - b1_x2) ** 2 +
                    (b2_y1 + b2_y2 - b1_y1 - b1_y2) ** 2) / 4  # center distance squared
            if DIoU:
                return iou - rho2 / c2  # DIoU
            elif CIoU:  # https://github.com/Zzh-tju/DIoU-SSD-pytorch/blob/master/utils/box/box_utils.py#L47
                v = (4 / math.pi ** 2) * torch.pow(torch.atan(w2 / h2) - torch.atan(w1 / h1), 2)
                with torch.no_grad():
                    alpha = v / (v - iou + (1 + eps))
                return iou - (rho2 / c2 + v * alpha)  # CIoU
        else:  # GIoU https://arxiv.org/pdf/1902.09630.pdf
            c_area = cw * ch + eps  # convex area
            return iou - (c_area - union) / c_area  # GIoU
    else:
        return iou  # IoU
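
To make the DIoU=True branch concrete, here is a small worked example in plain Python (no PyTorch) for two boxes in xyxy format. The helper name `diou_xyxy` is hypothetical, and the eps terms from the function above are left out for readability, so degenerate zero-area boxes are not handled.

```python
def diou_xyxy(a, b):
    """Return (IoU, DIoU) for two boxes given as (x1, y1, x2, y2)."""
    # Intersection area
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    # Union area
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    iou = inter / union
    # Squared diagonal of the smallest enclosing box
    cw = max(a[2], b[2]) - min(a[0], b[0])
    ch = max(a[3], b[3]) - min(a[1], b[1])
    c2 = cw ** 2 + ch ** 2
    # Squared distance between the two box centers
    rho2 = ((b[0] + b[2] - a[0] - a[2]) ** 2 + (b[1] + b[3] - a[1] - a[3]) ** 2) / 4
    return iou, iou - rho2 / c2


# A 20x10 box and a 10x10 box sharing the same top-left corner:
# IoU = 100 / 200 = 0.5, rho2 = 25, c2 = 500, so DIoU = 0.5 - 25/500 = 0.45
iou, diou = diou_xyxy((0, 0, 20, 10), (0, 0, 10, 10))
print(iou, diou)
```

The center-distance penalty lowers the score of boxes whose centers are far apart, which is why a DIoU-based NMS can keep a box that a plain IoU threshold would suppress.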

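Question 1: The original code does not use weighted NMS, right? merge=False, so if I change merge to True, it should use merge (weighted) NMS.

Question 2: How do I replace NMS with DIoU-NMS? It needs to actually run. Code is enough, with a brief explanation.

Question 3: Are the other NMS variants swapped in the same way as DIoU?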

Answer from 行*云 (2021-04-15 08:58):
See https://github.com/Zzh-tju/yolov5, which includes comparison figures. The Ultralytics version uses GIoU by default.
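
For readers who want something runnable without pulling in another fork, below is a minimal sketch of a greedy DIoU-NMS written in plain PyTorch. It is not taken from the linked repository. The function names `diou_one_to_many` and `diou_nms` are hypothetical, boxes are assumed to be in xyxy format, and the DIoU formula mirrors the DIoU branch of the bbox_iou function posted in the question.

```python
import torch


def diou_one_to_many(box, boxes, eps=1e-7):
    """DIoU between one box (4,) and many boxes (n, 4), all in xyxy format."""
    # Intersection and union
    iw = (torch.min(box[2], boxes[:, 2]) - torch.max(box[0], boxes[:, 0])).clamp(0)
    ih = (torch.min(box[3], boxes[:, 3]) - torch.max(box[1], boxes[:, 1])).clamp(0)
    inter = iw * ih
    area1 = (box[2] - box[0]) * (box[3] - box[1])
    area2 = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])
    iou = inter / (area1 + area2 - inter + eps)
    # Squared diagonal of the smallest enclosing box
    cw = torch.max(box[2], boxes[:, 2]) - torch.min(box[0], boxes[:, 0])
    ch = torch.max(box[3], boxes[:, 3]) - torch.min(box[1], boxes[:, 1])
    c2 = cw ** 2 + ch ** 2 + eps
    # Squared distance between box centers
    rho2 = ((boxes[:, 0] + boxes[:, 2] - box[0] - box[2]) ** 2 +
            (boxes[:, 1] + boxes[:, 3] - box[1] - box[3]) ** 2) / 4
    return iou - rho2 / c2


def diou_nms(boxes, scores, iou_thres=0.45):
    """Greedy NMS that suppresses on DIoU instead of IoU; returns kept indices."""
    order = scores.argsort(descending=True)
    keep = []
    while order.numel() > 0:
        i = order[0]  # highest-scoring remaining box
        keep.append(i.item())
        if order.numel() == 1:
            break
        rest = order[1:]
        diou = diou_one_to_many(boxes[i], boxes[rest])
        order = rest[diou <= iou_thres]  # drop boxes too similar to box i
    return torch.tensor(keep, dtype=torch.long, device=boxes.device)


# Hypothetical boxes: the second overlaps the first heavily, the third is isolated.
boxes = torch.tensor([[0., 0., 10., 10.],
                      [1., 1., 11., 11.],
                      [50., 50., 60., 60.]])
scores = torch.tensor([0.9, 0.8, 0.7])
keep = diou_nms(boxes, scores, iou_thres=0.45)
print(keep.tolist())  # [0, 2]
```

To try it in the posted non_max_suppression, the line `i = torchvision.ops.nms(boxes, scores, iou_thres)` would become `i = diou_nms(boxes, scores, iou_thres)`; the class offset added to the boxes still works, since boxes of different classes end up far apart and get a negative DIoU. Inside the YOLOv5 repo, the helper could probably be replaced by `bbox_iou(boxes[i], boxes[rest], DIoU=True)` from the question. Because the loop runs in Python, expect it to be slower than torchvision's compiled NMS. Other variants such as GIoU-NMS or CIoU-NMS can presumably be swapped in the same way by changing the metric, though this sketch has not been checked against the linked repository.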

The asker selected this answer as the best answer.
