deformable detr怎么直接使用GitHub上训练好的保存的那个权重的文件来进行物体检测，非常想得到你的回复

我想请问一下，deformable detr怎么直接使用GitHub上训练好的保存的那个权重的文件来进行物体检测，谢谢，非常想得到你的回复

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除
收藏举报

1条回答默认最新

Alaso_soso 2022-06-26 14:14

关注

编写一个detect.py文件，使用预训练模型。
https://www.jianshu.com/p/b364534fd0a7
上面时原作者的内容，可以进行参考，感觉很不错，代码可能需要改一点点，不多，很简单，希望可以帮到你


```python
import cv2
from PIL import Image
import numpy as np
import os
import time

import torch
from torch import nn
# from torchvision.models import resnet50
import torchvision.transforms as T
from main import get_args_parser as get_main_args_parser
from models import build_model

torch.set_grad_enabled(False)

device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
print("[INFO] 当前使用{}做推断".format(device))

# 图像数据处理
transform = T.Compose([
    T.Resize(800),
    T.ToTensor(),
    T.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225])
])


# 将xywh转xyxy
def box_cxcywh_to_xyxy(x):
    x_c, y_c, w, h = x.unbind(1)
    b = [(x_c - 0.5 * w), (y_c - 0.5 * h),
         (x_c + 0.5 * w), (y_c + 0.5 * h)]
    return torch.stack(b, dim=1)


# 将0-1映射到图像
def rescale_bboxes(out_bbox, size):
    img_w, img_h = size
    b = box_cxcywh_to_xyxy(out_bbox)
    b = b.cpu().numpy()
    b = b * np.array([img_w, img_h, img_w, img_h], dtype=np.float32)
    return b


# plot box by opencv
def plot_result(pil_img, prob, boxes, save_name=None, imshow=False, imwrite=False):
    LABEL = ['all','hat', 'person', 'groundrod', 'vest', 'workclothes_clothes', 'workclothes_trousers', 'winter_clothes',
             'winter_trousers', 'noworkclothes_clothes', 'noworkclothes_trousers', 'height', 'safteybelt', 'smoking',
             'noheight', 'fire', 'extinguisher', 'roll_workclothes', 'roll_noworkclothes', 'insulating_gloves', 'car',
             'fence', 'bottle', 'shorts', 'holes', 'single_ladder', 'down', 'double_ladder', 'oxygen_horizontally',
             'oxygen_vertically', 'acetylene_vertically', 'acetylene_horizontally']

    len(prob)
    opencvImage = cv2.cvtColor(np.array(pil_img), cv2.COLOR_RGB2BGR)


    if len(prob) == 0:
        print("[INFO] NO box detect !!! ")
        if imwrite:
            if not os.path.exists("./result/pred_no"):
                os.makedirs("./result/pred_no")
            cv2.imwrite(os.path.join("./result/pred_no", save_name), opencvImage)
        return

    for p, (xmin, ymin, xmax, ymax) in zip(prob, boxes):
        cl = p.argmax()
        label_text = '{}: {}%'.format(LABEL[cl], round(p[cl] * 100, 2))

        cv2.rectangle(opencvImage, (int(xmin), int(ymin)), (int(xmax), int(ymax)), (255, 255, 0), 2)
        cv2.putText(opencvImage, label_text, (int(xmin) + 10, int(ymin) + 30), cv2.FONT_HERSHEY_SIMPLEX, 1,
                    (255, 255, 0), 2)

    if imshow:
        cv2.imshow('detect', opencvImage)
        cv2.waitKey(0)

    if imwrite:
        if not os.path.exists("./result/pred"):
            os.makedirs('./result/pred')
        cv2.imwrite('./result/pred/{}'.format(save_name), opencvImage)

def load_model(model_path , args):

    model, _, _ = build_model(args)
    model.cuda()
    model.eval()
    state_dict = torch.load(model_path)  # <-----------修改加载模型的路径
    model.load_state_dict(state_dict["model"])
    model.to(device)
    print("load model sucess")
    return model

# 单张图像的推断
def detect(im, model, transform, prob_threshold=0.7):
    # mean-std normalize the input image (batch-size: 1)
    img = transform(im).unsqueeze(0)

    # demo model only support by default images with aspect ratio between 0.5 and 2
    # if you want to use images with an aspect ratio outside this range
    # rescale your image so that the maximum size is at most 1333 for best results
    
    #assert img.shape[-2] <= 1600 and img.shape[
    #                                     -1] <= 1600, 'demo model only supports images up to 1600 pixels on each side'

    # propagate through the model
    img = img.to(device)
    start = time.time()
    outputs = model(img)
    #end = time.time()
    # keep only predictions with 0.7+ confidence
    # print(outputs['pred_logits'].softmax(-1)[0, :, :-1])
    probas = outputs['pred_logits'].softmax(-1)[0, :, :-1]
    keep = probas.max(-1).values > prob_threshold
    #end = time.time()

    probas = probas.cpu().detach().numpy()
    keep = keep.cpu().detach().numpy()

    # convert boxes from [0; 1] to image scales
    bboxes_scaled = rescale_bboxes(outputs['pred_boxes'][0, keep], im.size)
    end = time.time()
    return probas[keep], bboxes_scaled, end - start


if __name__ == "__main__":
    
    main_args = get_main_args_parser().parse_args()
#加载模型
    dfdetr = load_model('exps/r50_deformable_detr/checkpoint0049.pth',main_args)

    files = os.listdir("coco/testdata/test2017")

    cn = 0
    waste=0
    for file in files:
        img_path = os.path.join("coco/testdata/test2017", file)
        im = Image.open(img_path)

        scores, boxes, waste_time = detect(im, dfdetr, transform)
        plot_result(im, scores, boxes, save_name=file, imshow=False, imwrite=True)
        print("{} [INFO] {} time: {} done!!!".format(cn,file, waste_time))

        cn+=1
        waste+=waste_time
    waste_avg = waste/cn
    print(waste_avg)

```

本回答被题主选为最佳回答 , 对您是否有帮助呢?

报告相同问题？

关注问题

Deformable DETR 实战（训练及预测）
2022-07-23 20:56

dystsp的博客 Deformable DETR的训练及预测
DEFORMABLE DETR: DEFORMABLE TRANSFORMERS FOR END-TO-END OBJECT DETECTION——用于端到端物体检测的可变形注意力机制
2023-09-26 11:26

Joney Feng的博客 0.摘要 1.引言 2.相关工作 3.回顾Transformers和DETR...4.2.关于Deformable DETR模型的补充改进和变体 5.实验 5.1.与DETR的比较 5.2.关于可变形注意力的消融研究 5.3.与当前主流方法的比较 6.结论 7.参考文献
【Deformable DETR 论文+源码解读】Deformable Transformers for End-to-End Object Detection
2022-11-21 11:33

满船清梦压星河HK的博客 deformable detr
【DETR】2、Deformable DETR | 使用多尺度可变形 attention 的方式来解决 DETR 收敛慢和小目标不好的问题
2021-10-26 18:32

呆呆的猫的博客本文主要介绍 Deformable DETR
目标检测：Deformable DETR: Deformable Transformers for End-to-End Object Detection【方法解读】
2024-07-14 20:03

沉浸式AI的博客 Deformable DETR: Deformable Transformers for End-to-End Object Detection论文方法解读
mmdetection训练detr或deformable detr时mAP为0的问题
2024-12-18 19:10

masterMono的博客，上下载mmdetection官方提供的detr-r50预训练权重，修改配置文件加载它。也有人说是num_classes设置不对，我正确设置后也未能解决。数据集设置的也没有问题，刚刚还跑过SSD和YOLOX。出现图中情况，即使为第一轮也不...
PaddlePaddle镜像能否运行Deformable DETR做目标检测？
2025-12-27 00:58

13572025090的博客 Deformable DETR凭借高效注意力机制成为目标检测新利器，结合PaddlePaddle官方Docker镜像与PaddleDetection工具库，可实现开箱即用的训练与部署。从环境搭建、模型配置到推理导出，全流程已高度封装，兼顾性能与工程...
Deformable DETR源码——超详细图解
2024-03-22 14:44

凯尔哥的博客网上也有很多大佬讲deformable DETR 的原理和源码，但是我觉得纸上得来终觉浅，还是要自己一行代码一行代码地 debug，才能理解得更加透彻。于是有了这篇博客，方便自己后续复习。
【论文带读（2）】《Deformable DETR: Deformable Transformers for End-to-End Object Detection》详细带读+笔记+翻译
2025-02-23 19:11

CyanM_的博客自从DETR被提出，告诉大家我们可以把Transformer引入目标检测里，许多新的...今天我们一起来逐行精读的论文《Deformable DETR: Deformable Transformers for End-to-End Object Detection》就是在DETR的基础上改进的。
Deformable DETR
2025-01-30 21:50

麦麦Max的博客问题：DETR 缺陷：收敛速度慢；小物体检测性能低解决方案：可变形注意力模块：仅关注参考点周围的一小部分关键采样点；支持多尺度融合，无需依赖FPN等金字塔网络。
Deformable DETR要点解读
2021-07-13 12:23

Nick Blog的博客最近整理Transformer和set prediction相关的检测&实例分割文章，感兴趣的可以跟一下： DETR: End-to-End Object Detection with Transformers Deformable DETR Rethinking Transformer-based Set Prediction ...
论文阅读：Deformable DETR: Deformable Transformers for End-to-End Object Detection
2024-07-28 21:52

fishfuck的博客最近提出了DETR，以消除对象检测中对许多手工设计组件的需求，同时表现出...Deformable DETR可以在训练时间减少10倍的情况下实现比DETR更好的性能（尤其是在小物体上）。对COCO基准的广泛实验证明了我们方法的有效性。
DEFORMABLE DETR学习笔记
2022-11-16 21:13

麻花地的博客 DETR最近被提出，以消除在目标检测中需要许多手工设计的组件，同时...Deformable 的DETR可以比DETR(特别是在小物体上)获得更好的性能，且训练时间少10倍。在COCO基准上的大量实验证明了我们方法的有效性。代码发布在。
Deformable DETR: DEFORMABLE TRANSFORMERSFOR END-TO-END OBJECT DETECTION（论文阅读）
2021-11-21 16:04

酉意铭的博客 Deformable DETR 是商汤Jifeng Dai 团队于2021年发表在ICLR 上的文章，是针对Detr 的改进。论文：《DEFORMABLE DETR: DEFORMABLE TRANSFORMERS FOR END-TO-END OBJECT DETECTION》论文链接：...
DEFORMABLE DETR 论文精度，并解析网络模型结构
2022-07-21 22:27

您先生的博客 DETR最近被提出以消除在目标检测中对许多手工设计的组件的...DeformableDETR在比DETR少10倍的训练次数下可以获得比DETR(尤其是在小物体上)更好的性能。在COCO基准测试集上的大量实验证明了我们方法的有效性。https。...
【目标检测】手把手教你如何使用MMDetection训练自己的数据集
2024-09-04 12:10

好喜欢吃红柚子的博客参考： MMDetection全流程实战指南：手把手带你构建目标检测模型 2. 安装GPU版本的PyTorch 这里如果安装失败了需要去官网 pytorch官网找对应的版本下载；先输入nvidia-smi命令查看可下载的cuda的最高版本我的可...
【VLMs篇】11：用于端到端目标检测的可变形Transformers(Deformable DETR)
2025-12-25 13:56

J_Xiong0117的博客 DETR 作为端到端目标检测器，虽消除了手工设计组件，但面临收敛慢（500 epochs）和小目标检测差...实验表明，Deformable DETR 在 COCO 基准上仅需 50 个 epoch（10倍加速）即可超越 DETR，且大幅提升了小目标检测精度。
像教女朋友一样的Deformable DETR论文精度+代码详解
2024-07-02 18:19

CV视觉的博客关于Deformable DETR的通俗讲解。
CO-DETR利用coco数据集训练和推理过程
2024-06-28 19:59

多喝开水少熬夜的博客环境：PyTorch 1.11.0 Python 3.8(ubuntu20.04) Cuda 11.3先是在github上下载CO-DETR模型!然后加载所需库!安装mmcv等（注意mmcv应该是1.6.1版本及以上）!!!因为出现了mmdetection 报错 TypeError: FormatCode() got ...
没有解决我的问题, 去提问

问题事件

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
已结题（查看结题原因） 6月26日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
已采纳回答 6月26日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
创建了问题 6月25日

deformable detr怎么直接使用GitHub上训练好的保存的那个权重的文件来进行物体检测，非常想得到你的回复

1条回答 默认 最新

问题事件

1条回答默认最新