lvsuying 2023-12-07 17:26
Question closed

Problem converting a YOLOv5 .pt model to a TensorRT engine

Please help me with a problem converting a YOLOv5 .pt model to a TensorRT engine.
The problem: when my images are annotated with 1 class and I train with YOLOv5, converting the .pt to .onnx and then to an engine works, and the engine runs fine.
For the .onnx conversion I use the original YOLOv5 export.py, with only the image size (imgsz) changed to 2400; everything else is essentially unmodified (roughly the command shown below).
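For reference, the export step was run roughly like this (a sketch; the weights path is a placeholder, the flags are the standard YOLOv5 export.py options):

python export.py --weights path/to/best.pt --include onnx --imgsz 2400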
For the engine conversion I use a piece of code someone else gave me.
When I annotate with 4 classes, converting the .pt to .onnx works, but converting to an engine fails with the following errors:

LOG[1]: /model.24/Add: elementwise inputs must have same dimensions or follow broadcast rules (input dimensions were [1,3,300,300,3] and [1,3,300,300,2]).
LOG[1]: /model.24/Add: elementwise inputs must have same dimensions or follow broadcast rules (input dimensions were [1,3,300,300,3] and [1,3,300,300,2]).
ERROR: onnx2trt_utils.cpp:680 In function elementwiseHelper:
[8] Assertion failed: tensor_ptr->getDimensions().nbDims == maxNbDims && "Failed to broadcast tensors elementwise!"
Parse failed.

When I annotate with 2 classes, converting the .pt to .onnx again works, but converting to an engine fails with the following error:
LOG[1]: /model.24/Reshape_1: volume mismatch. Input dimensions [1,3,300,300,6] have volume 1620000 and output dimensions [1,270000,7] have volume 1890000.
ERROR: onnx2trt_utils.cpp:188 In function convertAxis:
[8] Assertion failed: axis >= 0 && axis < nbDims
Parse failed.

The relevant code in export.py is as follows:

def export_onnx(model, im, file, opset, dynamic, simplify, prefix=colorstr('ONNX:')):
    # YOLOv5 ONNX export
    check_requirements('onnx>=1.12.0')
    import onnx

    LOGGER.info(f'\n{prefix} starting export with onnx {onnx.__version__}...')
    f = file.with_suffix('.onnx')

    output_names = ['output0', 'output1'] if isinstance(model, SegmentationModel) else ['output0']
    if dynamic:
        dynamic = {'images': {0: 'batch', 2: 'height', 3: 'width'}}  # shape(1,3,640,640)
        if isinstance(model, SegmentationModel):
            dynamic['output0'] = {0: 'batch', 1: 'anchors'}  # shape(1,25200,85)
            dynamic['output1'] = {0: 'batch', 2: 'mask_height', 3: 'mask_width'}  # shape(1,32,160,160)
        elif isinstance(model, DetectionModel):
            dynamic['output0'] = {0: 'batch', 1: 'anchors'}  # shape(1,25200,85)

    torch.onnx.export(
        model.cpu() if dynamic else model,  # --dynamic only compatible with cpu
        im.cpu() if dynamic else im,
        f,
        verbose=False,
        opset_version=opset,
        do_constant_folding=True,  # WARNING: DNN inference with torch>=1.12 may require do_constant_folding=False
        input_names=['images'],
        output_names=output_names,
        dynamic_axes=dynamic or None)

    # Checks
    model_onnx = onnx.load(f)  # load onnx model
    onnx.checker.check_model(model_onnx)  # check onnx model

    # Metadata
    d = {'stride': int(max(model.stride)), 'names': model.names}
    for k, v in d.items():
        meta = model_onnx.metadata_props.add()
        meta.key, meta.value = k, str(v)
    onnx.save(model_onnx, f)

    # Simplify
    if simplify:
        try:
            cuda = torch.cuda.is_available()
            check_requirements(('onnxruntime-gpu' if cuda else 'onnxruntime', 'onnx-simplifier>=0.4.1'))
            import onnxsim

            LOGGER.info(f'{prefix} simplifying with onnx-simplifier {onnxsim.__version__}...')
            model_onnx, check = onnxsim.simplify(model_onnx)
            assert check, 'assert check failed'
            onnx.save(model_onnx, f)
        except Exception as e:
            LOGGER.info(f'{prefix} simplifier failure: {e}')
    return f, model_onnx

Searching online suggests the export code needs to be rewritten, but I cannot find the place that needs modifying.
Could someone take a look and tell me how to fix this? Concrete modification steps would be best; I am a beginner and only know the basic operations.


12 answers

  • CyMylive. 2023-12-08 08:04

    The following answer was put together with help from GPT; please take it as a reference.
    First, convert the PyTorch model to ONNX format:

    import torch

    # Input image dimensions; placeholders that must match the size used for training/export
    height, width = 640, 640

    # Load the model (path is a placeholder).
    # Note: a YOLOv5 training checkpoint is typically a dict whose network is stored under the 'model' key.
    model = torch.load('path/to/your/pytorch/model.pth')

    # Set the model to inference mode
    model.eval()

    # Export the model to ONNX format
    dummy_input = torch.randn(1, 3, height, width)
    input_names = ['image']
    output_names = ['boxes', 'scores', 'classes']  # output tensor names; adjust to your model's actual outputs
    torch.onnx.export(model, dummy_input, 'path/to/output/onnx/model.onnx', verbose=True,
                      input_names=input_names, output_names=output_names)
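
    Before building the engine, it may help to sanity-check the exported ONNX file with onnxruntime. A minimal sketch (the file path and the 640x640 input size are placeholders and must match your export):

    import numpy as np
    import onnxruntime as ort

    # Load the exported model and run a dummy forward pass to verify it parses and executes
    session = ort.InferenceSession('path/to/output/onnx/model.onnx')
    input_name = session.get_inputs()[0].name
    dummy = np.random.rand(1, 3, 640, 640).astype(np.float32)
    outputs = session.run(None, {input_name: dummy})
    print([o.shape for o in outputs])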
    

    Then, convert the ONNX model to a TensorRT engine:

    import tensorrt as trt
    import pycuda.driver as cuda
    import pycuda.autoinit  # noqa: F401 - importing this creates the CUDA context
    import numpy as np

    TRT_LOGGER = trt.Logger(trt.Logger.WARNING)
    # ONNX models must be parsed into an explicit-batch network
    EXPLICIT_BATCH = 1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)

    def build_engine(onnx_file_path, engine_file_path="path/to/output/tensorrt/engine/tensorrt.engine"):
        """Takes an ONNX file and creates a TensorRT engine to run inference with"""
        with trt.Builder(TRT_LOGGER) as builder, \
                builder.create_network(EXPLICIT_BATCH) as network, \
                trt.OnnxParser(network, TRT_LOGGER) as parser:
            builder.max_workspace_size = 1 << 30  # 1GB (TensorRT 7 API; TensorRT 8+ uses a builder config instead)
            # Parse the ONNX file to populate the TensorRT network
            with open(onnx_file_path, 'rb') as model:
                if not parser.parse(model.read()):
                    # Print every parser error so the failing layer is visible
                    for i in range(parser.num_errors):
                        print(parser.get_error(i))
                    raise RuntimeError('Failed to parse the ONNX file')
            # Generate the TensorRT engine optimized for the target platform
            engine = builder.build_cuda_engine(network)
            with open(engine_file_path, "wb") as f:
                f.write(engine.serialize())
            return engine

    class HostDeviceMem:
        """Pairs a pagelocked host buffer with its device buffer"""
        def __init__(self, host_mem, device_mem):
            self.host = host_mem
            self.device = device_mem

    def allocate_buffers(engine):
        """Allocates host and device memory for every input and output binding"""
        inputs, outputs, bindings = [], [], []
        stream = cuda.Stream()
        for binding in engine:
            size = trt.volume(engine.get_binding_shape(binding))
            dtype = trt.nptype(engine.get_binding_dtype(binding))
            # Allocate pagelocked host memory and matching device memory
            host_mem = cuda.pagelocked_empty(size, dtype)
            device_mem = cuda.mem_alloc(host_mem.nbytes)
            bindings.append(int(device_mem))
            # Append to the appropriate list
            if engine.binding_is_input(binding):
                inputs.append(HostDeviceMem(host_mem, device_mem))
            else:
                outputs.append(HostDeviceMem(host_mem, device_mem))
        return inputs, outputs, bindings, stream

    def do_inference(context, inputs, outputs, bindings, stream):
        """Runs inference on a TensorRT execution context"""
        # Transfer input data to the GPU
        for inp in inputs:
            cuda.memcpy_htod_async(inp.device, inp.host, stream)
        # Run inference
        context.execute_async_v2(bindings=bindings, stream_handle=stream.handle)
        # Transfer predictions back from the GPU
        for out in outputs:
            cuda.memcpy_dtoh_async(out.host, out.device, stream)
        # Synchronize the stream
        stream.synchronize()
        # Return the output predictions (flat host arrays)
        return [out.host for out in outputs]
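
    A quick usage sketch for the helpers above (paths are placeholders): build the engine once from the exported ONNX file, then reuse the serialized file afterwards.

    # Build the engine from the exported ONNX model and save it to disk
    engine = build_engine('path/to/output/onnx/model.onnx',
                          'path/to/output/tensorrt/engine/tensorrt.engine')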
    

    Finally, you can load your TensorRT engine and run inference:

    import cv2

    # Load the serialized TensorRT engine (path is a placeholder)
    engine_file_path = "path/to/output/tensorrt/engine/tensorrt.engine"
    with open(engine_file_path, "rb") as f, trt.Runtime(TRT_LOGGER) as runtime:
        engine = runtime.deserialize_cuda_engine(f.read())
    context = engine.create_execution_context()

    # Allocate memory for input and output buffers
    inputs, outputs, bindings, stream = allocate_buffers(engine)

    # Load the input image
    image = cv2.imread("path/to/your/input/image.jpg")

    # Preprocess the input image (e.g. resize, normalize, etc.) and copy it into inputs[0].host
    # (see the sketch below)

    # Run inference
    output_data = do_inference(context, inputs, outputs, bindings, stream)

    # Postprocess the output data (e.g. decode bounding boxes, apply NMS, etc.)

    I hope the code above helps you convert your YOLOv5 PyTorch model to a TensorRT engine and run inference.

    This answer was selected by the asker as the best answer.


Question timeline

  • Closed by the system on December 20
  • Answer accepted on December 12
  • Question created on December 7