openvino ir对象检测模型怎么实现摄像头实时检测

问题遇到的现象和发生背景

如题，目前已经写出了检测单张图片的Python程序，但是这个程序要怎么样才能改成'cv2.VideoCapture'这样使用摄像头实时监测的形式呢？有没有朋友能够帮帮忙啊？

问题相关代码，请勿粘贴截图

from __future__ import print_function

import logging as log
import os
import pathlib
import json
import cv2
import numpy as np
from openvino.inference_engine import IENetwork, IECore
import torch
import torchvision
import time



def xywh2xyxy(x):
    # Convert nx4 boxes from [x, y, w, h] to [x1, y1, x2, y2] where xy1=top-left, xy2=bottom-right
    y = torch.zeros_like(x) if isinstance(x, torch.Tensor) else np.zeros_like(x)
    y[:, 0] = x[:, 0] - x[:, 2] / 2  # top left x
    y[:, 1] = x[:, 1] - x[:, 3] / 2  # top left y
    y[:, 2] = x[:, 0] + x[:, 2] / 2  # bottom right x
    y[:, 3] = x[:, 1] + x[:, 3] / 2  # bottom right y
    return y


def non_max_suppression(prediction, conf_thres=0.1, iou_thres=0.6, merge=False, classes=None, agnostic=False):
    """Performs Non-Maximum Suppression (NMS) on inference results

    Returns:
         detections with shape: nx6 (x1, y1, x2, y2, conf, cls)
    """
    prediction=torch.from_numpy(prediction)
    if prediction.dtype is torch.float16:
        prediction = prediction.float()  # to FP32

    nc = prediction[0].shape[1] - 5  # number of classes
    xc = prediction[..., 4] > conf_thres  # candidates

    # Settings
    min_wh, max_wh = 2, 4096  # (pixels) minimum and maximum box width and height
    max_det = 300  # maximum number of detections per image
    time_limit = 10.0  # seconds to quit after
    redundant = True  # require redundant detections
    multi_label = nc > 1  # multiple labels per box (adds 0.5ms/img)

    t = time.time()
    output = [None] * prediction.shape[0]
    for xi, x in enumerate(prediction):  # image index, image inference
        # Apply constraints
        # x[((x[..., 2:4] < min_wh) | (x[..., 2:4] > max_wh)).any(1), 4] = 0  # width-height
        x = x[xc[xi]]  # confidence

        # If none remain process next image
        if not x.shape[0]:
            continue

        # Compute conf
        x[:, 5:] *= x[:, 4:5]  # conf = obj_conf * cls_conf

        # Box (center x, center y, width, height) to (x1, y1, x2, y2)
        box = xywh2xyxy(x[:, :4])

        # Detections matrix nx6 (xyxy, conf, cls)
        if multi_label:
            i, j = (x[:, 5:] > conf_thres).nonzero(as_tuple=False).T
            x = torch.cat((box[i], x[i, j + 5, None], j[:, None].float()), 1)
        else:  # best class only
            conf, j = x[:, 5:].max(1, keepdim=True)
            x = torch.cat((box, conf, j.float()), 1)[conf.view(-1) > conf_thres]

        # Filter by class
        if classes:
            x = x[(x[:, 5:6] == torch.tensor(classes, device=x.device)).any(1)]

        # Apply finite constraint
        # if not torch.isfinite(x).all():
        #     x = x[torch.isfinite(x).all(1)]

        # If none remain process next image
        n = x.shape[0]  # number of boxes
        if not n:
            continue

        # Sort by confidence
        # x = x[x[:, 4].argsort(descending=True)]

        # Batched NMS
        c = x[:, 5:6] * (0 if agnostic else max_wh)  # classes
        boxes, scores = x[:, :4] + c, x[:, 4]  # boxes (offset by class), scores
        i = torchvision.ops.boxes.nms(boxes, scores, iou_thres)
        if i.shape[0] > max_det:  # limit detections
            i = i[:max_det]
        if merge and (1 < n < 3E3):  # Merge NMS (boxes merged using weighted mean)
            try:  # update boxes as boxes(i,4) = weights(i,n) * boxes(n,4)
                iou = box_iou(boxes[i], boxes) > iou_thres  # iou matrix
                weights = iou * scores[None]  # box weights
                x[i, :4] = torch.mm(weights, x[:, :4]).float() / weights.sum(1, keepdim=True)  # merged boxes
                if redundant:
                    i = i[iou.sum(1) > 1]  # require redundancy
            except:  # possible CUDA error https://github.com/ultralytics/yolov3/issues/1139
                print(x, i, x.shape, i.shape)
                pass

        output[xi] = x[i]
        if (time.time() - t) > time_limit:
            break  # time limit exceeded

    return output



device = 'MYRIAD'
input_h, input_w, input_c, input_n = (640, 640, 3, 1)
log.basicConfig(level=log.DEBUG)

# For objection detection task, replace your target labels here.
# label_id_map = {
#     0: "fire",
# }

names=['red light', 'speed 40', 'speed 25', 'stop', 'people']

label_id_map = {index: item for index, item in enumerate(names)}
exec_net = None


def init():
    """Initialize model

    Returns: model

    """
    #model_xml = "/project/train/src_repo/yolov5/runs/exp0/weights/best.xml"
    #model_xml = "/usr/local/ev_sdk/model/openvino/yolov5x_10_best.xml"
    model_xml = "/home/pi/runs/IR/best.xml"
    if not os.path.isfile(model_xml):
        log.error(f'{model_xml} does not exist')
        return None
    model_bin = pathlib.Path(model_xml).with_suffix('.bin').as_posix()
#     log.info("Loading network files:\n\t{}\n\t{}".format(model_xml, model_bin))
    net = IENetwork(model=model_xml, weights=model_bin)

    # Load Inference Engine
#     log.info('Loading Inference Engine')
    ie = IECore()
    global exec_net
    exec_net = ie.load_network(network=net, device_name=device)
#     log.info('Device info:')
#     versions = ie.get_versions(device)
#     print("{}".format(device))
#     print("MKLDNNPlugin version ......... {}.{}".format(versions[device].major, versions[device].minor))
#     print("Build ........... {}".format(versions[device].build_number))

    input_blob = next(iter(net.inputs))
    n, c, h, w = net.inputs[input_blob].shape
    print(n, c, h, w)
    global input_h, input_w, input_c, input_n
    input_h, input_w, input_c, input_n = h, w, c, n

    return net


def process_image(net, input_image):
    """Do inference to analysis input_image and get output

    Attributes:
        net: model handle
        input_image (numpy.ndarray): image to be process, format: (h, w, c), BGR
        thresh: thresh value

    Returns: process result

    """
    t = time.time()
    if not net or input_image is None:
        log.error('Invalid input args')
        return None
#     log.info(f'process_image, ({input_image.shape}')
    ih, iw, _ = input_image.shape

    # --------------------------- Prepare input blobs -----------------------------------------------------
    print('Prepare input blobs start', time.time()-t)
    t = time.time()
    if ih != input_h or iw != input_w:
        input_image = cv2.resize(input_image, (input_w, input_h))
    input_image = cv2.cvtColor(input_image, cv2.COLOR_BGR2RGB)
    input_image = input_image/255
    input_image = input_image.transpose((2, 0, 1))
    images = np.ndarray(shape=(input_n, input_c, input_h, input_w))
    images[0] = input_image

    input_blob = next(iter(net.inputs))
    out_blob = next(iter(net.outputs))
    print('Prepare input blobs finished', time.time()-t)
    t = time.time()
    # --------------------------- Prepare output blobs ----------------------------------------------------
#     log.info('Preparing output blobs')
#     log.info(f"The output_name{net.outputs}")
    #print(net.outputs)
#     output_name = "Transpose_305"
#     try:
#         output_info = net.outputs[output_name]
#     except KeyError:
#         log.error(f"Can't find a {output_name} layer in the topology")
#         return None

#     output_dims = output_info.shape
#     log.info(f"The output_dims{output_dims}")
#     if len(output_dims) != 4:
#         log.error("Incorrect output dimensions for yolo model")
#     max_proposal_count, object_size = output_dims[2], output_dims[3]

#     if object_size != 7:
#         log.error("Output item should have 7 as a last dimension")

    #output_info.precision = "FP32"

    # --------------------------- Performing inference ----------------------------------------------------
#     log.info("Creating infer request and starting inference")
    print('Performing inference start', time.time()-t)
    t = time.time()
    res = exec_net.infer(inputs={input_blob: images})
    print('Performing inference finished', time.time()-t)
    t = time.time()
    # --------------------------- Read and postprocess output ---------------------------------------------
    print('Processing output blobs start', time.time()-t)
    t = time.time()
#     log.info("Processing output blobs")

#     res = res[out_blob]
    data = res[out_blob]

    
    data=non_max_suppression(data, 0.4, 0.5)
    detect_objs = []
    if data[0]==None:
        return json.dumps({"objects": detect_objs})
    else:
        data=data[0].numpy()
        for idx,proposal in enumerate(data):
            if proposal[4] > 0 :
    #             print(proposal)
                confidence = proposal[4]
                xmin = np.int(iw * (proposal[0]/640))
                ymin = np.int(ih * (proposal[1]/640))
                xmax = np.int(iw * (proposal[2]/640))
                ymax = np.int(ih * (proposal[3]/640))
                idx = int(proposal[5])
    #             if label not in label_id_map:
    #                 log.warning(f'{label} does not in {label_id_map}')
    #                 continue
                detect_objs.append({
                    'name': label_id_map[idx],
                    'xmin': int(xmin),
                    'ymin': int(ymin),
                    'xmax': int(xmax),
                    'ymax': int(ymax),
                    'confidence': float(confidence)
                })
        print('Processing output blobs finished', time.time()-t)
        # t = time.time()
        return json.dumps({"objects": detect_objs})


if __name__ == '__main__':
    # Test API
    img = cv2.imread('/home/pi/picar-x/test_images/image5.jpg')
    predictor = init()
    import time
    t = time.time()
    n = 1
    for i in range(n):
        result = process_image(predictor, img)

    print("平均推理时间",(time.time()-t)/n)
    print("FPS", 1/((time.time()-t)/n))
    # log.info(result)
    for obj in json.loads(result)['objects']:
        print(obj)
    time.sleep(1)

运行结果及报错内容

这个程序是可以正常运行并检测出图片中对应的物体的，但是我不明白怎么才能调用摄像头进行实时检测

我的解答思路和尝试过的方法

我尝试将

img = cv2.imread('/home/pi/picar-x/test_images/image5.jpg')

改成

img = cv2.VideoCapture(0)

但是出现报错“AttributeError: 'cv2.VideoCapture' object has no attribute 'shape'”
并且我觉得调用摄像头实时检测的话是不是也要对程序进行循环

我想要达到的结果

把该程序改成调用摄像头进行实时对象检测。

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除
收藏举报

3条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
herosunly Python领域优质创作者 2022-08-07 07:34
关注
获得2.80元问题酬金
麻烦先执行下这个代码，把print的结果发一下，然后再看看下面怎么搞

capture = cv2.VideoCapture(0) possible = capture.set(cv2.CAP_MODE_BGR) print(possible)

那你试试另外三个看看有没有不报错的：

cv2.CAP_MODE_RGB
cv2.CAP_MODE_GRAY
cv2.CAP_MODE_YUYV
解决无用
评论打赏
分享
举报编辑记录

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

yolov5用openvino转成的IR模型 openvino 目标检测
2022-05-06 18:19

回答 1 已采纳这个自己转啊，export.py里面都写好了转换程序的，你安装好在include参数里面加上openvino运行就是了，别人转的版本不一致兼容性会出问题
树莓派+openVINO+二代神经计算棒加速推理YOLOv5模型出错，如何解决？(语言-python) openvino python 有问必答深度学习
2022-03-03 20:30

回答 2 已采纳我没试过openvino，但是这个不是可以直接读取onnx的吗？或者你可以直接从export.py导出openvino的文件你这里的报错应该是说你的blob.shape返回值长度不是4个，但是你期望
pt模型转torchscript模型 pytorch 人工智能深度学习
2022-05-05 21:26

回答 1 已采纳 torch.jit.save — PyTorch 1.11.0 documentation
Windows 上使用LabVIEW AI 工具包 for OpenVINO™部署YOLOv9实现实时目标检测
2024-03-14 13:28

英特尔开发人员专区的博客本文我们将结合之前开发的 LabVIEW AI 工具包for OpenVINO™工具包部署 YOLO9 模型实现实时目标检测。
用8259的IR4引脚做为中断请求的输入端，实现发光二极管偶数位发光二极管从左至右依次点亮;发光二极管奇数位从右至左依次点亮。 stm32 其他
2022-06-02 08:40

回答 1 已采纳 8259中断控制_hasp_Jason的博客-CSDN博客_8259中断控制器实验 8259中断控制实验1 实验目的1. 掌握8259中
我正在使用Antlr4创建一种语言，然后使用该语言生成LLVM IR。我是否需要为访客事件手写LLVM IR？
2019-05-05 17:23

回答 1 已采纳 While learning Antlr4, I used Golang as a target language, so a statement in my toy language li
篮球红外计分器红外遥控部分按键实现不了，如何解决？(语言-c语言) c语言单片机
2022-04-16 18:53

回答 2 已采纳如果编译器支持调试的话，最好使用单步调试去找下问题，和硬件有关的问题，只是看代码的是很难发现的。导致出现你这个问题的原因可能是以下两种，一是你那段计算健值的代码有问题，你可以按下不同的按键，看一下得到
face_mask_detection_openvino：检测人脸并确定人们是否戴着口罩
2022-05-15 15:33

通过这个"face_mask_detection_openvino"项目，我们可以了解到如何将深度学习技术与OpenVINO相结合，实现在边缘设备上的高效、实时的人工智能应用。这对于推动AI在现实世界中的落地，特别是在医疗、安全等领域有着...
如何从数组中删除空对象？ [关闭] laravel php
2017-04-15 13:37

回答 3 已采纳 You can use my code in bottom : print_r(array_filter($linksArray));
c++多个源文件共用一个new动态分配类对象（extern 及new的用法）
2016-08-03 08:49

回答 2 已采纳 1、在 state.h声明全局变量： extern IRSendRev *IR; 2、在state.cpp中定义该全局变量：IRSendRev *IR=new IRSendRev;
如何在php下使用ord（）和chr（）实现char php
2016-03-16 18:58

回答 2 已采纳 Requirements: Allow increment of a string in the same manner as PHP (please see links in the quest
OpenVINO应用案例：部署YOLO模型到边缘计算摄像头
2022-03-05 18:45

同学来啦的博客通过OpenVINO部署YOLO模型到边缘计算摄像头，其实现路径为：训练(YOLO)->转换(OpenVINO)->部署运行(OpenNCC)。二、具体步骤 1、训练YOLO模型 1.1 安装环境依赖有关安装详情请参阅 ...
Pytorch调用bertEncoderbaTypeError: forward() missing 1 required positional argument: 'attention_mask' bert pytorch 深度学习
2022-07-07 15:35

回答 2 已采纳已解决，根本原因是数据格式的问题，在使用bert_encoder之前，需要将数据格式转换为BertData()格式
yolov5 openvino版本
2022-05-21 18:49

本项目将YOLOv5模型转换为OpenVINO可执行格式，以在Windows 10系统上利用Visual Studio 2019进行编译，并通过摄像头进行实时目标检测。 **YOLOv5模型** YOLOv5是由Joseph Redmon等人开发的目标检测模型的最新版本...
算法部署-使用OpenVINO在CPU平台部署实时图像分割算法DeeplabV3-项目源码-优质项目实战.zip
2024-04-26 14:01

1. **模型转换**：首先，你需要将训练好的DeeplabV3模型转换为OpenVINO可读的 Intermediate Representation (IR) 文件格式。这通常通过Model Optimizer工具完成，该工具能够优化模型并生成适用于不同硬件平台的IR...
【YOLOv9】实战一：在 Windows 上使用LabVIEW OpenVINO工具包部署YOLOv9实现实时目标检测（含源码）
2024-03-18 18:03

virobotics的博客 Hello，大家好，我是virobotics（仪酷智能），一个深耕于LabVIEW和人工智能领域的开发工程师。今天一起来看一下如何在 Windows 上使用LabVIEW OpenVINO工具包部署YOLOv9实现实时目标检测
yolov5 openvino2022版本
2022-05-22 00:46

YOLOv5与OpenVINO 2022版本结合使用，可以实现高效且精确的实时对象检测。YOLO（You Only Look Once）是一种流行的深度学习目标检测模型，以其快速和准确的性能而受到广泛关注。而OpenVINO（Open Visual Inference ...
没有解决我的问题, 去提问

问题事件

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
系统已结题 8月15日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
创建了问题 8月7日

悬赏问题

¥15 关于Finetune模型，CUDA error: device-side assert triggered 报错
¥15 能将阿里云上多个设备的信息能上传给小程序吗
¥50 QT6.7 Camera预览窗口，camera分辨率设置
¥15 车机是安卓4.4.3，车机没有gps，我想用外置蓝牙gps。所以在用安卓模拟位置服务时候，我下载的相关软件不显示在列表里
¥15 matlab水位控制系统（详解）
¥15 CST软件仿真，已知中心线方程构建图形
¥15 supLink 用户定位
¥15 materials studio中sorption模块得到的吸附量如何计算出绝对吸附量和自由气量
¥15 odoo17生产成品入出库会计日记账设置及自动产生会计分录
¥15 MCU控制20V PWM波输出的控制电路