TensorFlow object detection: getting bounding-box locations in regular (pixel) coordinates

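Object detection algorithms such as SSD and Fast R-CNN can recognize and localize objects. When the box is drawn around a detected object, how can I also display the regular xy (pixel) coordinates of the object's center or of the box edges? I have written some code, but it has a problem. Any help would be appreciated.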

def run_inference_for_single_image(image, graph):
    with graph.as_default():
        with tf.Session() as sess:
            # Get all ops in the graph
            ops = tf.get_default_graph().get_operations()
            # Collect the names of all output tensors
            all_tensor_names = {output.name for op in ops for output in op.outputs}
            tensor_dict = {}
            for key in [
              'num_detections', 'detection_boxes', 'detection_scores',
              'detection_classes', 'detection_masks'
            ]:
                tensor_name = key + ':0'
                # If tensor_name is in all_tensor_names
                if tensor_name in all_tensor_names:
                    # then fetch that tensor
                    tensor_dict[key] = tf.get_default_graph().get_tensor_by_name(
                      tensor_name)
            if 'detection_masks' in tensor_dict:
                # The following processing is only for a single image
                detection_boxes = tf.squeeze(tensor_dict['detection_boxes'], [0])
                detection_masks = tf.squeeze(tensor_dict['detection_masks'], [0])
                # Reframe is required to translate mask from box coordinates to image coordinates and fit the image size.
                real_num_detection = tf.cast(tensor_dict['num_detections'][0], tf.int32)
                detection_boxes = tf.slice(detection_boxes, [0, 0], [real_num_detection, -1])
                detection_masks = tf.slice(detection_masks, [0, 0, 0], [real_num_detection, -1, -1])
                detection_masks_reframed = utils_ops.reframe_box_masks_to_image_masks(
                    detection_masks, detection_boxes, image.shape[1], image.shape[2])
                detection_masks_reframed = tf.cast(
                    tf.greater(detection_masks_reframed, 0.5), tf.uint8)
                # Follow the convention by adding back the batch dimension
                tensor_dict['detection_masks'] = tf.expand_dims(
                    detection_masks_reframed, 0)
            # Input tensor for the image
            image_tensor = tf.get_default_graph().get_tensor_by_name('image_tensor:0')

            # Run the model on the image to get the results
            output_dict = sess.run(tensor_dict,
                             feed_dict={image_tensor: image})

            # All outputs are float32, so some of them need type conversion
            # Number of detected objects
            output_dict['num_detections'] = int(output_dict['num_detections'][0])
            # Object classes
            output_dict['detection_classes'] = output_dict[
              'detection_classes'][0].astype(np.uint8)
            # Predicted box coordinates
            output_dict['detection_boxes'] = output_dict['detection_boxes'][0]
            # Predicted box confidence scores
            output_dict['detection_scores'] = output_dict['detection_scores'][0]
            boxes = np.squeeze(output_dict['detection_boxes'])
            scores = np.squeeze(output_dict['detection_scores'])
            # Set a minimum score threshold, say 0.8
            min_score_thresh = 0.8
            bboxes = boxes[scores > min_score_thresh]
            # Get the image size
            im_width, im_height = image.size
            final_box = []
            for box in range(bboxes):
                ymin, xmin, ymax, xmax = box
                final_box.append([xmin * im_width, xmax * im_width, ymin * im_height, ymax * im_height])
    return output_dict

#for root,dirs,files in os.walk('test_images/'):
for root,dirs,files in os.walk('test/'):
    for image_path in files:
        # Read the image
        image = Image.open(os.path.join(root,image_path))
        # Convert the image to a 3-D array of dtype uint8
        image_np = load_image_into_numpy_array(image)
        # Add a batch dimension; the shape becomes [1, None, None, 3]
        image_np_expanded = np.expand_dims(image_np, axis=0)
        # Run object detection
        output_dict = run_inference_for_single_image(image_np_expanded, detection_graph)
        # Draw the predicted boxes, scores and class labels on the original image
        vis_util.visualize_boxes_and_labels_on_image_array(
            image_np,
            output_dict['detection_boxes'],
            output_dict['detection_classes'],
            output_dict['detection_scores'],
            category_index,
            use_normalized_coordinates=True,
            line_thickness=8)

        # Plot
        # print ("box : ", final_box)
        plt.figure(figsize=(12,8))
        plt.imshow(image_np)
        plt.axis('off')
        plt.show()


---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-24-32205908683b> in <module>
      9         image_np_expanded = np.expand_dims(image_np, axis=0)
     10         # Run object detection
---> 11         output_dict = run_inference_for_single_image(image_np_expanded, detection_graph)
     12         # Draw the predicted boxes, scores and class labels on the original image
     13         vis_util.visualize_boxes_and_labels_on_image_array(

<ipython-input-23-2044b0b101cc> in run_inference_for_single_image(image, graph)
     56             bboxes = boxes[scores > min_score_thresh]
     57             #get image size
---> 58             im_width, im_height = image.size
     59             final_box = []
     60             for box in range(bboxes):

TypeError: 'int' object is not iterable
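
The traceback points at `image.size`. The function receives the batch-expanded NumPy array, not the PIL image, and for a NumPy array `.size` is a single integer (the total number of elements), so it cannot be unpacked into `im_width, im_height`. The next line, `for box in range(bboxes)`, would most likely fail as well, because `range()` expects an integer rather than an array. Below is a minimal sketch of the conversion, assuming `image` has shape `(1, height, width, 3)` and the boxes are normalized `[ymin, xmin, ymax, xmax]` rows, as the unpacking in the code above suggests. The helper name `boxes_to_pixels` and the sample data are made up for illustration.

```python
import numpy as np


def boxes_to_pixels(boxes, scores, image_shape, min_score_thresh=0.8):
    """Convert normalized [ymin, xmin, ymax, xmax] boxes to pixel coordinates.

    boxes:       float array of shape (N, 4), values in [0, 1]
    scores:      float array of shape (N,)
    image_shape: shape of the batched image array, (1, height, width, channels)
    """
    # For a batched NumPy image, height and width are axes 1 and 2.
    # (ndarray.size is one int, so it cannot be unpacked into two names
    # the way the (width, height) tuple of a PIL image can.)
    im_height, im_width = image_shape[1], image_shape[2]

    # Keep only detections above the score threshold.
    kept = boxes[scores > min_score_thresh]

    pixel_boxes = []
    # Iterate over the rows directly instead of range(bboxes).
    for ymin, xmin, ymax, xmax in kept:
        pixel_boxes.append([
            int(round(xmin * im_width)),
            int(round(ymin * im_height)),
            int(round(xmax * im_width)),
            int(round(ymax * im_height)),
        ])
    return pixel_boxes


# Hypothetical data: two detections on a 480x640 (height x width) image.
image = np.zeros((1, 480, 640, 3), dtype=np.uint8)
boxes = np.array([[0.25, 0.5, 0.75, 1.0],
                  [0.0, 0.0, 0.5, 0.5]], dtype=np.float32)
scores = np.array([0.9, 0.3], dtype=np.float32)

print(image.size)  # a single int, not (width, height)
print(boxes_to_pixels(boxes, scores, image.shape))
```

Note that `final_box` in the posted function is a local variable and is never returned, so the commented-out `print` in the calling loop could not see it. Returning the list alongside `output_dict`, or storing it in `output_dict`, would be one way around that.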

![Image description](https://img-ask.csdn.net/upload/201908/25/1566710175_880656.jpg)
![Image description](https://img-ask.csdn.net/upload/201908/25/1566710262_253891.jpg)


1 answer

CSDN-Ada助手 (official CSDN-AI account), 2022-10-25 19:26
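I'm not sure whether this question has already been resolved. If it hasn't:
This article covers the topic in detail: "TensorFlow error messages" (TensorFlow的报错信息)

If you have already solved the problem, please consider sharing your solution as a blog post and putting the link in the comments to help others ^-^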
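
For the original question of showing the center or edge coordinates of each box, here is a rough sketch, assuming the boxes have already been converted to pixel `[xmin, ymin, xmax, ymax]` values. The helper names `box_centers` and `describe_boxes` and the sample boxes are hypothetical. Image coordinates are taken with the origin at the top-left corner, x to the right and y downward.

```python
import numpy as np


def box_centers(pixel_boxes):
    """Return (cx, cy) centers for boxes given as [xmin, ymin, xmax, ymax]."""
    arr = np.asarray(pixel_boxes, dtype=np.float64).reshape(-1, 4)
    cx = (arr[:, 0] + arr[:, 2]) / 2.0  # midpoint of the left and right edges
    cy = (arr[:, 1] + arr[:, 3]) / 2.0  # midpoint of the top and bottom edges
    return np.stack([cx, cy], axis=1)


def describe_boxes(pixel_boxes):
    """Build one text label per box with its corners and center."""
    labels = []
    centers = box_centers(pixel_boxes)
    for (xmin, ymin, xmax, ymax), (cx, cy) in zip(pixel_boxes, centers):
        labels.append(
            "box=({}, {})-({}, {}) center=({:.1f}, {:.1f})".format(
                xmin, ymin, xmax, ymax, cx, cy))
    return labels


# Hypothetical pixel boxes in [xmin, ymin, xmax, ymax] order.
pixel_boxes = [[320, 120, 640, 360], [10, 20, 111, 60]]
print(box_centers(pixel_boxes))
for label in describe_boxes(pixel_boxes):
    print(label)
```

The resulting strings could then be printed next to each detection or drawn onto the figure, for example with `plt.text(cx, cy, label)` before `plt.show()`.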