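A question about #object detection#: small-object detection has hit a bottleneck. How can I run sliding-window detection with YOLOv5, and are there any blog posts you would recommend? (Language: Python)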

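My small-object detection has hit a bottleneck: I keep getting false detections and missed detections. How can I use sliding-window detection with YOLOv5, and are there any related blog posts you would recommend?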


1 answer

Py小郑 (Rising Star Creator, Python) 2023-04-29 17:29


Here is some example code for you; I hope you will accept the answer:

import cv2
import numpy as np
import torch
from PIL import Image
from matplotlib import pyplot as plt

# Load a custom YOLOv5 model through torch.hub
model = torch.hub.load('ultralytics/yolov5', 'custom', path='path/to/weights.pt')
model.conf = 0.5     # confidence threshold applied by the model
model.classes = [0]  # keep only class 0

# Sliding-window parameters
img_size = 640                          # window size in pixels (square window)
overlap = 0.5                           # overlap ratio between neighbouring windows
stride = int(img_size * (1 - overlap))  # step between window origins

# Load the input image as an RGB numpy array
img_path = 'path/to/input/image.jpg'
img = np.array(Image.open(img_path).convert('RGB'))
height, width, _ = img.shape


def window_starts(length, size, step):
    """Window origins along one axis; the last window is aligned to the border."""
    if length <= size:
        return [0]
    starts = list(range(0, length - size + 1, step))
    if starts[-1] != length - size:
        starts.append(length - size)
    return starts


# Collect detections from all windows in full-image coordinates
detections = []

for y in window_starts(height, img_size, stride):
    for x in window_starts(width, img_size, stride):
        # Crop the window from the input image
        window = img[y:y + img_size, x:x + img_size]

        # Convert the window to a PIL image and run YOLOv5 on it
        results = model(Image.fromarray(window))

        # Shift each box by the window origin: [x1, y1, x2, y2, conf, cls]
        for result in results.xyxy[0]:
            box = result.tolist()
            box[0] += x
            box[1] += y
            box[2] += x
            box[3] += y
            detections.append(box)

# Note: an object inside an overlap region may be reported by several windows.

# Draw detection boxes on the input image
for box in detections:
    x1, y1, x2, y2, conf, cls = box
    color = tuple(int(c) for c in np.random.randint(0, 256, size=3))
    cv2.rectangle(img, (int(x1), int(y1)), (int(x2), int(y2)), color, 2)

# Show the output image (the array is already RGB)
plt.imshow(img)
plt.show()
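
One thing the code above does not handle: with overlapping windows, the same object can be detected in two or more windows, so `detections` may contain near-duplicate boxes. A common remedy is to run non-maximum suppression (NMS) once more over the merged list. The following is only a minimal pure-Python sketch of greedy per-class NMS. The helper names `iou` and `merge_detections` and the default threshold of 0.5 are my own assumptions for illustration; they are not part of YOLOv5.

```python
def iou(a, b):
    """Intersection over union of two boxes given as [x1, y1, x2, y2, ...]."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def merge_detections(detections, iou_threshold=0.5):
    """Greedy per-class NMS over boxes of the form [x1, y1, x2, y2, conf, cls].

    Boxes are visited from highest to lowest confidence; a box is dropped
    when it overlaps an already kept box of the same class too strongly.
    """
    kept = []
    for det in sorted(detections, key=lambda d: d[4], reverse=True):
        duplicate = any(
            det[5] == k[5] and iou(det, k) >= iou_threshold for k in kept
        )
        if not duplicate:
            kept.append(det)
    return kept
```

To use it, call `detections = merge_detections(detections)` after the window loop and before drawing. For a large number of boxes, a vectorized NMS would be faster; the plain-Python version is just easier to read.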

Question closed on July 9
Question created on April 29