通过opencv运行yolov5模型打开摄像头进行检测

### 使用 OpenCV 调用 YOLOv5 实现摄像头实时目标检测要通过 OpenCV 和 YOLOv5 模型实现摄像头的目标检测，可以按照以下方法完成。此过程涉及加载预训练的 ONNX 格式的 YOLOv5 模型、读取摄像头帧并执行推理。 #### 1. 加载 YOLOv5 的 ONNX 模型 YOLOv5 提供了导出为 ONNX 格式的功能，这使得它可以通过 OpenCV 进行高效部署。以下是加载模型的关键代码： ```python import cv2 import numpy as np # 初始化网络 model_path = 'yolov5s.onnx' # 替换为你下载好的ONNX模型路径 net = cv2.dnn.readNetFromONNX(model_path) # 设置计算后端和目标设备 (CPU/GPU) net.setPreferableBackend(cv2.dnn.DNN_BACKEND_OPENCV) net.setPreferableTarget(cv2.dnn.DNN_TARGET_CPU) # 或者 DNN_TARGET_CUDA 如果支持GPU ``` 此处 `cv2.dnn.readNetFromONNX` 函数用于加载 ONNX 文件[^3]。 --- #### 2. 配置摄像头捕获为了从摄像头获取实时视频流，需初始化 VideoCapture 对象，并设置其参数以匹配输入需求。 ```python cap = cv2.VideoCapture(0) # 打开默认摄像头 (索引为0) if not cap.isOpened(): raise IOError("无法打开摄像头") frame_width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH)) frame_height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT)) print(f"Camera resolution set to {frame_width}x{frame_height}") ``` 以上代码片段展示了如何配置摄像头分辨率和其他属性。 --- #### 3. 定义前处理函数 YOLOv5 输入通常是一个固定大小的张量（如 640×640）。因此，在将每一帧传递给模型之前，需要对其进行缩放和标准化操作。 ```python def preprocess(frame, input_size=640): blob = cv2.dnn.blobFromImage( frame, scalefactor=1 / 255.0, size=(input_size, input_size), swapRB=True, crop=False ) return blob ``` 该函数负责创建适合神经网络输入的数据格式[^3]。 --- #### 4. 推理与后处理逻辑在获得预测结果之后，还需要解析这些输出并将它们映射回原始图像坐标系中。 ```python class_names = ['person', 'bicycle', 'car', ...] # 类别名称列表应根据实际使用的权重文件定义 def postprocess(outputs, frame, conf_threshold=0.5, nms_threshold=0.4): boxes = [] class_ids = [] confidences = [] for output in outputs: for detection in output: scores = detection[5:] class_id = np.argmax(scores) confidence = scores[class_id] if confidence > conf_threshold: w, h = int(detection[2] * frame.shape[1]), int(detection[3] * frame.shape[0]) x, y = int((detection[0] * frame.shape[1]) - w / 2), int((detection[1] * frame.shape[0]) - h / 2) boxes.append([x, y, w, h]) class_ids.append(class_id) confidences.append(float(confidence)) indices = cv2.dnn.NMSBoxes(boxes, confidences, conf_threshold, nms_threshold) results = [] for i in indices.flatten(): box = boxes[i] label = f'{class_names[class_ids[i]]}: {confidences[i]:.2f}' color = colors[class_ids[i]] cv2.rectangle(frame, (box[0], box[1]), (box[0]+box[2], box[1]+box[3]), color, thickness=2) cv2.putText(frame, label, (box[0], box[1]-10), cv2.FONT_HERSHEY_SIMPLEX, 0.9, color, 2) return frame ``` 这段代码实现了非极大值抑制(NMS)，过滤掉低概率框，并绘制边界框及其标签到原图上[^3]。 --- #### 5. 主循环：实时检测流程最后一步是构建主程序循环来连续捕捉画面并显示带有标注的结果。 ```python while True: ret, frame = cap.read() if not ret: break blob = preprocess(frame) net.setInput(blob) outputs = net.forward() result_frame = postprocess(outputs, frame) cv2.imshow('Object Detection', result_frame) key = cv2.waitKey(1) & 0xFF if key == ord('q'): break cap.release() cv2.destroyAllWindows() ``` 上述脚本会持续更新窗口直到按下键盘上的 Q 键退出。 --- ### 总结综上所述，借助 OpenCV 可轻松集成 YOLOv5 模型至 Python 应用程序之中，从而实现实时对象探测功能。整个方案涵盖了模型导入、数据准备以及最终可视化等多个环节[^1][^2].

创作声明：本文部分内容由AI辅助生成（AIGC），仅供参考

下一篇 EWT: Efficient Wavelet-Transformer for Single Image Denoising 源码