基于改进YOLOv7的无人机航拍视频西瓜计数方法
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP391.4;S24

基金项目:

国家自然科学基金资助项目( 61703363, 62272284) ;山西省重点实验室开放课题基金项目( CICIP2022002)


Improved YOLOv7 method for counting watermelons in UAV aerial videos
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为解决自然环境下西瓜分布不均且遮挡严重导致的人工计数困难问题,该研究提出一种YOLOv7-GCSF模型与DeepSORT算法相融合的无人机视频西瓜自动计数方法。采用GhostConv及C2f模块轻量化YOLOv7模型,以减少模型冗余信息;引入SimAM注意力机制,构建MP-SimAM模块,用于提高模型特征提取能力;替换CIoU为Focal EIoU损失函数,以增加模型收敛性能;在DeepSORT中提出一种掩模撞线机制,用于提高计数精度。结果表明,YOLOv7-GCSF目标检测模型精确率(P)、均值平均精度(mAP0.5)分别达到94.2%、98.2%,相比YOLOv7模型分别提高2.3、0.3个百分点,在模型轻量化方面,较YOLOv7模型浮点运算数下降77.5G,模型参数量、模型大小分别下降0.57M和18.88MB;与传统Tracktor和SORT算法相比,改进的DeepSORT算法跟踪准确率分别提高5.0和13.7个百分点;三白瓜及宁夏硒砂瓜计数结果决定系数为0.93、平均计数精度为96.3%、平均绝对误差为0.77。该方法可有效统计西瓜园西瓜数量,为西瓜产量预测提供一种行之有效的技术途径。

    Abstract:

    To address the difficulties in manual counting for the uneven distribution and severe occlusion of watermelons in natural environments, this study utilizes drones and smartphones to collect videos and images, combined with manual annotation to establish a dataset for Sanbai melons and Ningxia selenium sand melons. A watermelon video automatic counting method based on the YOLOv7-GCSF model and an improved DeepSORT algorithm is proposed. The lightweight YOLOv7 model with GhostConv is enhanced with GBS modules, G-ELAN modules, and G-SPPCSPC modules to increase the model’s detection speed. Some ELAN modules are replaced with the C2f module from YOLOv8 to reduce redundant information. The SimAM attention mechanism is introduced into the MP module of the feature fusion layer to construct the MP-SimAM module, which is used to enhance the model's feature extraction capability. The CIoU loss function is replaced with the faster-converging, lower-loss Focal EIoU loss function to increase the model's convergence speed. In video tracking and counting, a mask collision line mechanism is proposed for more accurate counting of Sanbai melons and Ningxia selenium sand melons. The results show that in terms of object detection: the four improvements to the YOLOv7-GCSF model have all enhanced the model’s performance to some extent. Specifically, compared to the YOLOv7 model, the construction of the MP-SimAM module increased accuracy by 1.5 percentage points, indicating a greater focus on Sanbai melons and Ningxia selenium sand melons. The addition of GhostConv reduced the model size by 28.1MB, demonstrating that the construction of GBS, G-ELAN, and G-SPPCSPC modules effectively reduced the model size and improved detection speed. The incorporation of the C2f module reduced the model's floating-point operations (FLOPs) by 77.5 billion, indicating that the model has eliminated most of the redundant information. The addition of the Focal EIoU loss function significantly increased the model’s convergence speed, indicating further enhancement of the model's learning ability. The improved YOLOv7-GCSF model achieved an accuracy (P) of 94.2% and a mean average precision (mAP0.5) of 98.2%, which is 5.0, 2.3, 21.9, and 14.9 percentage points higher in accuracy and 3.7, 0.3, 4.6, and 9.3 percentage points higher in mean average precision compared to YOLOv5, YOLOv7, Faster RCNN, and SSD, respectively. In terms of model lightweighting, the YOLOv7-GCSF model has seen a decrease of 1.18M and 0.11M in the number of parameters compared to the YOLOv4-Ghostnet and YOLOv7-Slimneck models, respectively. Compared to the original YOLOv7, the YOLOv7-GCSF model has reduced the parameter count and model size by 0.57M and 18.88MB, respectively. In terms of object tracking: the improved DeepSORT multi-object tracking accuracy is 91.2%, and the multi-object tracking precision is 89.6%, which is 5.0 and 13.7 percentage points higher in tracking accuracy and 3.7 and 13.1 percentage points higher in tracking precision compared to Tracktor and SORT, respectively. Comparing the improved model with manual counting results, the determination coefficient for the counting results of Sanbai melons and Ningxia selenium sand melons is 0.93, the average counting accuracy is 96.3%, and the average absolute error is 0.77, indicating that the error between the improved model and manual counting is small. This approach, by enabling effective counting of watermelons in agricultural fields, provides a technical methodology for the forecasting of watermelon yields.

    参考文献
    相似文献
    引证文献
引用本文

殷慧军,王宝丽,景运革,李菊霞,王鹏岭,权高翔,孙婷婷.基于改进YOLOv7的无人机航拍视频西瓜计数方法[J].农业工程学报,2024,40(19):124-134. DOI:10.11975/j. issn.1002-6819.20240407019

YIN Huijun, WANG Baoli, JING Yunge, LI Juxia, WANG Pengling, QUAN Gaoxiang, SUN Tingting. Improved YOLOv7 method for counting watermelons in UAV aerial videos[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE),2024,40(19):124-134. DOI:10.11975/j. issn.1002-6819.20240407019

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2024-04-11
  • 最后修改日期:2024-08-20
  • 录用日期:
  • 在线发布日期: 2024-09-29
  • 出版日期:
文章二维码
您是第位访问者
ICP:京ICP备06025802号-3
农业工程学报 ® 2024 版权所有
技术支持:北京勤云科技发展有限公司