基于YOLOv8n和RexNet的轻量化动态手势识别网络设计
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP 399

基金项目:

国家自然科学基金资助项目(52175513)


Design of lightweight dynamic gesture recognition network based on YOLOv8n and RexNet
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对手术室环境下的动态手势人机交互问题,提出了一种基于深度学习的手势识别算法YOLO-RexNet。基于YOLOv8n的动态手势检测部分,在网络中融入DualConv卷积模块,利用GhostNetV2重构C2f模块,并应用模型剪枝技术,在减少模型参数量和计算量的同时保持较高的检测精度。实验结果表明,改进后的算法参数量减少了70.9%,计算量降低了69.7%,模型权重大小下降了66.2%,检测耗时降低了41.3%,动态手势识别的准确率达到99%。基于RexNet的手部关键点检测部分,将损失函数替换为Huber_Loss,使关键点的预测更加准确。所提算法在保证动态手势识别精度的同时,满足轻量化要求,实现了对医疗设备的便捷操作。在Jetson orin nano嵌入式边缘设备上部署时,每秒帧数能够达到65,具备良好的应用前景。

    Abstract:

    To address the challenges of dynamic gesture human-computer interaction in the operating room environment, a gesture recognition algorithm named YOLO-RexNet, based on deep learning, was proposed. The dynamic gesture detection component utilized YOLOv8n, incorporated the DualConv convolution module into the network, reconstructed the C2f module with GhostNetV2, and applied model pruning techniques to reduce parameters and computation while maintaining high detection accuracy. Experimental results demonstrate a 70.9% reduction in parameters, a 69.7% reduction in computation, a 66.2% reduction in model size, and a 41.3% decrease in inference latency, with dynamic gesture-recognition accuracy reaching 99%. In the hand key point detection component, the Huber_Loss function was employed to enhance the precision of key point predictions. The proposed algorithm achieves lightweight design while ensuring the accuracy of dynamic gesture recognition, aiming to facilitate the gesture-controlled medical operations. When deployed on Jetson orin nano embedded edge devices, the frames per second can reach 65, which demonstrates practical potential.

    参考文献
    相似文献
    引证文献
引用本文

廖良进,陈晓荣,倪明,李文莎,李柏杨,倪思佳.基于YOLOv8n和RexNet的轻量化动态手势识别网络设计[J].上海理工大学学报,2025,47(5):578-589.

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2024-08-27
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2025-11-21
  • 出版日期:
文章二维码