4 May 2022 Design of hand detection based on attention and feature enhancement pyramids
Jiao Li, Haodong Sun, Yang Qiao, Zhongyu Li, Sijie Ran, Xuecheng Sun
Author Affiliations +
Abstract

Hand detection plays an important role in human–computer interaction. Because of the convenient and natural advantages of hands, hand detection is increasingly used in virtual reality, remote control, and other fields. However, since the complex background and the diversity of hand postures, the YOLOv4 algorithm for hand detection suffers from low accuracy and robustness. Therefore, A YOLOv4-HAND network, improved from YOLOv4, is proposed to solve the problem. We first use the dilation convolution to build the feature enhancement pyramid that enables the network to expand semantic information. Second, for better detection of different hand scales, we design a multiscale attention module to capture the correlation of channel information within different scales. Third, we design a head that incorporates a spatial attention module to compensate for the network’s lack of spatial contextual location information correlation. Finally, we use soft nonmaximum suppression to reduce the impact of occlusion. The results show that the YOLOv4-HAND detection network can achieve 83.22% and 93.95% mAP on the publicly available datasets Oxford hand and Egohands datasets. Compared with the most recent method, the YOLOv4-HAND network effectively improves the accuracy of hand detection for practical applications.

© 2022 SPIE and IS&T 1017-9909/2022/$28.00 © 2022 SPIE and IS&T
Jiao Li, Haodong Sun, Yang Qiao, Zhongyu Li, Sijie Ran, and Xuecheng Sun "Design of hand detection based on attention and feature enhancement pyramids," Journal of Electronic Imaging 31(3), 033005 (4 May 2022). https://doi.org/10.1117/1.JEI.31.3.033005
Received: 13 December 2021; Accepted: 11 April 2022; Published: 4 May 2022
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature extraction

Head

Convolution

Detection and tracking algorithms

Image fusion

Environmental sensing

Matrix multiplication

Back to Top