☆ 4.7 Article

A Multimodal Dynamic Hand Gesture Recognition Based on Radar-Vision Fusion

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT (2023)

期刊

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT

卷 72, 期 -, 页码 -

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TIM.2023.3253906

关键词

Radar; Sensors; Gesture recognition; Feature extraction; Hidden Markov models; Cameras; Reliability; Deep learning; frequency-modulated continuous-wave (FMCW); hand gesture recognition (HGR); millimeter-wave (MMW); multimodal fusion

类别

Engineering, Electrical & Electronic Instruments & Instrumentation

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper proposes a multimodal dynamic hand gesture recognition method based on a two-branch fusion deformable network with Gram matching. It effectively improves the adaptability of the classifier to complex environments and exhibits satisfactory robustness to multiple subjects.

Regarding increasingly complex scenarios in hand gesture recognition (HGR), it is challenging to implement a reliable HGR due to the nonadaptability of individual sensors to the environment and the discrepancy of personal habits. Multisensor fusion has been deemed an effective way to overcome the limitations of a single sensor. However, there is a lack of research on HGR to effectively establish bridges linking multimodal heterogeneous information. To address this issue, we propose a novel multimodal dynamic HGR method based on a two-branch fusion deformable network with Gram matching. First, a time-synchronized method is designed to preprocess the multimodal data. Second, a two-branch network is proposed to implement gesture classification based on radar-vision fusion. The input convolution is replaced by the deformable convolution to improve the generalization of gesture motion modeling. The long short-term memory (LSTM) unit is used to extract the temporal features of dynamic hand gestures. Third, Gram matching is presented as a loss function to mine high-dimensional heterogeneous information and maintain the integrity of radar-vision fusion. The experimental results indicate that the proposed method effectively improves the adaptability of the classifier to complex environments and exhibits satisfactory robustness to multiple subjects. Furthermore, ablation analysis shows that deformable convolution and Gram loss not only provide reliable gesture recognition but also enhance the generalization ability of the proposed methods in different field-of-view scenarios.

A Multimodal Dynamic Hand Gesture Recognition Based on Radar-Vision Fusion

期刊

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A Multimodal Dynamic Hand Gesture Recognition Based on Radar-Vision Fusion

期刊

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文