4.6 Article

A Comparative Study: Toward an Effective Convolutional Neural Network Architecture for Sensor-Based Human Activity Recognition

期刊

IEEE ACCESS
卷 10, 期 -, 页码 20547-20558

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2022.3152530

关键词

Feature extraction; Computer architecture; Convolutional neural networks; Analytical models; Deep learning; Convolution; Computational modeling; Human activity recognition; convolutional neural network; CNN architecture; submodules

资金

  1. Japan Society for the Promotion of Science (JSPS) KAKENHI [19K20420]
  2. Grants-in-Aid for Scientific Research [19K20420] Funding Source: KAKEN

向作者/读者索取更多资源

This paper reviews previous studies on deep learning methods in human activity recognition (HAR) and discusses suitable models for feature extraction. Through comparative experiments, it is found that Inception-V3 backbone with cross-channel multi-size convolution transformation performs the best. In the experiments with embedded submodules, it is discovered that not all submodules have a positive effect on accuracy, and SENet shows positive results compared to other submodules. In conclusion, it is crucial to select an appropriate backbone model before applying submodules, and submodules are unnecessary in some cases.
The feature extraction of human activity recognition (HAR) based on sensor data has been studied as a hand-crafted method. The significant feature extraction ability is a key factor in improving the accuracy of HAR. Recently, deep learning methods have been employed for feature extraction. In this paper, we review previous studies on deep learning methods in HAR and discuss suitable models for feature extraction. First, we applied various convolutional neural networks to clarify the effective architecture for HAR. Afterward, we developed advanced models by embedding submodules, such as self-attention and recurrent neural networks, often adopted in recent studies. Comparative experiments on HASC, UCI, and WISDM public datasets showed that Inception-V3, which used cross-channel multi-size convolution transformation, outperformed other backbones. Through comparative experiments after embedding submodules, submodules do not always have a positive effect on accuracy. Compared with other submodules, SENet has a positive effect. We conclude that it is essential to select an appropriate backbone model before applying the submodules, and submodules are unnecessary in some cases.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据