4.6 Article

Human activity recognition based on smartphone and wearable sensors using multiscale DCNN ensemble

期刊

NEUROCOMPUTING
卷 444, 期 -, 页码 226-243

出版社

ELSEVIER
DOI: 10.1016/j.neucom.2020.04.151

关键词

Human activity recognition; Multimodal data; CNN ensemble; Multiscale temporal data

资金

  1. National Council for Scientific and Technological Development - CNPq [438629/2018-3, 309953/2019-7]
  2. Minas Gerais Research Foundation - FAPEMIG [APQ-00567-14, PPM-00540-17]
  3. Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior - Brasil (CAPES) [001]

向作者/读者索取更多资源

Sensor-based Human Activity Recognition (HAR) plays a significant role in various real-world applications by extracting features from individual sensors and patterns from multiple temporal scales of data to improve recognition accuracy.
Sensor-based Human Activity Recognition (sensor-based HAR) has been used in many real-world applications providing valuable knowledge to many areas, such as human-object interaction, medical, military and security. Recently, wearable devices have progressively gained momentum due to their relevant data provided by their sensors, which could be employed in sensor-based HAR. In addition, the large number of sensors present in these devices provides complementary data since each sensor provides distinct information. However, there are two main issues: data heterogeneity between multiple sensors and the temporal nature of the sensor data. To cope with the former issue, we process each sensor separately, learning their features and performing the classification before fusing with the other sensors. To exploit the latter issue, we use an approach to extract patterns in multiple temporal scales of the data, using an ensemble of Deep Convolution Neural Networks (DCNN). This is convenient since the data are already a temporal sequence and the multiple scales extracted provide meaningful information regarding the activities performed by the users. Consequently, our approach is able to extract both simple movement patterns, such as a wrist twist when picking up a spoon and complex movements, such as the human gait. This multimodal and multi-temporal approach outperforms previous state-of-the-art works in seven important datasets using two different protocols. Finally, we demonstrate that our proposed set of kernels improves sensor-based HAR in another multi-kernel approach, the widely employed inception network. (c) 2020 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Artificial Intelligence

A mid-level video representation based on binary descriptors: A case study for pornography detection

Carlos Caetano, Sandra Avila, William Robson Schwartz, Silvio Jamil F. Guimaraes, Arnaldo de A. Araujo

NEUROCOMPUTING (2016)

Article Computer Science, Artificial Intelligence

Summarizing video sequence using a graph-based hierarchical approach

Luciana dos Santos Belo, Carlos Antonio Caetano, Zenilton Kleber Goncalves do Patrocinio, Silvio Jamil Ferzoli Guimaraes

NEUROCOMPUTING (2016)

Article Engineering, Electrical & Electronic

Histograms of Optical Flow Orientation and Magnitude and Entropy to Detect Anomalous Events in Videos

Rensso Victor Hugo Mora Colque, Carlos Caetano, Matheus Toledo Lustosa de Andrade, William Robson Schwartz

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2017)

Article Computer Science, Information Systems

Magnitude-Orientation Stream network and depth information applied to activity recognition

Carlos Caetano, Victor H. C. de Melo, Francois Bremond, Jefersson A. dos Santos, William Robson Schwartz

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Pixel-level Class-Agnostic Object Detection using Texture Quantization

Gabriel R. Goncalves, Jessica Sena, William Robson Schwartz, Carlos Antonio Caetano

Summary: Object detection is a widely studied topic in computer vision research and is essential for systems involving visual scene understanding. As technology advances, more challenging issues in object detection, such as class-agnostic object detection, have emerged. This paper addresses the task of class-agnostic object detection using a convolutional network and texture graylevel quantization. The results show a significant improvement compared to the baseline in detecting objects without determining their classes.

2022 35TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2022) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Skeleton Image Representation for 3D Action Recognition based on Tree Structure and Reference Joints

Carlos Caetano, Francois Bremond, William Robson Schwartz

2019 32ND SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI) (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Object-based Temporal Segment Relational Network for Activity Recognition

Victor H. C. Melo, Jesimon B. Santos, Carlos Caetano, Jessica Sena, Otavio A. B. Penatti, William Robson Schwartz

PROCEEDINGS 2018 31ST SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI) (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Activity Recognition based on a Magnitude-Orientation Stream Network

Carlos Caetano, Victor H. C. de Melo, Jefersson A. dos Santos, William Robson Schwartz

2017 30TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Histograms of Optical Flow Orientation and Magnitude to Detect Anomalous Events in Videos

Rensso Victor Hugo Mora Colque, Carlos Antonio Caetano Junior, William Robson Schwartz

2015 28TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (2015)

Proceedings Paper Computer Science, Artificial Intelligence

Graph-based hierarchical video summarization using global descriptors

Luciana Belo, Carlos Caetano, Zenilton Patrocinio, Silvio Guimaraes

2014 IEEE 26TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI) (2014)

Proceedings Paper Engineering, Electrical & Electronic

PORNOGRAPHY DETECTION USING BOSSANOVA VIDEO DESCRIPTOR

Carlos Caetano, Sandra Avila, Silvio Guimaraes, Arnaldo de A. Araujo

2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) (2014)

Article Computer Science, Artificial Intelligence

3D-KCPNet: Efficient 3DCNNs based on tensor mapping theory

Rui Lv, Dingheng Wang, Jiangbin Zheng, Zhao-Xu Yang

Summary: In this paper, the authors investigate tensor decomposition for neural network compression. They analyze the convergence and precision of tensor mapping theory, validate the rationality of tensor mapping and its superiority over traditional tensor approximation based on the Lottery Ticket Hypothesis. They propose an efficient method called 3D-KCPNet to compress 3D convolutional neural networks using the Kronecker canonical polyadic (KCP) tensor decomposition. Experimental results show that 3D-KCPNet achieves higher accuracy compared to the original baseline model and the corresponding tensor approximation model.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Personalized robotic control via constrained multi-objective reinforcement learning

Xiangkun He, Zhongxu Hu, Haohan Yang, Chen Lv

Summary: In this paper, a novel constrained multi-objective reinforcement learning algorithm is proposed for personalized end-to-end robotic control with continuous actions. The approach trains a single model using constraint design and a comprehensive index to achieve optimal policies based on user-specified preferences.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Overlapping community detection using expansion with contraction

Zhijian Zhuo, Bilian Chen, Shenbao Yu, Langcai Cao

Summary: In this paper, a novel method called Expansion with Contraction Method for Overlapping Community Detection (ECOCD) is proposed, which utilizes non-negative matrix factorization to obtain disjoint communities and applies expansion and contraction processes to adjust the degree of overlap. ECOCD is applicable to various networks with different properties and achieves high-quality overlapping community detection.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

High-compressed deepfake video detection with contrastive spatiotemporal distillation

Yizhe Zhu, Chunhui Zhang, Jialin Gao, Xin Sun, Zihan Rui, Xi Zhou

Summary: In this work, the authors propose a Contrastive Spatio-Temporal Distilling (CSTD) approach to improve the detection of high-compressed deepfake videos. The approach leverages spatial-frequency cues and temporal-contrastive alignment to fully exploit spatiotemporal inconsistency information.

NEUROCOMPUTING (2024)

Review Computer Science, Artificial Intelligence

A review of coverless steganography

Laijin Meng, Xinghao Jiang, Tanfeng Sun

Summary: This paper provides a review of coverless steganographic algorithms, including the development process, known contributions, and general issues in image and video algorithms. It also discusses the security of coverless steganography from theoretical analysis to actual investigation for the first time.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Confidence-based interactable neural-symbolic visual question answering

Yajie Bao, Tianwei Xing, Xun Chen

Summary: Visual question answering requires processing multi-modal information and effective reasoning. Neural-symbolic learning is a promising method, but current approaches lack uncertainty handling and can only provide a single answer. To address this, we propose a confidence based neural-symbolic approach that evaluates NN inferences and conducts reasoning based on confidence.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

A framework-based transformer and knowledge distillation for interior style classification

Anh H. Vo, Bao T. Nguyen

Summary: Interior style classification is an interesting problem with potential applications in both commercial and academic domains. This project proposes a method named ISC-DeIT, which combines data-efficient image transformer architectures and knowledge distillation, to address the interior style classification problem. Experimental results demonstrate a significant improvement in predictive accuracy compared to other state-of-the-art methods.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Improving robustness for vision transformer with a simple dynamic scanning augmentation

Shashank Kotyan, Danilo Vasconcellos Vargas

Summary: This article introduces a novel augmentation technique called Dynamic Scanning Augmentation to improve the accuracy and robustness of Vision Transformer (ViT). The technique leverages dynamic input sequences to adaptively focus on different patches, resulting in significant changes in ViT's attention mechanism. Experimental results demonstrate that Dynamic Scanning Augmentation outperforms ViT in terms of both robustness to adversarial attacks and accuracy against natural images.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Introducing shape priors in Siamese networks for image classification

Hiba Alqasir, Damien Muselet, Christophe Ducottet

Summary: The article proposes a solution to improve the learning process of a classification network by providing shape priors, reducing the need for annotated data. The solution is tested on cross-domain digit classification tasks and a video surveillance application.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Neural dynamics solver for time-dependent infinity-norm optimization based on ACP framework with robot application

Dexiu Ma, Mei Liu, Mingsheng Shang

Summary: This paper proposes a method using neural dynamics solvers to solve infinity-norm optimization problems. Two improved solvers are constructed and their effectiveness and superiority are demonstrated through theoretical analysis and simulation experiments.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

cpp-AIF: A multi-core C plus plus implementation of Active Inference for Partially Observable Markov Decision Processes

Francesco Gregoretti, Giovanni Pezzulo, Domenico Maisto

Summary: Active Inference is a computational framework that uses probabilistic inference and variational free energy minimization to describe perception, planning, and action. cpp-AIF is a header-only C++ library that provides a powerful tool for implementing Active Inference for Partially Observable Markov Decision Processes through multi-core computing. It is cross-platform and improves performance, memory management, and usability compared to existing software.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Predicting stock market trends with self-supervised learning

Zelin Ying, Dawei Cheng, Cen Chen, Xiang Li, Peng Zhu, Yifeng Luo, Yuqi Liang

Summary: This paper proposes a novel stock market trends prediction framework called SMART, which includes a self-supervised stock technical data sequence embedding model S3E. By training with multiple self-supervised auxiliary tasks, the model encodes stock technical data sequences into embeddings and uses the learned sequence embeddings for predicting stock market trends. Extensive experiments on China A-Shares market and NASDAQ market prove the high effectiveness of our model in stock market trends prediction, and its effectiveness is further validated in real-world applications in a leading financial service provider in China.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

DHGAT: Hyperbolic representation learning on dynamic graphs via attention networks

Hao Li, Hao Jiang, Dongsheng Ye, Qiang Wang, Liang Du, Yuanyuan Zeng, Liu Yuan, Yingxue Wang, C. Chen

Summary: DHGAT1, a dynamic hyperbolic graph attention network, utilizes hyperbolic metric properties to embed dynamic graphs. It employs a spatiotemporal self-attention mechanism and weighted node representations, resulting in excellent performance in link prediction tasks.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Progressive network based on detail scaling and texture extraction: A more general framework for image deraining

Jiehui Huang, Zhenchao Tang, Xuedong He, Jun Zhou, Defeng Zhou, Calvin Yu-Chian Chen

Summary: This study proposes a progressive learning multi-scale feature blending model for image deraining tasks. The model utilizes detail dilation and texture extraction to improve the restoration of rainy images. Experimental results show that the model achieves near state-of-the-art performance in rain removal tasks and exhibits better rain removal realism.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Stabilization and synchronization control for discrete-time complex networks via the auxiliary role of edges subsystem

Lizhi Liu, Zilin Gao, Yinhe Wang, Yongfu Li

Summary: This paper proposes a novel discrete-time interconnected model for depicting complex dynamical networks. The model consists of nodes and edges subsystems, which consider the dynamic characteristic of both nodes and edges. By designing control strategies and coupling modes, the stabilization and synchronization of the network are achieved. Simulation results demonstrate the effectiveness of the proposed methods.

NEUROCOMPUTING (2024)