4.7 Article

A union of deep learning and swarm-based optimization for 3D human action recognition

Journal

SCIENTIFIC REPORTS
Volume 12, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41598-022-09293-8

Keywords

-

Funding

  1. National Agency for Academic Exchange of Poland under the Academic International Partnerships program [PPI/APM/2018/1/00004]
  2. Silesian University of Technology, Gliwice, Poland [09/010/RGJ22/0068]

Ask authors/readers for more resources

This paper proposes a human action recognition model DSwarm-Net based on 3D skeleton data. By encoding skeleton data into images and using deep learning and metaheuristic methods for feature extraction and optimization, competitive results are achieved on multiple HAR datasets.
Human Action Recognition (HAR) is a popular area of research in computer vision due to its wide range of applications such as surveillance, health care, and gaming, etc. Action recognition based on 3D skeleton data allows simplistic, cost-efficient models to be formed making it a widely used method. In this work, we propose DSwarm-Net, a framework that employs deep learning and swarm intelligence-based metaheuristic for HAR that uses 3D skeleton data for action classification. We extract four different types of features from the skeletal data namely: Distance, Distance Velocity, Angle, and Angle Velocity, which capture complementary information from the skeleton joints for encoding them into images. Encoding the skeleton data features into images is an alternative to the traditional video-processing approach and it helps in making the classification task less complex. The Distance and Distance Velocity encoded images have been stacked depth-wise and fed into a Convolutional Neural Network model which is a modified version of Inception-ResNet. Similarly, the Angle and Angle Velocity encoded images have been stacked depth-wise and fed into the same network. After training these models, deep features have been extracted from the pre-final layer of the networks, and the obtained feature representation is optimized by a nature-inspired metaheuristic, called Ant Lion Optimizer, to eliminate the non-informative or misleading features and to reduce the dimensionality of the feature set. DSwarm-Net has been evaluated on three publicly available HAR datasets, namely UTD-MHAD, HDM05, and NTU RGB+D 60 achieving competitive results, thus confirming the superiority of the proposed model compared to state-of-the-art models.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

Deep neural network correlation learning mechanism for CT brain tumor detection

Marcin Wozniak, Jakub Silka, Michal Wieczorek

Summary: Modern medical clinics use computer systems to support medical examinations and detect potential health problems more efficiently. Deep learning approaches have been proven to provide the most precise results in evaluating CT brain scans. In this article, a novel correlation learning mechanism (CLM) is proposed to combine convolutional neural network (CNN) with classic architecture. The support neural network helps CNN to optimize pooling and convolution layers, resulting in faster learning and higher efficiency of the main neural classifier.

NEURAL COMPUTING & APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

Inverted bell-curve-based ensemble of deep learning models for detection of COVID-19 from chest X-rays

Ashis Paul, Arpan Basu, Mufti Mahmud, M. Shamim Kaiser, Ram Sarkar

Summary: This article discusses the use of deep learning models and an inverted bell-curve weighted ensemble method to assist in the detection of COVID-19 in CXR images. By using transfer learning and retraining models pretrained on the ImageNet dataset, as well as performing weighted average predictions, the accuracy of COVID-19 identification in CXR images can be improved.

NEURAL COMPUTING & APPLICATIONS (2023)

Article Computer Science, Software Engineering

Handwritten Arabic and Roman word recognition using holistic approach

Samir Malakar, Samanway Sahoo, Anuran Chakraborty, Ram Sarkar, Mita Nasipuri

Summary: Handwritten word recognition is an open research problem due to variations in writing style and degraded images. This paper proposes a holistic approach combined with distance calculation and feature descriptors to address the problem. The experimental results demonstrate the effectiveness of the proposed method on standard databases compared to deep learning models.

VISUAL COMPUTER (2023)

Article Computer Science, Artificial Intelligence

Image contrast improvement through a metaheuristic scheme

Souradeep Mukhopadhyay, Sabbir Hossain, Samir Malakar, Erik Cuevas, Ram Sarkar

Summary: This paper introduces a new gray-scale contrast enhancement algorithm, which improves image quality by calculating near-optimal values using the Artificial Electric Field Algorithm (AEFA). Through comparisons with other techniques using standard metrics, simulation results show that the proposed method increases image contrast and enriches image information.

SOFT COMPUTING (2023)

Article Computer Science, Information Systems

Generation of a synthetic handwritten Bangla compound character dataset using a modified conditional GAN architecture

Anubhab Das, Arka Choudhuri, Arpan Basu, Ram Sarkar

Summary: This study proposes a GAN-based method for generating handwritten Bengali compound characters to address data scarcity. The model's performance is evaluated by assessing the quality of generated samples, showing that it outperforms basic AC-GAN architecture and some other existing GAN architectures.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Correction Computer Science, Artificial Intelligence

Human activity recognition from sensor data using spatial attention-aided CNN with genetic algorithm (Oct, 10.1007/s00521-022-07911-0, 2022)

Apu Sarkar, S. K. Sabbir Hossain, Ram Sarkar

NEURAL COMPUTING & APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

A hierarchical feature selection strategy for deepfake video detection

Sk Mohiuddin, Khalid Hassan Sheikh, Samir Malakar, Juan D. Velasquez, Ram Sarkar

Summary: Digital face manipulation has become a significant concern recently due to its harmful effects on society, particularly for high-profile celebrities who can easily be targeted using apps like FaceSwap and FaceApp. Detecting deepfake images or videos is challenging, and existing models often fail to check for irrelevant or redundant features. In this study, a hierarchical feature selection (HFS) method using a hybrid population-based meta-heuristic model and a single solution-based meta-heuristic model was proposed. The model achieved high AUC scores on three publicly available datasets and outperformed most state-of-the-art methods.

NEURAL COMPUTING & APPLICATIONS (2023)

Article Computer Science, Information Systems

Spline Interpolation and Deep Neural Networks as Feature Extractors for Signature Verification Purposes

Wei Wei, Qiao Ke, Dawid Polap, Marcin Wozniak

Summary: Digital security in modern systems often relies on biometric methods, and new implementations continue to emerge. This can be seen in various applications, such as signing for a courier package pick-up. However, signature verification is a complex process due to variations in size, angle, and writing conditions. Therefore, new methods are constantly needed to evaluate signatures. In this article, the authors propose the use of spline interpolation and two types of artificial neural networks to verify the identity of a person based on selected local and global features extracted from signature images. Experimental results on the SVC2004 database demonstrate an accuracy of 87.7%.

IEEE INTERNET OF THINGS JOURNAL (2023)

Article Medicine, General & Internal

A Multi-Stage Approach to Breast Cancer Classification Using Histopathology Images

Arnab Bagchi, Payel Pramanik, Ram Sarkar

Summary: Breast cancer is a deadly disease that affects women worldwide. Early diagnosis and proper treatment can save lives. Breast image analysis, including histopathological image analysis, and computer-aided diagnosis, can help improve efficiency and accuracy in breast cancer detection. In this study, a deep learning-based method was developed to classify breast cancer using histopathological images, achieving high classification accuracy.

DIAGNOSTICS (2023)

Article Computer Science, Artificial Intelligence

KGSR: A kernel guided network for real-world blind super-resolution

Qingsen Yan, Axi Niu, Chaoqun Wang, Wei Dong, Marcin Wozniak, Yanning Zhang

Summary: Deep learning-based methods have achieved remarkable results in the field of super-resolution. However, the limitation of paired training image sets has led researchers to explore self-supervised learning. However, the assumption of inaccurate downscaling kernel functions often leads to degraded results. To address this issue, this paper introduces KGSR, a kernel-guided network that trains both upscaling and downscaling networks to generate high-quality high-resolution images even without knowing the actual downscaling process.

PATTERN RECOGNITION (2024)

Proceedings Paper Computer Science, Artificial Intelligence

Algorithm for Solving Optimal Placement of Routers in Mines

Alan Popiel, Marcin Wozniak

Summary: This paper presents a model and an algorithm to optimize the placement of routers in a network system for the mining industry, with N chambers and N-1 or fewer connections between them. The model considers two types of routers with different signal strengths, and the algorithm has a computational complexity of O(n(2), as tested on sample graph structures.

ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2022, PT II (2023)

Proceedings Paper Computer Science, Artificial Intelligence

BiLSTM Deep Learning Model for Heart Problems Detection

Jakub Silka, Michal Wieczorek, Martyna Kobielnik, Marcin Wozniak

Summary: Deep learning architectures are used for demanding analysis of complex data inputs, where regular neural networks may encounter issues. In this article, we propose a deep learning model based on a BiLSTM neural network architecture. The proposed model is trained using the Adam algorithm, and we also examine other latest algorithms to determine the best configuration. Results show that our proposed BiLSTM deep learning neural network achieves over 99% accuracy.

ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2022, PT I (2023)

Article Computer Science, Information Systems

Brain Tumor Categorization and Retrieval Using Deep Brain Incep Res Architecture Based Reinforcement Learning Network

Jyotismita Chaki, Marcin Wozniak

Summary: This study proposes a reinforcement learning agent that can interact with brain tumor images to retrieve and categorize similar images. The proposed method utilizes a novel architecture and binary coding technique, as well as fuzzy logic-based sample generation, to improve brain tumor classification and retrieval.

IEEE ACCESS (2023)

Article Computer Science, Artificial Intelligence

Fuzzy logic type-2 intelligent moisture control system

Marcin Wozniak, Jozef Szczotka, Andrzej Sikora, Adam Zielonka

Summary: This article presents a model of adjustable moisture control for historical buildings, utilizing a flexible IoT infrastructure and type-2 fuzzy logic reasoning to create an innovative intelligent system for interior conditions control. The developed system, tested in an old brewery building, showed efficient dehumidification results at a low cost.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Information Systems

Feature Selection Using Selective Opposition Based Artificial Rabbits Optimization for Arrhythmia Classification on Internet of Medical Things Environment

G. S. Nijaguna, N. Dayananda Lal, Parameshachari Bidare Divakarachari, Rocio Perez de Prado, Marcin Wozniak, Raj Kumar Patra

Summary: This research combines the Internet of Medical Things and artificial intelligence to develop a method for monitoring and diagnosing cardiac arrhythmia. By extracting various features from electrocardiogram signals and using an Auto Encoder and Selective Opposition algorithm, a classification system is built. The classified results are interpreted using the Shapley additive explanations. The experimental results show that the proposed method achieves higher accuracy compared to other existing methods.

IEEE ACCESS (2023)

No Data Available