4.6 Article

Real-Time Hand Gesture Spotting and Recognition Using RGB-D Camera and 3D Convolutional Neural Network

期刊

APPLIED SCIENCES-BASEL
卷 10, 期 2, 页码 -

出版社

MDPI
DOI: 10.3390/app10020722

关键词

hand gesture spotting and recognition; 3DCNN; human-computer interaction

资金

  1. Basic Science Research Program through the National Research Foundation of Korea (NRF) - Ministry of Education [NRF-2018R1D1A3B05049058, NRF-2017R1A4A1015559]

向作者/读者索取更多资源

Using hand gestures is a natural method of interaction between humans and computers. We use gestures to express meaning and thoughts in our everyday conversations. Gesture-based interfaces are used in many applications in a variety of fields, such as smartphones, televisions (TVs), video gaming, and so on. With advancements in technology, hand gesture recognition is becoming an increasingly promising and attractive technique in human-computer interaction. In this paper, we propose a novel method for fingertip detection and hand gesture recognition in real-time using an RGB-D camera and a 3D convolution neural network (3DCNN). This system can accurately and robustly extract fingertip locations and recognize gestures in real-time. We demonstrate the accurateness and robustness of the interface by evaluating hand gesture recognition across a variety of gestures. In addition, we develop a tool to manipulate computer programs to show the possibility of using hand gesture recognition. The experimental results showed that our system has a high level of accuracy of hand gesture recognition. This is thus considered to be a good approach to a gesture-based interface for human-computer interaction by hand in the future.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Information Systems

Real-time virtual mouse system using RGB-D images and fingertip detection

Dinh-Son Tran, Ngoc-Huynh Ho, Hyung-Jeong Yang, Soo-Hyung Kim, Guee Sang Lee

Summary: The study introduces a novel virtual-mouse method using RGB-D images and fingertip detection, which can track fingertip location in real-time and operate efficiently in real-world environments, enabling easy interaction with computers through gestures.

MULTIMEDIA TOOLS AND APPLICATIONS (2021)

Article Chemistry, Multidisciplinary

Multi-Task Learning for Medical Image Inpainting Based on Organ Boundary Awareness

Minh-Trieu Tran, Soo-Hyung Kim, Hyung-Jeong Yang, Guee-Sang Lee

Summary: The restoration of medical images is crucial for accurate diagnosis, and the multi-task learning model proposed in this paper achieves innovative results in medical image inpainting, generating more realistic and believable images. The method shows excellent performance in quantitative evaluation and brings new possibilities to the field of medical image analysis.

APPLIED SCIENCES-BASEL (2021)

Article Medicine, General & Internal

Multi-Level Seg-Unet Model with Global and Patch-Based X-ray Images for Knee Bone Tumor Detection

Nhu-Tai Do, Sung-Taek Jung, Hyung-Jeong Yang, Soo-Hyung Kim

Summary: This study aims to classify and segment knee bone tumors using deep learning, proposing the Seg-Unet model and building a corresponding dataset, with experiments showing superior performance in accuracy.

DIAGNOSTICS (2021)

Article Chemistry, Analytical

EEG-Based Emotion Recognition by Convolutional Neural Network with Multi-Scale Kernels

Tran-Dac-Thinh Phan, Soo-Hyung Kim, Hyung-Jeong Yang, Guee-Sang Lee

Summary: This study focuses on emotion recognition based on EEG signals, utilizing time domain features and the correlation between 32 channels and frequency bands to enhance prediction performance. Using a 2D CNN structure to process EEG signals enables learning of relationships between local and global patterns of channels.

SENSORS (2021)

Article Chemistry, Analytical

Esophagus Segmentation in CT Images via Spatial Attention Network and STAPLE Algorithm

Minh-Trieu Tran, Soo-Hyung Kim, Hyung-Jeong Yang, Guee-Sang Lee, In-Jae Oh, Sae-Ryung Kang

Summary: This study proposes a fully automated framework for esophagus segmentation, addressing the challenges of its small size, ambiguous boundary, and low contrast in CT images. By utilizing spatial attention mechanism and advanced modules, the method effectively locates the esophagus and enhances segmentation performance. Optimization using the STAPLE algorithm improves Dice and Hausdorff Distance scores for segmentation results.

SENSORS (2021)

Article Chemistry, Multidisciplinary

3D Skeletal Joints-Based Hand Gesture Spotting and Classification

Ngoc-Hoang Nguyen, Tran-Dac-Thinh Phan, Soo-Hyung Kim, Hyung-Jeong Yang, Guee-Sang Lee

Summary: This paper introduces a novel approach to continuous dynamic hand gesture recognition, utilizing two main modules: gesture spotting and gesture classification. By combining different data channels and conducting experiments on multiple public datasets, the approach shows promising performance improvements in gesture recognition.

APPLIED SCIENCES-BASEL (2021)

Article Computer Science, Artificial Intelligence

Predicting progression of Alzheimer's disease using forward-to-backward bi-directional network with integrative imputation

Ngoc-Huynh Ho, Hyung-Jeong Yang, Jahae Kim, Duy-Phuong Dao, Hyuk-Ro Park, Sudarshan Pant

Summary: This study proposes a progressive recurrent network model for predicting clinical diagnoses and phenotypic measurements of Alzheimer's disease patients using longitudinal data. The model utilizes magnetic resonance imaging data and an imputation module to compensate for missing observations in the data.

NEURAL NETWORKS (2022)

Article Chemistry, Multidisciplinary

ROS-Based Unmanned Mobile Robot Platform for Agriculture

Eu-Tteum Baek, Dae-Yeong Im

Summary: The paper proposes a method to design and operate a mobile robot platform for use in a greenhouse, which is expected to increase productivity and reduce labor costs.

APPLIED SCIENCES-BASEL (2022)

Article Computer Science, Information Systems

The Efficacy of Shape Radiomics and Deep Features for Glioblastoma Survival Prediction by Deep Learning

Dang-Linh Trinh, Soo-Hyung Kim, Hyung-Jeong Yang, Guee-Sang Lee

Summary: Glioblastoma is a highly aggressive brain malignancy, and accurate survival prediction is crucial for diagnosis and treatment planning. This study demonstrates the effectiveness of features extracted from whole tumor and enhancing tumor in predicting overall survival. Optimal shape radiomics features and deep features are selected and combined with clinical information to build a regression model. The proposed method achieves promising results in accuracy and mean squared error metrics.

ELECTRONICS (2022)

Article Chemistry, Analytical

Self-Relation Attention and Temporal Awareness for Emotion Recognition via Vocal Burst

Dang-Linh Trinh, Minh-Cong Vo, Soo-Hyung Kim, Hyung-Jeong Yang, Guee-Sang Lee

Summary: This paper proposes a self-relation attention and temporal awareness (SRA-TA) module to tackle the problem of emotion recognition via non-verbal speech. The proposed method achieves a high performance on the test set, ranking first in the 2022 ACII Affective Vocal Burst Workshop & Challenge.

SENSORS (2023)

Article Chemistry, Analytical

Robust Human Face Emotion Classification Using Triplet-Loss-Based Deep CNN Features and SVM

Irfan Haider, Hyung-Jeong Yang, Guee-Sang Lee, Soo-Hyung Kim

Summary: Human facial emotion detection is a challenging task in computer vision. This paper proposes a novel and intelligent approach for emotion classification. It uses a customized ResNet18 model with transfer learning and triplet loss function, combined with an SVM classifier to accurately predict facial emotions. Experimental results show that the proposed method achieves high accuracy on JAFFE and MMI datasets.

SENSORS (2023)

Article Health Care Sciences & Services

Survival Prediction Using Transformer-Based Categorical Feature Representation in the Treatment of Diffuse Large B-Cell Lymphoma

Sudarshan Pant, Sae-Ryung Kang, Minhee Lee, Pham-Sy Phuc, Hyung-Jeong Yang, Deok-Hwan Yang

Summary: This study aims to develop a robust survival prediction strategy for diffuse large B-cell lymphoma (DLBCL) using a deep-learning-based approach. The proposed model combines clinical risk factors and Deauville scores in positron-emission tomography/computed tomography at different treatment stages. Results show that the model outperforms existing methods in terms of survival time estimation. The use of Deauville scores during treatment improves prognostic accuracy.

HEALTHCARE (2023)

Article Computer Science, Information Systems

Cross-Modality Learning by Exploring Modality Interactions for Emotion Reasoning

Thi-Dung Tran, Ngoc-Huynh Ho, Sudarshan Pant, Hyung-Jeong Yang, Soo-Hyung Kim, Gueesang Lee

Summary: Humans can determine subtle emotions from various indicators and surroundings. However, existing research on emotion recognition mainly focuses on recognizing the emotions of speakers. Thus, this paper proposes a novel multimodal approach for predicting emotions from missing modalities.

IEEE ACCESS (2023)

Review Health Care Sciences & Services

A Review of Machine Learning and Deep Learning Approaches on Mental Health Diagnosis

Ngumimi Karen Iyortsuun, Soo-Hyung Kim, Min Jhon, Hyung-Jeong Yang, Sudarshan Pant

Summary: Machine learning approaches have been used in the diagnosis and prediction of treatment outcomes for mental health conditions. This study reviewed 33 articles on the use of machine learning and deep learning technologies for the diagnosis of various mental disorders, including schizophrenia, depression, anxiety, bipolar disorder, PTSD, anorexia nervosa, and ADHD. The researchers also discussed the challenges they encountered and provided a list of public datasets.

HEALTHCARE (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Survival Analysis based on Lung Tumor Segmentation using Global Context-aware Transformer in Multimodality

Duy-Phuong Dao, Hyung-Jeong Yang, Ngoc-Huynh Ho, Sudarshan Pant, Soo-Hyung Kim, Guee-Sang Lee, In-Jae Oh, Sae-Ryung Kang

Summary: Lung cancer is the most common type of cancer, and its incidence among young people is increasing. Early diagnosis and treatment are crucial for successful recovery. This study proposes a tumor segmentation model called MAPTransNet and a survival analysis model called MSNet to assist in the diagnosis and prediction of lung cancer patients.

2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) (2022)

暂无数据