☆ 4.6 Article

On the role of multimodal learning in the recognition of sign language

MULTIMEDIA TOOLS AND APPLICATIONS (2019)

Journal

MULTIMEDIA TOOLS AND APPLICATIONS

Volume 78, Issue 8, Pages 10035-10056

Publisher

SPRINGER

DOI: 10.1007/s11042-018-6565-5

Keywords

Sign language recognition; Multimodal learning; Convolutional neural networks; Kinect; Leap motion

Funding

Protect NanoSTIMA: Macro-to-Nano Human Sensing: Towards Integrated Multimodal Health Monitoring and Analytics - North Portugal Regional Operational Programme (NORTE 2020), under PORTUGAL 2020 Partnership Agreement [NORTE010145-FEDER000016]
European Regional Development FUND (ERDF)
FundacAo para a Ciencia e a Tecnologia (FCT) [SFRH/BD/102177/2014, SFRH/BPD/101439/2014]
Fundação para a Ciência e a Tecnologia [SFRH/BD/102177/2014] Funding Source: FCT

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Sign Language Recognition (SLR) has become one of the most important research areas in the field of human computer interaction. SLR systems are meant to automatically translate sign language into text or speech, in order to reduce the communicational gap between deaf and hearing people. The aim of this paper is to exploit multimodal learning techniques for an accurate SLR, making use of data provided by Kinect and Leap Motion. In this regard, single-modality approaches as well as different multimodal methods, mainly based on convolutional neural networks, are proposed. Our main contribution is a novel multimodal end-to-end neural network that explicitly models private feature representations that are specific to each modality and shared feature representations that are similar between modalities. By imposing such regularization in the learning process, the underlying idea is to increase the discriminative ability of the learned features and, hence, improve the generalization capability of the model. Experimental results demonstrate that multimodal learning yields an overall improvement in the sign recognition performance. In particular, the novel neural network architecture outperforms the current state-of-the-art methods for the SLR task.

On the role of multimodal learning in the recognition of sign language

Journal

MULTIMEDIA TOOLS AND APPLICATIONS

Publisher

SPRINGER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

On the role of multimodal learning in the recognition of sign language

Journal

MULTIMEDIA TOOLS AND APPLICATIONS

Publisher

SPRINGER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper