4.7 Article

Multi-label, multi-task CNN approach for context-based emotion recognition

Journal

INFORMATION FUSION
Volume 76, Pages 422-428

Publisher

ELSEVIER
DOI: 10.1016/j.inffus.2020.11.007

Keywords

Emotion recognition; Loss function; Multi-task machine learning; Deep learning; Unbalanced data

Funding

  1. Polytechnic University of Haut de France
  2. Haut de France region

This paper proposes a new deep learning architecture for context-based multi-label multi-task emotion recognition, centered on a new loss function, the multi-label focal loss (MFL). Experimental results show that combining MFL with the Huber loss performs best, outperforming the other loss combinations tested, particularly on the less frequent labels.
This paper proposes a new deep learning architecture for context-based multi-label multi-task emotion recognition. The architecture is built from three main modules: (1) a body feature extraction module, which is a pre-trained Xception network, (2) a scene feature extraction module, based on a modified VGG16 network, and (3) a fusion-decision module. Moreover, three categorical and three continuous loss functions are compared to highlight the importance of the synergy between loss functions in multi-task learning. We then propose a new loss function, the multi-label focal loss (MFL), based on the focal loss, to deal with imbalanced data. Experimental results on the EMOTIC dataset show that MFL combined with the Huber loss gave better results than any other combination and outperformed the current state of the art on the less frequent labels.
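
The abstract does not give the MFL formula, so the following is only a rough sketch of how a focal-style multi-label loss might be paired with a Huber loss in a multi-task objective. It assumes the standard binary focal loss applied independently to each label through a sigmoid, and a plain weighted sum of the two task losses. The function names, the `gamma` and `alpha` defaults, and the weights `w_cat` and `w_cont` are illustrative assumptions, not the authors' definitions.

```python
import math


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def multi_label_focal_loss(logits, targets, gamma=2.0, alpha=0.25):
    """Assumed form: binary focal loss per label, summed over labels.

    The (1 - p_t) ** gamma factor down-weights labels the model already
    predicts well, so rare or hard labels contribute more to the total.
    """
    eps = 1e-12  # guards against log(0)
    total = 0.0
    for z, y in zip(logits, targets):
        p = sigmoid(z)
        p_t = p if y == 1 else 1.0 - p          # probability of the true class
        a_t = alpha if y == 1 else 1.0 - alpha  # class-balancing weight
        total += -a_t * (1.0 - p_t) ** gamma * math.log(p_t + eps)
    return total


def huber_loss(preds, targets, delta=1.0):
    """Mean Huber loss: quadratic for small residuals, linear for large ones."""
    total = 0.0
    for p, t in zip(preds, targets):
        r = abs(p - t)
        total += 0.5 * r * r if r <= delta else delta * (r - 0.5 * delta)
    return total / len(preds)


def combined_loss(cat_logits, cat_targets, cont_preds, cont_targets,
                  w_cat=1.0, w_cont=1.0):
    # Hypothetical weighted sum of the categorical and continuous task losses.
    return (w_cat * multi_label_focal_loss(cat_logits, cat_targets)
            + w_cont * huber_loss(cont_preds, cont_targets))
```

With `gamma=0` and `alpha=0.5` the categorical term reduces to half the ordinary per-label binary cross-entropy, which makes it easy to see what the focal factor adds: raising `gamma` shrinks the contribution of confidently correct labels while leaving poorly predicted ones close to their cross-entropy value.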
