4.7 Article

Bagged support vector machines for emotion recognition from speech

期刊

KNOWLEDGE-BASED SYSTEMS
卷 184, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.knosys.2019.104886

关键词

Speech emotion recognition; Machine learning; Ensemble learning

资金

  1. Infosys Center for AI, IIIT-Delhi
  2. ECRA Grant by SERB, Government of India [ECR/2018/002449]

向作者/读者索取更多资源

Speech emotion recognition, a highly promising and exciting problem in the field of Human Computer Interaction, has been studied and analyzed over several decades. It concerns the task of recognizing a speaker's emotions from their speech recordings. Recognizing emotions from speech can go a long way in determining a person's physical and psychological state of well-being. In this work we performed emotion classification on three corpora the - Berlin EmoDB, the Indian Institute of Technology Kharagpur Simulated Emotion Hindi Speech Corpus (IITKGP-SEHSC), and the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS). A combination of spectral features was extracted from them which was further processed and reduced to the required feature set. Ensemble learning has been proven to give superior performance compared to single estimators. We propose a bagged ensemble comprising of support vector machines with a Gaussian kernel as a viable algorithm for the problem at hand. We report the results obtained on the three datasets mentioned above. (C) 2019 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据