4.7 Article

Quantum-inspired multimodal fusion for video sentiment analysis

期刊

INFORMATION FUSION
卷 65, 期 -, 页码 58-71

出版社

ELSEVIER
DOI: 10.1016/j.inffus.2020.08.006

关键词

Multimodal sentiment analysis; Quantum theory; Machine learning

资金

  1. Quantum Information Access and Retrieval Theory (QUARTZ) project from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant [721321]

向作者/读者索取更多资源

The study addresses the challenge of feature fusion for multimodal sentiment analysis by proposing a new quantum-inspired framework. Utilizing a complex-valued neural network, the model achieves comparable results to state-of-the-art systems in benchmarking video sentiment analysis datasets.
We tackle the crucial challenge of fusing different modalities of features for multimodal sentiment analysis. Mainly based on neural networks, existing approaches largely model multimodal interactions in an implicit and hard-to-understand manner. We address this limitation with inspirations from quantum theory, which contains principled methods for modeling complicated interactions and correlations. In our quantum-inspired framework, the word interaction within a single modality and the interaction across modalities are formulated with superposition and entanglement respectively at different stages. The complex-valued neural network implementation of the framework achieves comparable results to state-of-the-art systems on two benchmarking video sentiment analysis datasets. In the meantime, we produce the unimodal and bimodal sentiment directly from the model to interpret the entangled decision.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据